Dynamics of Language An Introduction
Dynamics of Language An Introduction
net/publication/298544659
CITATIONS READS
39 7,373
3 authors:
Lutz Marten
SOAS, University of London
74 PUBLICATIONS 637 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
The Dynamics of Language: Dynamic Syntax as a formal model of syntax View project
All content following this page was uploaded by Ruth Margaret Kempson on 19 May 2014.
final draft
iv
Contents
v
vi CONTENTS
10 REFERENCES 353
viii CONTENTS
11 Index 367
CONTENTS ix
Preface
For the whole of the last half-century, all of us studying language from a formal
perspective have assumed that knowledge of language is different from the tasks
of speaking and understanding. This idea was first introduced by Saussure at
the turn of the century, and later recast by Chomsky (1965) as the distinction
between competence and performance. On both these views knowledge of language
is some capacity identifiable through abstraction away from the use of language in
communication and languages, hence language itself, have to be studied entirely
independently of the way they are used, because explaining the phenomena of
speaking and understanding a language have to make reference to this use-neutral
knowledge which underpins them. During the course of the century, there were
some dissenters, but, since no alternative was ever formally articulated, by and
large, this view held total sway.
This book takes a different tack: it continues the task set in hand by Kempson
et al. (2001) of arguing to the contrary that the common-sense intuition is correct
and that knowledge of a language consists in being able to use it in speaking and
understanding. What we argue is that defining a formal model of how interpretation
is built up across a sequence of words relative to some context in which it might
be uttered is all that is needed to explain structural properties of language. The
dynamics of how interpretation is built up just is the syntax of a language system.
The capacity for language is thus having in place the architecture which enables
us to produce and understand language input in context. What is central to the
explanation, through every twist and turn of the formal intricacies, is the faithful
reflection of the dynamics of processing a string: bit by bit, each unit of information
that constitutes the next input picks up on what is presented before and adds to it
to provide the context for the next addition to be processed.
Given this commitment, such a model might look like a new variant of use-
theories of meaning, a continuation of the later Wittgenstein’s admonition that all
there is to meaning is to look at the usage, nothing more need be said. However,
out of this earlier work, a resolutely anti-formalist stance developed, so that this
tradition is in danger of being anti-theory, dismissing the need for any formal expla-
nation. What we are arguing, however, is that there is room for views in between
these extremes. Our proposal is that properly formal theories of language can be
articulated in terms of the dynamics of how a language enables interpretation to
be progressively built up. Against the use-theory of tradition, we are committed
to the same methodology of previous grammar-writers in the tradition of theoreti-
cal linguistics that a grammar must define what it means for an expression of the
language to be well-formed, and that it must couch such explanations in terms of
distributional properties of the expressions of the language. Indeed, just like anyone
else in this tradition, we present a formal framework for articulating such struc-
tural properties of language. In this respect, the structure of the book is within
the tradition of formal linguistic modelling: we introduce the formal framework; we
apply it to individual analyses in individual languages to show that it provides a
reasonable way of addressing standard structural properties of that language; and
from there we enlarge the horizon to show how it can be used to articulate gen-
eral properties across languages. In particular we show how it is competitive in
addressing a whole range of phenomena that are extremely puzzling seen from the
perspective of other frameworks. In all these respects, the methodology is familiar.
x CONTENTS
However, there is a very different twist, because the formal framework we present
has an attribute which grammars of the past century, with a few honourable excep-
tions, have not had. The formalism which we present as the basis for articulating
grammars of natural language is able to define processes of growth of information
across sequences of expressions; and we argue that natural-language grammars, by
definition, reflect the dynamics of real time processing in context. In essence, we
take syntax to be a process, the process by which information is extracted from
strings of words uttered in some context. This stance not only leads to different
analyses of individual phenomena, and a commitment to very different forms of
explanation; but it opens up a whole new debate about the nature of language,
the formal properties of natural-language models, and the relation of language to
explanations of the way linguistic ability is nested within a more general cognitive
perspective.
The formal framework of Dynamic Syntax was set out in Kempson et al. (2001),
but in writing this book we have had different objectives. First, our task has been
to convey to a general linguistic audience with a minimum of formal apparatus, the
substance of that formal system, without diminishing the content of the explana-
tions. Secondly, as linguists, we set ourselves the task of applying the formal system
defined to as broad an array of linguistic puzzles as possible. On the one hand, we
have covered the kind of data other books might cover – problems displayed by
English. On the other hand, we have shown how the very same system can be used
to develop a detailed account of Japanese as a proto-typical verb-final language
showing that from the perspective we set out, Japanese looks just as natural as any
other language. This is a good result: verb-final languages comprise about half the
world’s languages; yet, to all other current theoretical frameworks, they present a
cluster of properties that remain deeply puzzling. Indeed, on looking back from this
new perspective in which they fall so naturally into the same type of explanation,
it is at the very least surprising that the community of linguists should have been
content for so long with theoretical frameworks that found these languages such a
challenge. We have also looked in some detail at the intricacies of the agreement
system in Swahili, one of a language family – Bantu – which seems to pose quite
different problems yet again from either of these two languages. Having set out a
range of detailed analyses, and a demonstration of how each fits into general ty-
pologies for language structure, the book then closes with reflecting on the novel
perspective which it opens up, showing how old questions get new answers, and
new challenges emerge. Amongst these is the demonstration that with the gap be-
tween a formal competence model and a formal performance model being so much
narrowed, we can fruitfully address challenges set by psycholinguists. One of these
is the challenge to provide a basis for modelling dialogue, the core phenomenon of
language use; and we argue that the Dynamic Syntax framework provides a good
basis for modelling the free and easy interchange between the tasks of speaking and
understanding, displayed in dialogue.
In writing such a book, it is always more fruitful to write as a community, and,
first of all we would like to take the opportunity of thanking each other for having
got ourselves to a point where we none of us feel we would have got to without the
other. As one might expect from such collaboration, the book has spawned other
publications during the process of its completion. Parts of chapter 3 were written up
as Kempson (2003), Kempson and Meyer-Viol (2004), parts of chapters 4-5 as Cann
et al. (2003), Cann et al. (2004), Cann et al. (2005), Kempson, Kiaer and Cann
(forthcoming), Kempson, Cann and Kiaer (2004). Part of chapter 6 is written up as
Kempson and Kiaer (2004), Kempson (2005), parts of chapter 7 as Marten (2000,
2003, 2005), parts of chapter 8 as Cann (forthcoming a,b) The implementation of
CONTENTS xi
1
2 CHAPTER 1. THE POINT OF DEPARTURE
time, but it does not take much scratching at the surface before the problems start
to emerge.
Within the perspective which the current methodology imposes, there are two
central properties displayed by all languages; and these pose a recognised major
challenge for theoretical explanations of language. On the one hand there is the
compositionality problem of how words and what they are taken to mean can be
combined into sentences across an indefinite array of complexity. We have various
means of saying the same thing, using words in a variety of orders:
(1.1) Tomorrow I must see Bill.
(1.2) Bill I must see tomorrow.
And any one of these sequences can always be embedded at arbitrary depths of
complexity:
(1.3) You insisted I think that Harry I must interview today and Bill I must see
tomorrow.
(1.4) The friend of mine who told me that Sue was insisting that tomorrow I
must see Bill was lying.
On the other hand, there is the problem of context-dependence. Pronouns are
a familiar case: they have to be understood by picking up their interpretation
from some other term in some sense recoverable from the context in which they
are uttered. However the phenomenon is much, much more general. Almost every
expression in language displays a dependence on the context in which the expression
might occur. The effect is that any one sentence can be taken to express a vast
number of different interpretations. Even our trivial examples (1.1)-(1.2) differ
according to who is speaker, who Bill is, and when the sentence is uttered. The first
challenge, then, to put it another way, is to be able to state the interaction between
order of words and interpretation within a sentence. The second challenge is to
explain how the interpretation of words may be related to what has gone before
them, either within the sentence itself or in some previous sentence.
The types of solution for addressing these problems are almost never challenged.
The first problem, right across theoretical frameworks, is taken to require a syntac-
tic solution. The second, equally uniformly, is taken to present a semantic problem,
only. These challenges are then taken up by different breeds of theoretical linguist:
syntactician on the one hand, semanticist (or pragmaticist) on the other. Given
this separation, we might reasonably expect that two-way feeding relations between
the phenomena would be precluded. Indeed systematic interaction is virtually in-
expressible given many of the solutions that are proposed. Yet such expectation
would be entirely misplaced. There is systematic interaction between the two phe-
nomena, as we shall shortly see, with context-dependent expressions feeding into
structural processes in a rich assortment of ways.
The significance of this interaction is, we believe, not sufficiently recognised. The
phenomena themselves have not gone unnoticed, but their significance has. Such
interactions as are identified are generally analysed as an exclusively syntactic phe-
nomenon, with those expressions that display interaction with syntactic processes,
characteristically pronouns, analysed as having distinct forms, one of which is sub-
ject to syntactic explanation, the other semantic. We shall see lots of these during
the course of this book. The result is that the phenomena may be characterised by
the various formalisms, but only by invoking suitable forms of ambiguity; but the
1.1. COMPOSITIONALITY AND RECURSION IN NATURAL LANGUAGE 3
underlying significance of the particular forms of interaction displayed has gone al-
most entirely uncommented upon (though exceptions are beginning now to emerge:
Boeckx (2003), Asudeh (2004)).
What we shall argue to the contrary is that what these systematic interactions
show is that the problem of compositionality and the problem of context-dependence
are not independent: syntactic and semantic aspects of compositionality, together
with the problem of context dependence, have to be addressed together. Making
this move, however, as we shall demonstrate, involves abandoning the method-
ological assumption of use-neutral grammar formalisms in favour of a grammar
formalism which directly reflects the time-linearity and context-dependent growth
of information governing natural language parsing.1 Intrinsic properties defining
language, we shall argue, reflect the way language is used in context so directly that
the structural properties of individual languages can all be explained in terms of
growth of structure relative to context. We shall argue accordingly that parsing is
the basic task for which the language faculty is defined; and that both syntactic and
semantic explanations need to be rethought in terms of the dynamics of language
processing.
Setting out the argument is the burden of the entire book, together with the
presentation of the framework of Dynamic Syntax (Kempson et al. 2001). In
this first chapter, by way of preliminary, we set out these two challenges in some
detail, provide some preliminary justification for the stance we take, and give some
indication of the type of analysis we shall provide.
for natural languages that, in defining well-formedness of strings in terms of possible continuations,
also induce strings on a left-right basis (Hausser 1989, 1998).
4 CHAPTER 1. THE POINT OF DEPARTURE
noun phrase to yield a sentence, or, as a closer rendering of what the rules actually
determine, a sentence may be made up of an NP and a VP, and a VP can be made
up of a Verb and an NP. By a very general convention, any realisation of some
application of these rules gets represented as a tree structure into which the words
are taken to fit:2
S → NP VP S = sentence
VP → V NP VP = verb phrase
(1.5)
NP = noun phrase
V = verb
NP VP
V NP
John
upset
Mary
As the starting point for articulating a semantics for such structured strings,
we might assume that there are “semantic” rules which determine how meanings
for sentences are built up on the basis of the meanings of their words and such
structural arrangements. So we might express what it is that (1.5) means on the
basis of first knowing that upsetting is a relation between individuals – something
that people can do to us – that the term Mary refers to some individual bearing
that name, John to some individual called by the name ‘John’. What knowing
the meaning of the sentence then involves, is having the capacity to combine these
pieces of information together following the structure of the sentence to yield the
assertion that John upset Mary (not the other way round). Somewhat more for-
mally, providing the semantic characterisation of a sentence involves specifying the
conditions which determine the truth of any utterance of that sentence; and, as part
of this programme, the meaning of a word is defined as the systematic contribution
made by its interpretation within the sentences in which it occurs. The semantic
rules that guarantee this rely on an independently articulated syntax. So, for (1.5),
following the syntactic structure, the meaning assigned to the verb upset applies as
a function to the individual denoted by the name Mary to yield the property of up-
setting Mary, which in its turn applies to the individual denoted by the name John
in subject position to yield the proposition that John upset Mary at some time in
the past, relative to whatever is taken to be the time of utterance.3 The example we
have given is trivial. But the underlying assumptions about design of the grammar
are extremely widespread. This pattern of defining a syntax that induces structure
over the strings of the language, with an attendant truth-conditional semantics
that is defined in tandem with the defined syntactic operations, has been a guid-
ing methodology for the last thirty odd years, even though individual frameworks
disagree over the detailed realisation of such formalisations.
2 In some frameworks, S corresponds to IP or CP (Inflection Phrase or Complementiser Phrase),
semantics interaction, based on ideas taken from Generalized Phrase Structure Grammar, see
Gazdar et al. (1985).
1.1. COMPOSITIONALITY AND RECURSION IN NATURAL LANGUAGE 5
The particular toy grammar we have given here does not allow for recursion.
But on the assumption that we introduce appropriate syntactic rules to allow for
recursion, this model will have the advantage of being able to reflect the infinitely
recursive property for both syntax and semantics, given the definition of semantic
modes of combination running on the back of the defined syntactic rules.4 As a
reflection of the systematicity of this syntax-semantics relation, we might dub this
view “the jigsaw view of language”: each word and its meaning add together with
its neighbouring words and their meaning to give bigger and bigger pieces which fit
together to form a sentence plus meaning pair for a sentence when all these various
parts are combined together. Knowledge of language, then, constitutes knowledge
of this set of principles; and, accordingly, models of language use will have to take
such models as their point of departure.
fully appropriate mnemonics were coined for labelling distinct syntactic constructions - island
constraints, the Right Roof Constraint (Ross 1967), Crossover phenomena (Postal 1970), etc.
6 CHAPTER 1. THE POINT OF DEPARTURE
The subject of obvious, which we might expect on the pattern of (1.7) to occur
before the predicate word obvious, is the clause that I am wrong (this is what
is obvious); but all we have in subject position is the word it as some kind of
promissory note, itself giving much less information than we need, merely acting as
a wait-and-see device.
To add to the puzzle, there are restrictions on these correlations which do not
seem to be reducible to any semantic considerations. So while establishing a connec-
tion between a left dislocated expression and some position in a string indefinitely
far from this initial position is possible across certain clause boundaries, it is not
across others. Thus, (1.9) is well-formed where each clause is the complement of
some verb, while (1.10)-(1.11) are not. In the latter cases, this is because the
containing clause inside which the removed item is intended to be correlated is a
relative clause.
(1.9) That book of Mary’s, Tom tells me he is certain you can summarise
without difficulty.
(1.10) *That book of Mary’s, Tom tells me he has met the author who wrote.
(1.11) *The book which Tom told me he had met the author who wrote was very
boring.
Left dislocation, though subject to constraints, nevertheless may involve the
indefinite “removal” of an expression from its interpretation site. Analogous “re-
moval” to some right-periphery position is very much more restricted, however.
Dislocation from the point at which the expression has to be interpreted can only
be within the domain provided by the clause in which the expression is contained,
and not any further. This is shown by (1.12)-(1.15):
(1.12) It was obvious I was wrong.
(1.13) That it was obvious I was wrong was unfortunate.
(1.14) It was unfortunate that it was obvious I was wrong.
(1.15) *That it was obvious was unfortunate I was wrong.
(1.12) shows that a right peripheral clausal string like I was wrong can be associated
with an expletive pronoun, it, in subject position, providing the content of that
pronoun. (1.13) shows that this form of construction can be contained within
others. (1.14) shows that this process can be repeated, so that the main subject
pronoun can be provided with its interpretation by the string that it was obvious I
was wrong.
What is completely impossible, however, is the establishment of a correlation
between a right dislocated clausal string such as I was wrong with expletive it which
is subject, not of the matrix clause, but of the clausal subject, That it was obvious.
As (1.16) illustrates, this is the structure exhibited by (1.15) which is completely
ungrammatical and any meaning for the sentence as a whole is totally unrecoverable.
That it might mean the same as (1.13)-(1.14) is impossible to envisage. This is
despite the fact that the clausal sequences are all arguments which, as noted above,
freely allow leftward movement.
(1.16) *[ That iti was obvious ] was unfortunate [I was wrong]i .
Such right peripheral displacements are said to be subject to a “Right Roof Con-
straint” (Ross 1967), imposing a strict locality restriction on how far the clausal
1.1. COMPOSITIONALITY AND RECURSION IN NATURAL LANGUAGE 7
string can be removed to the right from its corresponding pronoun, limiting this to
a single, containing clause.
It was the existence of such apparently semantically blind restrictions that led
in the 70’s to agreement across otherwise warring theoretical factions that syntactic
and semantic explanations could not be expressed in the same terms;6 and follow-
ing this agreement, a number of grammar formalisms were articulated (Gazdar et
al. 1985, Bresnan 2001, Sag and Wasow 1999) all of which presumed on the inde-
pendence of theoretical vocabulary and formal statements to be made for syntactic
and semantic generalisations.7 More idiomatically, as Chomsky has expressed it
(Chomsky 1995), natural languages seem in some sense not to be “perfect”, and
this in itself is a challenge which linguists have to address. A core property of
natural languages lies in these irreducibly syntactic properties - those which, in
minimalist terms, cannot be reduced to properties of either the semantic or phonol-
ogy interface; and it is these above all which Chomsky pinpoints as evidence of the
uniquely defining structural properties of natural language.
(Montague 1974) in which syntax and semantics were expressed directly in tandem, following
earlier articulation of categorial grammar formalisms in which this is true by definition (Lambek
1961, Bar Hillel 1964 and many others since). Though this assumption was agreed to be unsus-
tainable, given the existence in particular of strong island constraints such as extractability from
complex structures to the left periphery (eg. The Complex NP Constraint), and no-extractability
from complex structures to the right periphery (The Right Roof Constraint), the methodology
of striving to establish denotational underpinnings for structural properties of language continues
to this day, attempting in so far as possible to preserve the strongest form of compositionality
of meaning for natural language expressions, by invoking lexical and structural ambiguity for all
departures from it.
7 Categorial grammar formalisms preserve the Montague methodology in its purest form, defin-
ing a mapping from natural-language string onto model-theoretically defined denotations without
any essential invocation of a level of representation, using an array of lexical type assignments
and operations on these types which determine compositionality of meaning despite variability in
word order (Ranta 1994, Morrill 1994, Steedman 1996, 2000, Moortgat 1988). Lexical Functional
Grammar (LFG) is the only framework which systematically abandons any form of composition-
ality defined directly on the string of expressions, for c-structure (the only level defined in terms of
a tree-structure configuration inhabited by the string of words) is motivated not by denotational
considerations, but solely by distribution (Bresnan 2001, Dalrymple 2001).
8 CHAPTER 1. THE POINT OF DEPARTURE
b. Though John and Mary adored each other, he married Sue. The only
time they subsequently met, he upset her so badly she was glad he
had married Sue, not her.
Despite the fact that he upset her is in much the same context in (1.18.a,b), never-
theless in (1.18.a) her means ‘Sue’ and in (1.18.b) it means ‘Mary’, due to the use of
they in (1.18.b) and the assertion that whoever they are had only met once subse-
quent to the marriage, which rules out interpreting her as picking out Sue (at least
given conventional marriages). The problem is that the assumption that semantics
is given by articulating the contribution each word makes to the conditions under
which the containing sentence is true would seem to lead to the assertion that all
sentences containing pronouns are systematically ambiguous: the conditions under
which John upset Mary are not the same as the conditions under which John upset
Sue, ‘Mary’ and ‘Sue’ being the discrete interpretations associated with the pro-
noun her. Ambiguity defined in terms of truth conditions holds when an expression
makes more than one contribution to the truth conditions of the sentence in which
it is contained. The word bank is an obvious example, any one use of the word in
a sentence systematically providing more than one set of truth conditions for the
containing sentence. Problematically, pronouns seem to be showing the same prob-
lem; yet we do not want to be led to the conclusion that a pronoun is an ambiguous
word. For this leaves us no further forward in expressing what it is that constitutes
the meaning of an anaphoric expression: it simply lists the different interpretations
it can have, and there are too many for this to be plausible.
There is a more general problem than this. Pronouns stand in different types of
relation to the denotational content they can be assigned in context, making it look
as though the ambiguity analysis of pronouns is unavoidable in principle. In (1.19),
the pronoun is construed as a bound variable, that is, a variable to be construed
as bound by the quantifier every girl, its interpretation dependent entirely on what
range of girls every is taken to range over:
(1.19) I told every girl that she had done well.
But this is not an appropriate analysis of (1.20). In (1.20), the pronoun is gener-
ally said to be a coreferring pronoun, both it and its antecedent having the same
interpretation, denoting the same individual:
(1.20) Edwina came in. She was sick.
But this form of analysis will not do for the pronoun she in (1.21) - the pronoun
has to be construed as ‘the woman I helped over the road’:
(1.21) I helped an old woman over the road. She thanked me.
But the pronoun in (1.21) is not functioning as a bound-variable, as in (1.19), either.
Different from either of these, the construal of the pronoun in (1.21) involves some
kind of computation over the entire preceding clause.
The particular significance of the differences between (1.19)-(1.21) is that, on
the one hand, the problem does not seem to be reducible to a syntactic problem.
This is not a problem that can be solved by analysing the pronoun as a stand-in
device for the expression that provides its antecedent, since (1.19) and (1.22) mean
entirely different things:
(1.22) I told every girl that every girl had done well.
(1.21) and (1.23) also:
1.1. COMPOSITIONALITY AND RECURSION IN NATURAL LANGUAGE 9
(1.23) I helped an old woman over the road. An old woman thanked me.
Deciding on the most appropriate analysis for this type of pronoun, as is the stan-
dardly acceptable way of describing the problem, has led to an immense amount of
debate; and pronouns that give rise to this form of construal in such contexts are
currently distinguished from other uses by the term E-type pronouns.8
With there being different types of truth conditions which a pronoun can con-
tribute to its containing sentence, it seems that analysing the meaning of a pronoun
in terms of its contributions to the truth conditions of sentences in which it occurs
seems bound to involve diversity, hence ambiguity of the pronoun itself. Yet there is
good reason to think that any solution which freely invokes lexical ambiguity cannot
be right. Not only does this phenomenon of multiple variable-type, referential-type
and E-type interpretations extend systematically across pronouns in all languages.
It also extends systematically across anaphoric expressions in all languages. Here
are a range of paired examples from English, showing the availability of bound
variable and E-type interpretations provided by both the definite article and for
both forms of demonstrative.
(1.24) Every house I have put on the market I have checked, to make sure the
house will not be hard to sell.
(1.25) Most students were there. The entire group got drunk.
(1.26) Every day I drink any wine, I know that later that day I shall have a
migraine.
(1.27) Most people who came early left well before a few people got drunk. That
group were no problem.
(1.28) Every time I don’t take my pills, I think that this time I am better and
will not need them.
(1.29) I went to a party at which several of my friends were playing dominos.
This group were very quiet, though the musicians got very drunk.
The availability of such types of construal also extends to tense, with (1.30) indi-
cating a bound-variable form of construal for the present tense, (1.31) indicating a
referential form of construal, (1.32) an E-type form of construal:
(1.30) Whenever the dog goes out, she pees.
(1.31) The dog went out. She didn’t pee.
(1.32) I saw each student for an hour. I was very patient despite the fact that I
had a headache.
In (1.32), the type of construal is again an E-type form of construal, for the under-
standing of the time interval contributed by was in I was very patient is over the
whole time it took to see the students one after the other.
There is also verb-phrase (VP) anaphora, and nominal anaphora:
(1.33) Even though I write my own songs, most of my friends don’t.
(1.34) Even though I like to sing John’s songs, that particular one isn’t much
good.
8 ‘E’ for existence, so we understand, and not for Gareth Evans, who coined the term (Evans
1980).
10 CHAPTER 1. THE POINT OF DEPARTURE
And this is still not much more than the tip of the iceberg. There are also a number
of different forms of nominal and verbal ellipsis, where the required interpretation
isn’t expressed by words at all, but is picked up directly from the context:
(1.35) Whenever I sing my own songs, Sue refuses to.
(1.36) Even though I like to sing John’s recent songs, not one is as good as most
he had written while a student.
(1.37) She can sing some of my songs, but not John’s.
(1.38) Question: What does Lloyds promote? Answer: Its own interests.
(1.39) What Lloyds promotes is its own interests.
(1.40) She can sing some of my songs, but I haven’t decided which.
Though VP ellipsis has been taken to be a broadly unitary phenomenon since Dal-
rymple et al. 1991, as in (1.33) and (1.35), VP ellipsis and nominal ellipsis (as
in (1.34), (1.36)-(1.37)) are invariably distinguished, with different forms of input
structure. So too are fragmentary answers to questions (1.38), and the correspond-
ing so-called pseudo-cleft structures (1.39), for which a recent analysis has proposed
that the final expression (and correspondingly the fragment answer) is really a full
sentential form ‘Lloyds promotes its own interests’ in which all but the final expres-
sion is deleted (Schlenker 2003). There is so-called sluicing (1.40), generally also
analysed as a form of sentential ellipsis in which only the wh expression remains
(Levin 1982, Chung et al. 1995, Merchant 2001).
The list goes on and on from there. For each major construction, distinct forms
of interpretation are seen to be associated with subtly distinct structural restric-
tions. As we shall see in chapters 3 and 6, there are a number of different types
of relative clause construal, suggesting at least rather different forms of relativis-
ing element, supposedly needing quite distinct forms of analysis. And in chapter
8, we shall confront the problem of be for which at least five different forms have
been posited - presentational, existential, predicative and equative be plus the aux-
iliary. Despite the temptation to posit discrete structures coinciding with discrete
forms of interpretation, the consequences of any decision simply to invoke ambi-
guity have to be faced, as part of the challenge of confronting the phenomenon of
context-dependence in general. Any such decision is no more than an evasion of the
challenge of providing an explanation of what intrinsic contribution the expression
or structure brings to the interpretation process, which at one and the same time
licenses and yet limits the flexibility of its interpretation.
The alternative conclusion stares one in the face, begging to be drawn. The
phenomenon of context dependence is general. It dominates the problem of pro-
viding an adequate account of natural language construal; indeed it constitutes its
very heart. This is not a phenomenon that can be swept aside as a mere ambiguity
of the expressions that display it.9 Furthermore, once we grant that the meaning
of words contained in a sentence is not sufficient to establish the interpretation of
what is conveyed in an utterance, we have to articulate meanings for expressions
that are systematically weaker than the interpretation they may be assigned, with
a definition of the forms of update that take place during the process of building up
such interpretations.10 So what we shall be seeking to provide is a formal charac-
terisation of an expression’s lexical meaning in terms of some specification which is
9 See Carston (2002) for a demonstration that it also infuses conceptual content, with the
only the input to an interpretation process that is sensitive to the context in which
the expression is uttered.
expressions that are in some sense long are moved to the right periphery of a string. However,
this has sometimes led to bizarre conclusions, e.g. that variability in constituent order known as
scrambling is a PF phenomenon, outside core syntax (see Karimi (2003) for different representative
views about scrambling). Indeed, since all dislocation processes are to some extent context-
dependent, this line of thought logically leads to the conclusion that no such phenomena should
be characterised as part of core grammar.
12 Despite the fact that linear precedence is encoded in LFG at the level of c-structure, it is
uniformly agreed that the substantial core of linguistic generalisations has to be expressed at
f-structure, which retains no record of linear order. In Head-Driven Phrase Structure Grammar
(HPSG, Sag et al. 2003), discussions of linear order have tended to be concerned with capturing
generalisations about word orders within particular languages and linear precedence rules are
in general treated independently of the principal syntactic operations on feature structures (e.g.
Uszkoreit 1987, Sag and Wasow 1999), although there are exceptions (Kathol 1995, 2000, Reape
1993). See also Hoffman (1995), Baldridge (2002) who set out a Combinatory Categorial Grammar
(CCG) characterisation of local scrambling in free word order languages in terms of a set-theoretic
type specification which reflects the view that the grammar simply fails to dictate local NP-
ordering relative to the verb.
12 CHAPTER 1. THE POINT OF DEPARTURE
syntactic primitive is subtly shifting in recent minimalist work, without concern that
this might be undercutting basic theoretical assumptions. This move started with
Kayne (1994), who observed that the mapping of dominance structures onto linear
strings imposes an asymmetry on the structures to be licensed, with such constraints
as ‘specifier’ precedes ‘head’, ‘head’ precedes ‘complement’, a view which has been
extremely influential despite the fact that these constraints are highly problematic
to sustain in the face of verb-final languages, imposing a large number of movement
processes in any characterisation of those languages (Simpson and Bhattacharya
2003).13 Nevertheless we take it as no coincidence that the increasing interest in
issues of linearity has also led within minimalism to a view which directly correlates
the grammar formalism with the dynamics of language performance (Phillips 1996,
2003), in a view not dissimilar to our own.
There is a major problem caused by the disavowal of explanations that make
reference to the dynamics of processing order: it creates a tension between the
characterisation of how words are put together to form strings (syntax), and char-
acterisation of interpretation of such strings (semantics). Though syntax is assumed
to provide the input for semantic characterisation (at some LF interface in min-
imalist terms), the remit of semantics is assumed to be an explanation of how
information is built up in context to yield a new context for what follows, a task
which has to be fulfilled despite the fact that syntactic processes are not defined
in terms that support this remit. This creates two problems, one formal, one em-
pirical, the empirical evidence showing that the formal stance adopted cannot be
right. First, we face a design problem for the language model: it is far from obvious
how to fit syntax and semantics together as distinct but correlated components of a
formal system that is designed to reflect what a speaker/hearer knows independent
of any application of that knowledge, given that one of these components but, ap-
parently, not the other has to be defined in dynamical update terms with essential
reference to context. On the empirical side, the facts simply deny the sharp separa-
bility of anaphora construal and the articulation of structural restrictions that this
separation of considerations of structure and context-dependence might lead one
to expect. To the contrary, there is pervasive inter-dependence of anaphora and
structural processes, as we shall spend much of this book showing. All that can be
done to preserve the encapsulation of the grammar formalism and its separation
from more general concepts of context-dependence and the dynamics of language
processing in context is to assume a bifurcation between anaphoric processes which
display interaction with structural processes, and those which appear to depend,
in some sense more loosely, on the larger dialogue context. But this is difficult to
reconcile with the fact that any one pronoun can be used in both ways.14 The result
for such types of solution (Aoun and Li 2003) is a lack of integrated explanation
of any single anaphora/ellipsis process. The formalism merely defines a subset of
cases to which some arbitrarily delimited anaphoric process applies, generally one
that can be reduced to some binding mechanism independently articulated in the
grammar (see Morrill (1994), Hepple (1990) for categorial grammar examples).15
13 We are grateful to Öystein Nilsen for informing us of recent research moves in the direction
pronouns.
15 Despite the emphasis in Discourse Representation Theory of providing a framework in which a
unitary account of anaphora can be expressed (Kamp and Reyle 1993), the same bifurcation occurs
if the version of Discourse Representation Theory presupposes an attendant grammar formalism
which assigns co-indexing between anaphoric and antecedent expressions for grammar-internal
processes, for the DRS construction algorithm which dictates construal of pronouns through an
identification of variables will apply only to supposedly discourse-general processes, excluding
1.1. COMPOSITIONALITY AND RECURSION IN NATURAL LANGUAGE13
Any language can be used to illustrate this. In this chapter, we stick to English.
We start with the resumptive pronoun strategy in English where the pronoun serves
to pinpoint the position in the relative-clause construal at which some reflex of the
head noun has to be seen as contributing to the interpretation of the relative (the
pronoun it in (1.41)):16
(1.41) There was this owl which it had got its foot caught in the goal netting.
The problem that examples like these display is that, on standard assumptions,
the requisite interpretation of the pronoun is the result of a structural process, but
this process is characterised independently of, and logically prior to, the process of
assigning semantic interpretation to the assigned structure. The characterisation is
in terms of some form of co-dependency of elements in a long-distance dependency
structure.17 Whatever the particular mechanisms of the particular formalism, it
forces an analysis of this use of anaphoric expression of an entirely different sort
from the more general phenomenon of context-dependence. One possible way to
avoid the theoretical consequences which these data apparently present is to label
the problematic subcases as a discrete lexical property of some itemised form of
pronoun, possibly phonologically weakened. This is an instance of the problematic
ambiguity tactic very generally adopted in the face of problematic data. But this
tactic really is not well motivated, as all other definite and demonstrative NPs
can be used in the same way where there is no possible analysis in terms of some
phonologically weakened form, and the goal of analysing resumptive pronouns in
a way that integrates them with the more general phenomenon of anaphora is a
recognised challenge (Boeckx 2003, Asudeh 2004):18
(1.42) I watched this woman, who the idiot had got herself into a dreadful
muddle trying to sort out her papers in the middle of the conference hall.
(1.43) This afternoon I’m interviewing a mature student, who this woman is
asking for extra time, and we may not be able to avoid giving it to her.
The other alternative is to exclude the data altogether. Asudeh (2004), for
example, analyses these as ungrammatical (as does Boeckx (2003)), requiring an
additional production mechanism that licenses local strings which determines their
acceptability to the speaker.19 Yet, this common occurrence of optional resumptive
pronouns in relative clauses that are nevertheless deemed to be substandard is not
peculiar to English. It is displayed in other European languages - Italian, Spanish,
Portuguese at least. In contrast, strikingly, resumptive pronouns are not usable
at all in German. But this makes a pure production analysis seem unlikely: how
could the requisite production strategy not be applicable in speaking German? This
problem should not be dismissed as just a one-off production error associated with
pronouns captured by the grammar from its remit. In this connection, it is should be noted
that the Dynamic Predicate Logic (DPL) characterisation of anaphora (Groenendijk and Stokhof
1991) presumes on a prior indexing of pronouns provided by some attendant grammar formalism,
so there is no formal reflex of the underspecification intrinsic to anaphoric expressions.
16 This is a strategy which is used much more freely in other languages. See chapter 4, and Cann
et al. forthcoming. The example was uttered spontaneously by the second author, 13.08.04.
17 This is achieved in various ways in the different frameworks: feature sharing in HPSG, term
of a general characterisation of anaphora, but his analysis involves positing a morphologically null
operator which has the effect of removing the pronominal properties of the resumptive, thereby
ensuring its function as a gap needing binding by the associated gap-binding wh-operator.
19 It is notable, in this connection, that these are often reported as possible to parse, but with the
observation that “I wouldn’t say it that way”, an observation which is at odds with his analysis.
14 CHAPTER 1. THE POINT OF DEPARTURE
there could be some interaction between order and binding. The theory is, however, problematic
in requiring the explosion of functional categories to allow anything like the full range of word
order phenomena found even in a single language.
1.1. COMPOSITIONALITY AND RECURSION IN NATURAL LANGUAGE17
(1.49) a. The report on our project Sue has warned us will be first on next
meeting’s agenda.
b. *That it was obvious was unfortunate I was wrong.
These and other challenges we shall take up in due course.
Before getting into any of the details, we now set this putative shift in per-
spective against the larger psychological background. It is no coincidence that the
concept of underspecification that we espouse provides the tool which will bring
anaphora construal and long-distance dependency together. We shall find that
underspecification of interpretation assigned to some signal is intrinsic to all sig-
nals processed by the cognitive system, a phenomenon much broader than some
language-internal property. And once we grant the need for a general cognitive
explanation of this phenomenon, it opens up the way for articulating in detail the
particular forms of underspecified input plus update which natural languages make
available, against a background in which the phenomenon of language is grounded
directly in a general cognitive framework.
ing different sub-tasks, there has been a recent, rapid growth of interest in inferential approaches,
using Bayesian models of probabilistic reasoning (Knill and Richards 1996).
1.2. INTERPRETING THE WORLD AROUND US: THE CONSTRUCTION OF REPRESENTATIO
for many people to accept that what they see is a symbolic interpretation of the
world – it all seems so like the ‘real thing’. But in fact we have no direct knowledge
of objects in the world” – a view which goes back at least to Helmholtz (1925), who
argued that perception is unconscious inference.
These examples highlight a very important aspect of this process which Fodor
did not lay much emphasis on. This is the systematic gap between the information
provided by the stimulus itself and the information humans recover from it. We
are all always adding to what it is we have literally “seen” in virtue of what else we
know that is available to us at the time. But if signals do not themselves determine
the interpretation that we impose on them, because the retinal image itself fails to
uniquely determine the information to be recovered, what is it that determines how
signals are interpreted?
on its own cannot explain how choices about which information to attend to can
be made. Somehow or other, most information probably interacts with what we
believe already in some way or other, so in practice it is not possible to process
all incoming information and check for potential contextual effects. So Sperber
and Wilson propose that maximisation of contextual effects is counter-balanced by
processing cost. Mental activity involves “cost”: thinking, information retrieval
from long-term memory, deriving conclusions are all activities which need cognitive
resources. These resources have to be allocated so as to derive maximally relevant
information with justified cognitive effort. This Principle of Relevance is then taken
to underpin how all input to the cognitive system is enriched to determine a specific
interpretation in context, whatever the form of input.
In applying such considerations to language construal, Sperber and Wilson
(1995) argue that there is one additional consideration: the signal provided is a
manifest display of the speaker having an intention to communicate, and this in
itself provides extra information. In virtue of this display of intention, the hearer
is justified in spending cognitive effort on processing a communicated message, and
furthermore minimal cognitive effort, because she can assume that the form of the
utterance used is selected as the most relevant one possible in the given situation.25
From this perspective, the puzzle presented by pronouns and elliptical structures is
just part of what is a quite general cognitive process and is not some idiosyncracy
of natural language. All human processing involves constructing a representation
which we take, following Fodor, to be the interpretation provided by some given
signal. The choice as to which interpretation to construct from a signal is dictated
by the very general cognitive considerations encapsulated in a constraint such as
the principle of relevance. We can see this from the examples given earlier, e.g.
(1.18.a,b), repeated below.
1.18 a. Though John and Mary adored each other, he married Sue.
Whenever he upset her subsequently, she would remind him that it
was Mary he should have married.
b. Though John and Mary adored each other, he married Sue. The only
time they subsequently met, he upset her so badly she was glad he
had married Sue, not her.
We interpret he upset her in (1.18.a) as ‘John upset Sue’ because this is the only
choice consistent with what the speaker has said in the clause that follows. Such an
interpretation, easily constructed in the provided context gives us inferences which
are informative in that context in a way that the competing interpretation is not.
In (1.18.b), the same string has to be interpreted as ‘John upset Mary’ for similar
reasons, given the difference in the following clause. Least effort is not enough: in
these cases this yields two competing interpretations. But neither is maximisation
of inferences. It is the two together that determine the particular interpretation
assigned in context. That is, we make decisions swiftly, with least effort scanning
the smallest structure for a term which could provide an interpretation the speaker
could have intended, with the specific context determining what that choice can
be.
The insight that anaphora and elliptical processing are interpreted by the same
general pragmatic principles that drive all interpretive procedures is not uncontro-
exactly the new information interacts with old information. Beliefs might be strengthened or
contradicted, or the new information might provide a premise to derive a conclusion which would
not have followed from the initial information state.
25 This involves a weaker principle which Sperber and Wilson call The Communicative Principle
of Relevance.
1.3. COMPETENCE AND PERFORMANCE: TOWARDS A NEW VIEW 21
versial. Prior to Sperber and Wilson’s work, linguists and psychologists had very
generally assumed that the free processing of language in context, what we label
pragmatics, was different in kind from the process of assigning interpretation to
expressions such as pronouns, which was taken to form part of establishing the
content directly expressed in uttering a sentence. In Gricean pragmatics in partic-
ular, a sharp division is drawn between the propositional content associated with
an utterance, and further more indirect pragmatic effects that might follow from
the fact of its being uttered, with only the latter being constrained by the so-called
maxims of conversation.26 Part of Sperber and Wilson’s contribution has been to
demonstrate to the contrary that the way we assign interpretation to words such
as pronouns is subject to the very same constraints as determines the much richer
array of interpretational effects such as metaphor, irony, and so on.27
models of pragmatics. However, there is the puzzle of the various forms of inter-
action between anaphora construal and structural processes waiting to be solved.
If, as already suggested, we take on the challenge of solving this puzzle by mod-
elling discontinuity effects and anaphoric processes as forms of underspecification
which undergo processes of update, then we can no longer assume that all linguistic
processes happen first and pragmatically driven processes operate solely on their
output. Rather, what is needed is a model which articulates not merely some req-
uisite set of logical structures, but also a process of growth of such structures given
natural-language input, with an assumption that general constraints, such as that of
optimal relevance, can determine how individual choices might be made during any
one such growth process which the model licenses. So the resulting model does not
have the same property of encapsulation as assumed by Sperber and Wilson, along
with many others. The twist in the tale is that a formal account of the individual
time-linear steps whereby structure corresponding to interpretation can be built
up turns out to have just the right flexibility for capturing structural properties of
language without being vacuously general.
Of course, we do not expect anyone to believe this stance without argumenta-
tion; and the purpose of the rest of this book is to provide this. Having set out
the Dynamic Syntax framework, we shall look first at long-distance dependency,
anaphora construal and their various forms of interaction, an account which will
include accounts of all the major types of relative clause formation, and the major
structures that have been distinguished as left and right periphery effects, such as,
Hanging Topic Left Dislocation, Clitic Left Dislocation, expletives, pronoun dou-
bling and Right Node Raising. We shall then turn to looking at Japanese, which,
as a prototypical verb-final language, is notoriously problematic for current for-
malisms: these languages at best are assigned analyses which makes parsing them
seem irreducibly non-incremental, despite the fact that there is lots of evidence that
the parsing of Japanese is just as incremental as parsing any other language (Inoue
and Fodor 1995, Kamide and Mitchell 1999, Fodor and Hirose 2003, Aoshima et
al. 2004).28 Chapters 7 and 8 pursue the enterprise to show how co-ordination and
agreement patterns in Bantu are constrained by time linearity and how the con-
cept of underspecification can be used to articulate a uniform account of a range
of copular clauses in English. In contrast to standard and static approaches, we
shall see that the time-linear perspective of Dynamic Syntax allows an entirely nat-
ural set of analyses of these phenomena using the same concepts of tree growth,
while preserving incrementality of parsing. In addressing these and other current
puzzles, our claim in each case will be that the model we set out will be uniformly
simpler than other more conventional models that are designed not to reflect the
time-linearity of natural language processing.
As a result of this shift in perspective, the competence-performance distinction
looks very different. Though there remains a distinction between the linguistic-
competence model and a general theory of performance, the articulation of that
competence model is no longer disconnected from the articulation of the latter.
To the contrary, the competence model is developed on the assumption that it
provides the architecture within which the choice mechanisms of performance have
to be implemented. The claim that the system is parsing-oriented is nonetheless a
far cry from making the claim that no abstraction from the data of performance is
needed. We are not providing a performance model, despite the concern with the
dynamics of language processing. What we are claiming is that the explanation of
the intrinsic patterns displayed in natural language is best explained in terms of
28 See Phillips (2003), and Schneider (1999) for attempts to devise parsing formalisms which
the dynamics of how interpretation is built up in real time, and the model itself
defines a concept of tree growth that does indeed reflect the dynamics of processing
linguistic input in time, with the left periphery of a sentence string presumed to
provide information that has to be used first in developing the emergent structural
configuration.
new structure can be added to the overall system without effects elsewhere, it is
hard to see what linguistic phenomena this richness in expressive power excludes.
While minimalist approaches to syntax still maintain a rich array of grammatical
devices, the aim of restricting the vocabulary for expressing linguistic properties
nevertheless has important implications for linguistic theory, not least with respect
to simplifying the relation between knowledge of language and language use.
Dynamic Syntax also differs from Discourse Representation Theory (DRT: Kamp
and Reyle 1993), though it shares with DRT the commitment to articulating a level
of representation not inhabited by the string of words as an essential intermediate
step in characterising compositionality of denotational content. The difference be-
tween the two systems lies in the fact that, in DS, this form of representation also
constitutes the basis for syntax: there is no other level of structure with indepen-
dently defined primitives over which syntactic generalisations are expressed. DRT,
on the other hand, simply takes as input to the interpretation algorithm whatever
syntactic formalism is selected as appropriate, defining a construction algorithm
mapping such syntactic structures onto discourse representation structures, in this
respect falling into the same family of formalisms as LFG.
There are however some real similarities between the formal notations which
these theories use and Dynamic Syntax, a matter to which we will return; and the
arguments in favour of DS over either of these is mainly in terms of simplicity,
and the ability to articulate more principled explanations. The Right Roof Con-
straint is one such case. This is the constraint alluded to earlier in introducing
the syntax-semantics mismatch, which applies to rightward dependencies but not
leftward ones, enforcing a strict locality of dependencies at the right-periphery of
a clause. This asymmetry is unexpected in all frameworks in which structural gen-
eralisations depend solely on structural configuration: there is no reason in such
frameworks to expect asymmetry between left and right peripheral effects. How-
ever, as chapter 5 will show, this constraint emerges as a direct consequence of what
it means to be establishing forms of construal in the latter stages of the construc-
tion process, rather than in the early stages. Given the richness of vocabulary of
HPSG and LFG, the more restricted set of facts can doubtless be described by for-
mal specification of the structures over which such dependencies are articulated,29
but the underlying explanation of this asymmetry eludes such static formalisms.
So despite the heavy machinery that the frameworks make available, we shall see
that a simpler architectural assumption can lead to simpler individual analyses. It
might seem that in only defining one type of structural representation, Dynamic
Syntax would be less expressive than multi-level theories, but, as we shall see, DS
is nonetheless able to articulate a rich array of variation in linguistic distribution
by the ability to make reference to intermediate stages in the construction process.
At the other extreme lies Categorial Grammar, which explains natural lan-
guage phenomena solely in terms of mappings from phonological sequences onto
denotational contents without any essential reference to a level of representation
at all. This is the pure Montague approach to natural languages following the for-
mal language pattern (see section 1.1.1). The DS framework shares with Categorial
grammar formalisms the assumption that syntactic categories reflect semantic com-
binatorial operations, as reflected in type theory, and there are other similarities too.
However, DS has the edge over these formalisms in at least two respects. Categorial
grammar formalisms fail altogether to take up the challenge of reflecting the wealth
of interaction between anaphora and structural processes, and in general no unitary
characterisation of anaphora is provided (see Hepple (1990), Morrill (1994), Steed-
29 See Dipper (2003) for an LFG characterisation of German which provides such a stipulation.
1.4. CODA: PARSING AS THE BASIC ACTIVITY 25
man (1996, 2000), though for a notable exception see Ranta (1994)). It might be
argued that there is a concept of delay in semantic combinatorics expressed through
the type-lifting of an NP expression to a functor from VP contents to sentence con-
tents, which might be taken as equivalent in effect to structural underspecification
plus update hence not requiring additional mechanisms to express; but as we shall
see, the left-peripheral placement of expressions is only one of several forms of
underspecification plus update. There is a problem in principle in expressing un-
derspecification of content in categorial grammars, as their explicit commitment to
the view that grammars are logics leaves no room for expressing such systematic
underspecification and enrichment. It is a consequence of this commitment that
verb-final languages, which display a high degree of context-dependence, constitute
a major challenge for categorial grammar formalisms.30
Of all the extant formalisms, the DS formalism is perhaps closest to LTAG - Lex-
icalized Tree Adjoining Grammar (Joshi and Kulick 1997). LTAG projects typed
structures directly from lexical expressions over which type-deduction is defined
(as in categorial grammar formalisms and Dynamic Syntax). Long-distance dis-
continuity effects are characterised by processes of tree-adjunction, mapping pairs
of trees into a single composite tree. In particular, the long-distance dependency
of left-peripheral expressions is defined by splitting apart some lexically defined
tree-structure and injecting into it a further tree corresponding to all the non-local
material intervening between the left-peripheral expression and its site of depen-
dency. This is unlike the DS system, in which the process of tree growth is defined
to apply in a way that strictly follows the dynamics of parsing. Indeed, in be-
ing intrinsically dynamic, the DS system is unlike all these systems, despite many
similarities at the level of tools of description, for the time-linear incrementality of
natural-language construal is built into the formalism itself. The simplicity that we
gain, as we shall set out case by case, lies in the fact that the concepts of under-
specification are twinned with a monotonic tree growth mechanism that is defined
over the left-right sequence of words.
of arguments, any one of which may be optional if recoverable from context. Multi-set typing has
been proposed to offset this problem in part (Hoffman 1996, Baldridge 2002), but this is little more
than a notational economy (see Kiaer in preparation for detailed evaluation of the difficulties facing
categorial grammar formalisms in addressing linguistic distributions in verb-final languages).
26 CHAPTER 1. THE POINT OF DEPARTURE
text, with one additional constraint: the structures being constructed must match
some intended content. Consider the case of answering a question:
(1.50) A: What shall we eat?
B: Curry.
Speaker A sets out a structure by providing a string for B to process where her
request is for a value for the place-holding wh expression. Once B has established
this by parsing the string, all she has to do in replying is to provide this value. She
does not have to make any distinct hypothesis as to the structure A, her hearer,
has as context with which to interpret that answer: she “knows” the context A has,
because she herself has just parsed the string A said. All she has to do in choosing
the word curry is to check that the concept it expresses, rather than, say apples,
matches the content she has in mind when put into the structure set up by A’s
question. One could of course analyse ellipsis as involving a high level production
strategy that is quite independent of parsing, which involves first assuming that
some thought has to be constructed by some process, say the thought ‘B wants
to eat curry’, then selecting the words to express it on the basis of having made
a strategic decision as to what A has as their immediate context, and only then
deciding which of those words need to be said, and in what order. If this were the
appropriate account of production, then the correlation of parsing and production,
as presumed in the informal account just given, would break down. However,
in dialogue, where context dependence is extensive, there is little evidence that
such high level judgements are a necessary part of production, as we shall see in
chapter 9. To the contrary, there is increasing evidence that use of context at all
levels of production choice is a means of cutting to the minimum such high level
decisions (Pickering and Garrod 2004). The whole point of being able to rely on
information which one has just parsed oneself and which therefore by definition is
part of the private cognitive context that one brings to bear in any cognitive task,
is that making choices that are got from one’s own context-store will completely
side-step the need to search through one’s general store of words; and this makes
the production task much easier. In short, like hearers, speakers rely on their own
immediate context, but with the additional constraint that all choices made have
to be checked for consistency with what they have in mind to convey.
Finally, as long as we adopt even the outlines of Relevance Theory, we are com-
mitted to the primacy of understanding, for this explains inferential abilities in
communication as resulting from cognitive abilities relevant for processing infor-
mation, so from interpretation rather than production. Sperber and Wilson derive
communicative behaviour as expressed in the communicative principle of relevance
from general cognitive behaviour, namely from relevance-driven processing as em-
bodied in the cognitive principle of relevance and definition of relevance. In other
words, our ability to assess and choose information in linguistic communication is
a reflex of our ability to handle information in general. Indeed, as things will turn
out, general concepts of information processing, though with language-particular
instantiations, will turn out to be central to every single analysis of syntactic phe-
nomena we provide, for concepts of underspecification and monotonic growth of
representation will be at the heart of them all. In all cases, we shall argue, it is
these which directly make available simpler, more intuitive accounts of linguistic
phenomena. Moreover, the account will have the bonus of providing formal expres-
sion to what might otherwise be dismissed as naive functionalism. So the challenge
we shall be setting out is this: take the dynamics of the parsing process seriously,
and you get a grammar that is simpler both at the level of individual analyses, and
at the level of the grammar architecture.
Chapter 2
The Dynamics of
Interpretation
We have a two-fold challenge ahead. The first is to set out a model of how inter-
pretation is recovered in context. The second is to establish why this constitutes
the basis for syntactic explanations. As we saw in chapter 1, the heart of the ex-
planation is our commitment to reflecting the way humans can manipulate partial
information and systematically map one piece of partial information into another
in language processing, using each piece of information provided as part of the
context for processing each subsequent input. The challenge is to use these, intrin-
sically dynamic, concepts to replace analyses which depend on a discrete syntactic
vocabulary, involving processes such as movement, feature passing, etc. It is in this
respect, above all, that this formalism will depart from all other widely adopted
grammar formalisms.
In this chapter, we set out the basic framework, beginning with a sketch of the
process of building representations of content and subsequently developing the con-
cepts and technical apparatus. The discussion will be kept as informal as possible,
but more formal material is introduced so that readers can get a feel for the formal
basis of the theory.1
1 Full details of the formal characterisation of the system can be found in Kempson, et al.
(2001).
2 We shall use text-capitals for technical terms, in particular for rules defined in the theory.
27
28 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
NP VP C IP
N V NP DP I’
Hilary upset N D NP I VP
Joan N V DP
Hilary upset D NP
Joan
Joan′ Upset′
So what is the difference between these trees? In the first place, the tree in (2.2)
contains no information about word order. There is no claim here that English is
verb final - not even with respect to some hypothetical ‘deep structure’. Instead,
the tree represents the semantic structure of the propositional content expressed
by the string Hilary upset Joan so that what labels the nodes in the tree are
compositionally derived concepts, expressed in some lambda calculus, just as in
certain versions of categorial grammar (Morrill 1994, Carpenter 1998, etc.). The
tree thus reflects a jigsaw view of how we can entertain complex concepts (Fodor
1998), but notably not a jigsaw view about words.3 The trees in Figure 2.1, on
the other hand, reflect putative properties of words in strings that define structure
over those strings. So, VP labels a node that consists of a word that is a verb plus
another word that functions (simultaneously) as a noun and a noun phrase (and
possibly a determiner and determiner phrase). The syntactic structure determined
(or projected) by the words exists independently of the words themselves and is
constrained by independently defined rules or principles, again stated over strings
of words.
3 To maintain a distinction between words and the concepts they express, when we refer to
words we will use italic script, and when we refer to the concept we will use non-italic script, an
initial capital letter and a following prime. The proper name John thus expresses the concept
John′ . We also use italic script occasionally for emphasis, as here, but we assume it will always
be obvious in context which is intended.
2.1. A SKETCH OF THE PROCESS 29
Hilary′
7→ ?T y(t) 7→ ?T y(t)
Hilary′ • Hilary′ •
Cresswell (1985) for arguments that support such a view with respect to the objects of verbs of
propositional attitude.
5 It should be noted here that we are not in this book probing the internal structure of concepts
themselves. So we have nothing to say about the concept Sing′ or Love′ ; and indeed nothing to
say about the relation between the word loves and the concept which it expresses, other than that
it expresses the concept Love′ . This is a huge and complex topic. At the two extremes, some
have argued for definitions associated with words (e.g. Bierwisch 1969), others for a one-to-one
word-concept correspondence (Fodor 1981), with various intermediate positions (e.g. Pustejovsky
1995, Rappaport and Levin 1998). Yet others have noted that concepts vary according to the
particular context in which they are used, so there is a context-dependency to the use of words
to express concepts much like the context-dependence in using pronouns (Carston 2002). For the
purposes of this book, we adopt the view of Fodor (1981) that words express primitive concepts,
with a word and the concept it expresses being in one-to-one correspondence (see also Marten
(2002b)).
2.2. THE TOOLS OF DYNAMIC SYNTAX 31
The formula is just one of several labels that can decorate a node. In addition
to the formula label, we also have a label that gives information about the type
of the formula expression. The type of an expression is its semantic category, as-
sociating an expression with a particular sort of denotation. So, an expression of
propositional type t denotes a truth value; that of type e is a term that denotes some
entity. Functor types are represented as conditional statements so that an expres-
sion of type e → t expresses a (one-place) predicate, since when it combines with a
term (of type e) it yields a proposition (of type t) and denotes a set (see any num-
ber of introductory books on formal semantics, e.g. Chierchia and McConnell-Ginet
(1990), Cann (1993), Gamut (1991), Carpenter (1998)). Although most theories
of types assume a recursive definition of types, yielding an infinite set of types,
Dynamic Syntax makes use of only a small, predefined, set consisting of three basic
types e, t, cn,6 and a restricted set of functors based on these to provide sufficient
structure to account for the number and types of arguments of verbal and nominal
predicates. There is also a type cn → e that is assigned to quantifiers. Type raising
operations and the higher order types associated with Montague and Categorial
Grammar play no part in the grammar formalism, since concepts of underspecifi-
cation of update replace those of type-lifting and composition of functions, as we
shall shortly see. The table in (2.4) lists the most common types used in this book
with a description and examples.7
(2.4) Types in DS
Type Description Example expression
T y(e) Individual term F o(Mary′ ), F o(ǫ, x, Student′ (x))
T y(t) Proposition F o(Sing′ (John′ )),
F o(Upset′ (Hilary′ )(Joan′ ))
T y(e → t) (1-place) Predicate F o(Upset′ (Hilary′ )), F o(Run′ )
T y(e → (e → t)) (2-place) Predicate F o(Upset′ ), F o(Give′ (John′ ))
T y(e → (e → (e → t))) (3-place) Predicate F o(Give′ ), F o(Put′ )
T y(t → (e → t)) (Proposition) Predicate F o(Believe′ ), F o(Say′ )
T y(cn) Nominal F o(x, Student′ (x)),
F o(y, Father′ (John′ )(y))
T y(cn → e) Quantifier F o(λP.ǫ, P )
Trees, therefore, display nodes decorated not only with formulae but also their
associated types, as in (2.5). In fact, all information holding at, or annotating, a
tree node is stated as a declarative unit, or a tree node description. Declarative
Units (DUs) consist of consistent sets of labels expressing a range of different sorts
of information.
(2.5) Representation of content of Eve likes Mary
T y(e → (e → t)),
T y(e), F o(Mary′ )
F o(Like′ )
6 The latter being the type assigned to common nouns where the formula consists of an ordered
pair of variable plus a propositional formula where that variable occurs free. See chapter 3.
7 All quantifying expressions are analysed as terms of type e, for example the term
ǫ, x, Student′ (x) listed is the analogue in these terms of existential quantification ∃, x, Student′ (x).
32 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
This tree displays how information from the functor nodes combines with infor-
mation from the argument nodes to give the complex formula value at the mother
node. As in Categorial Grammar, application of modus ponens over type values is
parallelled by function-application over formula values.8
?T y(t), ♦
We assume that the parse of Eve likes gives rise to the tree in (2.6) through mech-
anisms that we shall see below.
8 However,
in this notation, DS types are conditional types without any implication for the order
of natural language expressions (unlike the “forward-slash” and “backward-slash” of Categorial
Grammars).
9 Requirements may be modal in form in which case the specified goal may be achieved by an
?T y(t)
At this stage, there are three outstanding requirements, ?T y(t), ?T y(e → t), and
?T y(e). The pointer symbol, ♦, shows that, following the processing of Eve likes,
the node under development is the internal argument node of the predicate, as
determined by the lexical actions associated with parsing the verb likes (see below).
The current task state is thus ?T y(e), a requirement to construct a term. If, in
this situation, information from the lexicon provides an expression of T y(e) (such
as by a parse of Harriet ), it can be introduced into the tree at that node, since
it matches, and so satisfies, the current requirement. However, if the next word
is sing, its associated predicate F o(Sing′ ) cannot be introduced into the tree even
though its type, T y(e → t), matches one of the requirements in the tree. This is
because the pointer is not at the node at which this requirement holds. No update
can, therefore, be provided by the word sings, and the sequence of actions induced
by parsing the verb cannot lead to a completed logical form. The parse must be
aborted and, hence, the sequence of words *Eve likes sings is not a grammatical one.
The pointer thus gives important information about the current state of a parse
and the theory of how the pointer moves will form a significant role in the analyses
to be presented in later chapters of the book. As you can see from this sketch
of a simple example, the notion of grammaticality rests, not merely on whether a
certain parse leaves requirements unsatisfied, but also on where the pointer is at
any particular point in a parse.
00 01
010 011
0110 0111
34 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
By convention, the left daughter node of a node n is assigned the index n0 and
the right daughter is assigned the index n1. This information may form part of
a Declarative Unit (the information collected at a node) and is expressed by the
predicate T n (tree node) which takes as value some index, as illustrated in the tree
in (2.8).
(2.8) Tree-locations from a parse of John sings
In this tree, the left daughter is decorated with an argument formula, the right
daughter is decorated with a formula which is a functor that applies to that ar-
gument to yield the decoration at the mother. As a convention, this is a pattern
we shall retain throughout: arguments as left daughters, and functors as right
daughters.
So far our vocabulary can only describe individual nodes. However we can de-
scribe the relation between tree nodes if we add modal operators to the language of
description. The relations between tree nodes can then be described by modal state-
ments which provide a means to state that some information holds at a daughter
or at a mother node, and more important, as we shall later see, a means to express
requirements that need to be satisfied at some node other than the current node.
There are two basic modalities, one corresponding to the daughter relation,
h↓i ‘down’, and one corresponding to the mother relation, h↑i ‘up’. These can
be used with and without the numerical subscript, depending on whether it is im-
portant to distinguish between left (argument) and right (functor) branches. Hence,
h↓1 i refers to the functor daughter, while h↓0 i refers to the argument daughter node
(and similarly for the mother relation, h↑1 i, h↑0 i, but in this case there is only ever
a single mother). As these symbols form part of a modal logic, an expression like
h↓0 iT y(e) at node n, means ‘there exists an argument daughter that node n imme-
diately dominates which is decorated by the label T y(e)’ (i.e. node n has a term
as its argument daughter).
Furthermore, modality operators can be iterated, e.g. h↓ih↓i, h↓ih↑i, etc. This
provides a means of identifying from one node in a tree that some property holds
of some other node.
(2.9) T n(0), Q
T n(00), P T n(01), P → Q
T n(010), R T n(011), R → (P → Q)
It is thus possible to describe the whole of the tree in (2.9) from any node within it.
Hence, the statements in (2.10) are all true of this tree from the node with treenode
address 010, i.e. that one decorated by R.
2.2. THE TOOLS OF DYNAMIC SYNTAX 35
(2.10) a. h↑0 iP → Q
at my mother, P → Q holds
b. h↑0 ih↓1 iR → (P → Q)
at my mother’s functor daughter, R → (P → Q) holds
c. h↑0 ih↑1 iQ
at my mother’s mother, Q holds
d. h↑0 ih↑1 ih↓0 iP
at my mother’s mother’s argument daughter, P holds
The tree may be described in many different ways using the modal operators,
depending on the ‘viewpoint’, the node from which the description is made. Another
is shown in (2.11) which takes the topnode as the point of reference and provides
a complete description of the tree. In this description, each node is described in
terms of the information that holds at that node.
(2.11) a. T n(0), Q
at node 0, Q holds
b. h↓0 iP
at my argument daughter P holds
c. h↓1 iP → Q
at my functor daughter P → Q holds
d. h↓0 ih↓1 iR
at the argument daughter of my functor daughter P → Q holds
e. h↓1 ih↓1 iR → (P → Q)
at the functor daughter of my functor daughter R → (P → Q) holds
As we shall see shortly, the use of this logic is crucial for many analyses in the
system. It allows, for example, a specification of the lexical actions associated with
parsing some word to construct nodes within a tree or to annotate the current or
other nodes with some information, as we shall see below. The modal operators
also come into play with respect to parsing particular words to check aspects of the
local linguistic context (the partial tree constructed so far) to ensure that the parse
is well-formed. As an example, one way of expressing case relations is to articulate
them as a constraint on decorating the current node only if its mother bears a
certain kind of decoration. Hence, one might define a subject pronoun in English,
such as they, as licensed just in case the node that dominates the current node
carries a propositional requirement, h↑0 i?T y(t), i.e. the current node is a subject
node. Such a constraint is satisfied in the first tree in (2.12) (showing part of the
analysis of They sing) but not in the second (disallowing *Kim likes they).
(2.12) Proper and improper contexts for parsing they
OK T n(0), ?T y(t) NOT OK T n(0), ?T y(t)
The modal operators may also be used in conjunction with the notion of require-
ment to constrain further development of the tree. So while an annotation such as
h↓0 iF o(α) indicates that the formula value α holds of my argument daughter (and
is therefore only true if within the tree as currently constructed, F o(α) actually
36 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
does decorate that node), one such as ?h↓0 iF o(α) merely states that at some point
in constructing the current tree, F o(α) must decorate my argument daughter. This
gives us another way to express case – as providing a filter on the output tree.
So we might also express nominative case as providing a decoration of the form
?h↑0 iT y(t).10
Notice that in this second characterisation of case that the ‘?’ precedes the
modal operator, so this formulation expresses the constraint that in order to achieve
a well-formed result, the current node must be immediately dominated by a node
which has the decoration T y(t). This is of course not achieved at any stage prior to
the final goal, hence it is a filter on the output tree. As we can already see, this will
give us a flexibility with respect to the analysis of case; and individual languages
or indeed individual case specifications may differ in the restrictions they impose.
Quite generally, we shall see that such modal requirements are very important and
will be used extensively throughout this book.
10 We recognise that this sort of specification of case as just involving conditions on tree position
(i.e. of argument status) is insufficient as a complete characterisation of case in languages that
have well-articulated systems of morphological case. It is, however, sufficient for our limited
purposes and will be used to provide the basis for a theory of constructive case in chapter 6 with
respect to scrambling in Japanese.
2.3. CONSTRUCTING TREES 37
b.
{. . . φ . . . ♦ . . . }
{. . . ψ . . . ♦ . . . }
As already noted, a pair of transition rules together drive the process of growth
by breaking down goals into subgoals. The rule of Introduction is effectively an
inference from an initial goal to one in which two subgoals are added in the form
of requiring the tree to grow and be annotated by information that together can
satisfy the original goal. Specifically, the rule unpacks a requirement to find an
expression of one type into requirements to have daughters which are decorated by
expressions of other types which can be combined through functional application to
give an expression of the appropriate type. In other words, the rule adds to a node
with a requirement to be decorated with an expression of type X, requirements to
have daughters decorated with expressions of type Y on one node and type Y → X
on the other. This is formally defined in (2.14) in terms of tree descriptions and
shown in terms of tree growth in (2.15).11 Notice that the tree in (2.15) has not
actually grown the daughter nodes: it has merely acquired requirements to have
such nodes. So the tree, as in the input, consists of only one node.
(2.14) Introduction (Rule)
{. . . {. . .?T y(Y ) . . . ♦} . . . }
{. . . {. . .?T y(Y ), ?h↓0 iT y(X), ?h↓1 iT y(X → Y ), . . . ♦} . . . }
The outermost brackets in such tree descriptions allow the rule to apply at any
point in the tree-construction process, and will be a general property of the rules
unless explicitly defined otherwise.
The second rule, called Prediction, introduces the required nodes and deco-
rates them with requirements to be decorated by expressions of the required type.
As before, we give the formal definition in terms of tree descriptions (2.16) followed
by the same information shown in terms of tree growth (2.17).12
(2.16) Prediction (Rule)
as the first of the two nodes to be expanded. Although this is not necessary from the point of
view of the grammatical formalism, we take this move to be universal and a reflection of the fact
that (for example) subjects are typologically more frequently found before their verbs than after
them. We will see below and throughout the book how other word orders may be achieved.
38 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
?T y(Y ), ♦ ?T y(Y → X)
The effect of these rather complex looking rules is, in fact, best illustrated by a
single step of tree growth as shown in (2.20) which shows a tree growing from just
one node decorated with a requirement of type t to a new tree with two new nodes
decorated with requirements to find expressions of types e and e → t.13 Although we
will continue to show the transition rules in their more formal form in this chapter
in order to familiarise the reader with the technical apparatus, we will illustrate the
effects of all such rules using figures that show tree growth directly. In later chapters,
although we will continue to state the formal definitions of transition rules, readers
are encouraged to rely on the more immediately comprehensible illustrations of tree
growth.
(2.20) Introduction and Prediction of Subject and Predicate
?T y(t), ?h↓0 iT y(e), ?T y(t), ?h↓0 iT y(e),
7→
?h↓1 iT y(e → t), ♦ ?h↓1 iT y(e → t)
?T y(e), ♦ ?T y(e → t)
One property of these rules should be noted immediately: all rules are optional, so
the system is constraint-based and in this respect like HPSG and LFG and unlike
minimalism.14
13 In fact, it is solely this variant of Introduction that we shall use, narrowing down the set of
For example, an expression like a proper name is of T y(e) and requires that
there be a current requirement ?T y(e) at the stage at which the lexical entry is
scanned. This provides the appropriate trigger. The THEN statement lists the
particular actions which are performed if the condition in the IF statement is met.
In the case of a proper name, this can be taken to be a simple annotation of the
current node with type and formula information, using the predicate put(). Finally,
the only other possibility is to abort the parse. The lexical entry for a name like
Hilary is thus as shown in (2.22) and its effect is to induce the transition shown in
(2.23) from the output tree in (2.20) above.15
IF ?T y(e) Trigger
(2.22) Hilary THEN put(T y(e), F o(Hilary′ ), [↓]⊥) Annotation
ELSE Abort Failure
15 This view of proper names as having no internal structure will be revised in Chapter 3, when
we incorporate an account of quantification.
40 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
?T y(e), T y(e),
?T y(e), ♦ ?T y(e → t) ?T y(e → t)
F o(Hilary′ ), [↓]⊥, ♦
The one strange decoration in this sequence of actions is [↓]⊥, what we call
“the bottom restriction”. This annotation is to express that the word is used to
decorate a terminal node in the tree. Formally it reads: “necessarily below the
current node (for every node the current node immediately dominates), the falsum
holds”: i.e. the current node has and will have no daughters with any properties at
all. Intuitively, this is the reflection in this system that words provide the minimal
parts from which interpretation is built up. So we reflect a compositionality of
meaning principle like anyone else; but the information projected by words may be
considerably more than merely providing some concept.
Within lexical entries, the order of the action predicates is important. For
example put(. . . ) before make(. . . ) means ‘put information at current node, then
build a new node’, while make(. . . ) before put(. . . ) means ‘build a new node and
put some information there’. This is important, for example, in parsing verbs in
English. We take the parsing of verbs in English to be triggered by a context in
which there is a predicate requirement, ?T y(e → t).16 The actions induced by
parsing finite verbs involve at minimum the annotation of the propositional node
with tense information and the annotation of the predicate node. This is all that
will happen with intransitive verbs, as shown in the lexical entry for danced in
(2.24).
(2.24) danced
IF ?T y(e → t) Predicate Trigger
THEN go(h↑1 i?T y(t)); Go to propositional node
put(T ns(P AST )); Add tense information
go(h↓1 i?T y(e → t)); Go to predicate node
put(T y(e → t), F o(Dance′ ), [↓]⊥) Add content
ELSE Abort
There are a number of things to notice about the information in this lexical en-
try. In the first place, the condition for the introduction of the information from
danced is that the current task state is ?T y(e → t). Then there is movement from
that node up to the immediately dominating propositional node, given by the in-
struction go(h↑1 i?T y(t)) ‘go up to the immediately dominating node decorated by
a propositional requirement’. This node is then annotated by tense information
which we have represented simplistically in terms of a label T ns with simple values
P AST or P RES(EN T ) (for English).17 Notice that this means of encoding tense
obviates the need for ‘percolation’ or ‘copying’ devices, as required in HPSG and
other frameworks, to ensure that information introduced by a word gets to the
place where it is to be interpreted. This is done entirely through the dynamics
16 Differences in the trigger for classes of expression is one of the ways in which cross-linguistic
better account would include proper semantic information manipulating indices so that the tense
label does not look like a simple syntactic label. See also chapter 3 where the account of tense is
slightly expanded in conjunction with the discussion of quantification.
2.3. CONSTRUCTING TREES 41
of parsing the verb.18 The pointer then returns to the open predicate node and
annotates that with type and formula information (and the bottom restriction) as
we saw with parsing Hilary.
Notice that the order in which the action predicates occur is important, reflect-
ing the dynamic nature of the analysis. If the action put(T ns(P AST )) preceded
the action go(h↑1 i?T y(t)), the predicate node would be decorated with the tense
information and not the propositional node. Similarly, the ordering of put(T y(e →
t), F o(Dance′ ), [↓]⊥) before go(h↓1 i?T y(e → t)) would fail to satisfy the proposi-
tional requirement on the node and so lead to a failure of the parse.
So intransitive verbs add information about tense and supply a one-place pred-
icate. Transitive verbs, however, not only add tense, but create new nodes: a
two-place predicate node which it annotates with type and formula values and
a node for the internal argument, decorated with a type e requirement. This is
illustrated in the lexical entry for upset in (2.25).
(2.25) upset
IF ?T y(e → t) Predicate trigger
THEN go(h↑1 i?T y(t)); Go to propositional node
put(T ns(P AST )); Tense information
go(h↓1 i?T y(e → t)); Go to predicate node
make(h↓1 i); Make functor node
put(F o(Upset′ ), T y(e → (e → t)), [↓]⊥); Annotation
go(h↑1 i); Go to mother node
make(h↓0 i); Make argument node
go(h↓0 i); Go to argument node
put(?T y(e)) Annotation
ELSE Abort
The condition for the introduction of the information from upset is that the cur-
rent task state is ?T y(e → t). If this condition is met, a new functor node is built
and annotated with the formula and type values specified, and following the return
to the mother node, a new daughter node is built with a requirement for a for-
mula of type e. To be fully explicit, the decoration F o(Upset′ ) should be given as
F o(λyλx[Upset′ (x, y)]), with λ-operators indicating the number and type of argu-
ments with which the predicate Upset′ has to combine, and the order in which this
functor will combine with them.19
18 It might be objected that tense information should be generalised, otherwise one might expect
different verbs within the same language to behave differently with respect to such inflectional
matters. Possibly, in morphologically regular constructions, the phonological information pro-
vided by the consonant cluster indicates a separate lexical specification for the suffix, an analysis
of phonological clustering advocated within Government Phonology (Kaye 1995). In fact, it is
relatively easy to structure the lexicon within Dynamic Syntax as is done in HPSG (Sag and
Wasow 1999, etc.) so that past tense (for example) can be stated generally as an instruction to
go up to the mother node and decorate that with the past tense label:
tense-past IF T y(e → t)
THEN go(h↑1 i); put(T ns(P AST )); go(h↓1 i)
content
where content is the basic actions induced by all forms of the verb, in the case of dance merely the
decoration of the predicate node with the appropriate formula, type and bottom restriction. There
are differences between HPSG accounts and what is necessary in the current dynamic framework,
but we do not pursue these refinements in this book. What cannot differ between lexical specifica-
tions is the specification of tense necessary for the appropriate semantics to be given, here specified
as a decoration on the type t-requiring node as a promissory note for a formal characterisation.
See also chapter 3, and chapter 6, where we see that the past-tense morpheme of Japanese plays
a particular role in determining the way in which interpretation is incrementally built up.
19 The λ operator is an abstraction operator which constructs functions from some input to a
42 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
To illustrate the effect of the parse of a transitive verb, (2.26) shows the transi-
tion from the output tree in (2.23) to a tree with the pointer at the open predicate
node, which triggers the parse of the verb upset to give the second tree.
(2.26) Parsing Hilary upset
?T y(t) 7→
T y(e), F o(Hilary′ )
?T y(e → t), ♦
[↓]⊥
T y(e), F o(Hilary′ )
?T y(e → t)
[↓]⊥
T y(e → (e → t))
?T y(e), ♦
F o(Upset′ ), [↓]⊥
With the pointer again at an open T y(e) node, it is possible to parse another proper
noun, say Joan, to yield the tree shown in (2.27).
(2.27) Parsing Hilary upset Joan
T y(e), F o(Hilary′ )
?T y(e → t)
[↓]⊥
T y(e → (e → t))
T y(e), F o(Joan′ ), ♦
F o(Upset′ ), [↓]⊥
The first rule provides a means for stating that requirements have been ful-
filled:
(2.28) Thinning
{. . . {. . . , φ, . . . , ?φ, . . . , ♦} . . . }
{. . . {. . . , φ, . . . , ♦} . . . }
All this rule does is to simplify the information accumulated at a node (a Declarative Unit
(DU)): if, at the current node, a DU holds which includes both a fact and the re-
quirement to fulfil this fact, the requirement is deleted and the pointer remains at
the current node. This is the only means of getting rid of decorations, rather than
adding them. Hence, in (2.23), there is a further step before the pointer moves to
the predicate node: getting rid of the type requirement, as shown in (2.29).20
(2.29) Parsing Hilary with thinning
?T y(t) 7→ ?T y(t)
We also need transition rules that move the pointer on from a type-complete
node and by so doing to satisfy the modal requirement imposed by Introduction.
In Kempson et al. (2001), this is done by a rule called completion (which can be
regarded as the inverse of Prediction) which moves the pointer up from a daughter
to a mother and, crucially, annotates the mother node with the information that
it indeed has a daughter with certain properties. This latter move has the effect
of satisfying the modal requirement introduced by Introduction. The rule of
completion is given in (2.30) and the effect in terms of tree growth is shown in
(2.31). Hence, completion states that if at a daughter node some information
holds which includes an established type, and the daughter is the current node,
then the mother node may become the current node. In the tree-display, we use
a ternary branching format to emphasise the neutrality of Completion between
functor and argument daughters.
(2.30) Completion (Rule):21
... h↑iT n(n), T y(X), ♦ ... ... h↑iT n(n), T y(X) ...
20 In general, however, we will not show the final transition determined by thinning, assuming
alise this to cover the return of the pointer having constructed and decorated an unfixed node
(symbolised by the ∗ notation).
44 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
Notice how this formulation of completion brings out how the use of modal
statements reflects the perspective of the node in the tree at which they hold. So in
the rule as formulated, some daughter node is defined to have a mother node T n(n)
above it: i.e. it itself is defined as h↑i iT n(n). In the input state defined by the rule,
T y(X) holds at the daughter node. The effect of the rule is to license the pointer
to shift up to the mother node; and from the perspective of the mother node, to
record the fact that h↓i iT y(X) holds. The latter annotation, as noted, satisfies the
requirement ?h↓ii T y(X) written to the mother node by Introduction and the
node can duly be thinned.
We also need a rule for moving the pointer down a tree which we call antic-
ipation and which moves the pointer from a mother to a daughter which has an
outstanding requirement.22 The rule is given in (2.32) and the effect on tree growth
is shown in (2.33).
(2.32) Anticipation (Rule):
The rules for pointer movement mean that the transitions in (2.26) are mediated
by the further transitions shown in (2.34): the first licensed by completion; the
second by thinning and the third by anticipation. Although these transitions
are formally necessary (and have an effect on licit derivations, as we shall see in
later chapters), we will, in general, ignore this sort of straightforward development,
assuming that the pointer always moves directly from a type-complete node to a
(sister) node hosting an open requirement.
(2.34) Parsing Hilary with completion and anticipation
a. After Thinning
?T y(t), ?h↓0 iT y(e), ?h↓1 i(T y(e → t)
T y(e), [↓]⊥,
?T y(e → t)
F o(Hilary′ ), ♦
b. Completion
?T y(t), ?h↓0 iT y(e), h↓0 iT y(e), ?h↓1 i(T y(e → t), ♦
T y(e), [↓]⊥,
?T y(e → t)
F o(Hilary′ )
22 The term ‘anticipation’ being intended to convey the idea that the movement of the pointer
c. Thinning
?T y(t), h↓0 iT y(e), ?h↓1 i(T y(e → t), ♦
T y(e), [↓]⊥,
?T y(e → t)
F o(Hilary′ )
d. Anticipation
?T y(t), h↓0 iT y(e), ?h↓1 i(T y(e → t)
T y(e), [↓]⊥,
?T y(e → t), ♦
F o(Hilary′ )
T y(Y ), F o(α) T y(Y → X), F o(β) T y(Y ), F o(α) T y(Y → X), F o(β)
Notice that elimination does not introduce a new node, but only changes an-
notations holding at one node: if a given node immediately dominates an argument
and a functor daughter which are both annotated with a formula and a type value,
then the two type values can combine by modus ponens, with the corresponding
F ormula expressions combined by function-application. For example, in complet-
ing the analysis of Hilary upset Joan from the output tree in (2.27d) (shown as
the initial tree in (2.37)), completion licenses the movement of the pointer to the
open predicate node. Then elimination applies to satisfy that type requirement
as shown in the last of the partial trees in (2.37).
23 Note that if construction rules were stated in the same format as lexical actions (as they could
be), the condition could be stated straightforwardly as an instruction to abort the actions, just in
case a daughter has an unsatisfied requirement:
... IF h↓i?φ
THEN abort
ELSE ...
46 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
T y(e), [↓]⊥,
?T y(e → t)
F o(Hilary′ )
T y(e), [↓]⊥,
?T y(e → t), h↓0 iT y(e), ♦
F o(Hilary′ )
24 Notice that in the function-argument notation in the Formula language, the functor always
T y(e), [↓]⊥,
?T y(e), ♦ ?T y(e → t) ?T y(e → t), ♦
F o(John′ )
The verb thinks is now parsed. This is associated with the lexical information
in (2.41) which shows present tense and the creation of an internal argument node
with a type t requirement. The effect of parsing the verb is shown in (2.42).
IF ?T y(e → t)
THEN go(h↑1 i?T y(t));
put(T ns(P RES));
go(h↓1 i?T y(e → t));
(2.41) thinks make(h↓1 i), go(h↓1 i);
put(T y(t → (e → t)), F o(Think′ ), [↓]⊥);
go(h↑1 i); make(h↓0 i); go(h↓0 i);
put(?T y(t))
ELSE Abort
2.3. CONSTRUCTING TREES 49
T y(t → (e → t)),
?T y(t), ♦
F o(Think′ ), [↓]⊥
T y(e), [↓]⊥,
?T y(e → t)
F o(John′ )
The effect of parsing the complementiser is shown in (2.44) and the parse of
the rest of the string proceeds exactly as discussed in the previous section, yielding
the tree in (2.45) up to the point at which the embedded proposition has been
completed. Notice that thinning applies twice to the internal propositional node:
once to eliminate the type requirement ?T y(t) and once to eliminate the requirement
for tense information ?∃x.T ns(x).
50 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
T y(e), [↓]⊥,
?T y(e → t)
F o(John′ )
The completion of the tree then proceeds through steps of completion, elimina-
tion and thinning on the predicate and top propositional nodes to yield the final
output shown in (2.46).25
(2.46) Completing John thinks that Hilary upset Joan
T ns(P RES), F o(Think′ (Upset′ (Joan′ )(Hilary′ ))(John′ )), T y(t), ♦
T y(e), [↓]⊥,
T y(e → t), F o(Think′ (Upset′ (Joan′ )(Hilary′ )))
F o(John′ )
Notice in this connection that lexical specifications may do more than just dec-
orate the terminal nodes of a tree, but decorating the terminal nodes is essential
to what they do. So there is a sense in which the simple form of the traditional
principle of compositionality (that sentence meanings are composed of meanings of
words in a certain configuration) is retained. It is just that the information that a
word may convey may be more than just some concept: it may provide information
about what other concepts it may combine with; it may introduce other nodes in
the tree (e.g.the object argument for a verb); or in the case of tensed verbs, the
word may project decorations on other nodes. On the other hand, a word like the
subordinating complementiser that does not decorate a terminal node; indeed, it
25 At this point, the simplistic account of tense we have provisionally adopted means that we
cannot express the compilation of tense as part of the propositional formula. Defining this involves
essential interaction with quantification, so this aspect has to wait until chapter 3.
2.4. LEFT DISLOCATION STRUCTURES 51
In words, this is the interpretation ‘John thinks that Hilary upset Joan’ in which
the formula itself is derived compositionally in a regular bottom-up way from the
terms introduced into the tree structure. The primary difference from the normal
syntax-semantics correspondence is that this structure has been introduced on a
left-to-right basis, rather than on a bottom-up basis as is standard.
T n(0), Q, h↓∗ iR
There are four decorated nodes in this tree, but only three of them are in fixed
locations. The fourth is described as holding at h↑∗ iT n(0) indicating that it holds at
some node within the tree along a sequence of daughter relations from the topnode
but without that sequence being further specified. In short, the only information
52 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
similar to the concept of ‘functional uncertainty’ of LFG (Kaplan and Zaenen 1989), though the
resulting analyses are distinct: see chapter 5 section 5.6.
27 Pronounced “star adjunction”, the name being intended to express the idea of a node which
The output tree in (2.52) provides an environment in which the proper name
Joan can be parsed. Completion can apply to the type-complete node because
h↑∗ i is one of the modalities that it refers to (see (2.30)) and the pointer moves
back up to the topnode.28 The parse of the string in (2.50) then proceeds through
Introduction and Prediction to provide subject and predicate requirements,
permitting the parse of Hilary upset, with the unfixed node remaining on hold, so
to speak, as shown in (2.53).29
(2.53) Parsing Joan, Hilary upset
a. Parsing Joan
T n(0), ?T y(t)
F o(Joan′ ), T y(e),
T n(00), ?T y(e), ♦ T n(01), ?T y(e → t)
h↑∗ iT n(0), ?∃x.T n(x)
28 This generalisation of completion is not a license to move the pointer up through arbitrary
fixed nodes in a tree. Kempson et al. (2001) make a distinction between a relation of ‘internally
dominate’, which applies to unfixed nodes only, and one of ‘externally dominate’ which is the
general dominate relation ranging over all dominated tree positions, fixed or unfixed. External
domination is shown using up and down arrows without the angle brackets. So, ↑∗ is the external
modality corresponding to the internal modality h↑∗ i. Only the latter is referred to in the rule
of completion so movement of the pointer upwards can only take place from a currently unfixed
node (decorated also by the treenode requirement ?∃x.T n(x)) or from a fixed argument or functor
daughter, but not from any other nodes.
29 In all trees that follow, the bottom restrictions imposed by lexical specifications will be omit-
At the point reached in (2.53d), the pointer is at a node with an open type e
requirement. Since this matches the type of the unfixed node, the latter may merge
with that position.30 The output of this process is the unification of the information
on the unfixed node with that on the internal argument node. This ensures that
the open type requirement on the latter and the outstanding requirement on the
unfixed node to find a treenode address are both satisfied, as can be seen from
the DU in (2.54a). Thinning applies (over ?∃x.T n(x) and T n(010) and ?T y(e)
and T y(e)) to give (2.54b), where because the specific modality (h↑0 ih↑1 iT n(0)) is
subsumed by the general one (h↑∗ iT n(0)), only the former need be retained.
(2.54) a. {h↑∗ iT n(0), h↑0 ih↑1 iT n(0), ?∃x.T n(x), T n(010), ?T y(e), T y(e),
F o(Joan′ ), [↓]⊥, ♦}
b. {h↑0 ih↑1 iT n(0), T n(010), T y(e), F o(Joan′ ), [↓]⊥, ♦}
This process is shown informally in (2.55) where the dashed arrow indicates the
merge process and the second tree shows the output. Notice that when compiled
and completed, the tree in (2.55b) is identical to the output tree for Hilary upset
Joan in (2.38). The informational differences between the two strings are thus not
encoded in the representation, as in many current theories of syntax, either as a
separate layer of grammatical information (Vallduvı́ 1992) or in terms of functional
categories projected at the left periphery of the clause (Rizzi 1997, and others fol-
lowing him). Instead, the differences, we assume, derive from the different ways
that the final tree is established. With the left dislocated object, a term is pre-
sented to the hearer which provides an update for a propositional structure to be
established from the rest of the string – a type of focus effect (see Kempson et al.
(2004) for further discussion).31
30 The process of merge used in DS should not be confused with the entirely different process
of Merge in the Minimalist Program (Chomsky 1995). A better term in DS would be unify, but
we adhere to the term introduced in Kempson et al. (2001).
31 Note, in passing, how the process directly reflects an idea of focus as involving an isolated
term which provides an update to some, given, propositional form (Rooth 1985, Erteschik-Shir
1997), a perspective which we pursue somewhat further in chapter 4.
2.4. LEFT DISLOCATION STRUCTURES 55
T n(00), T y(e),
T n(01), ?T y(e → t)
F o(Hilary′ )
At this point, we need some reflection on the differences between the formal
apparatus and what the tree displays seem to be reflecting. With the unfixed node
appearing to the left of the main propositional tree attached to the topnode, it
looks from the tree displays that it is somehow associated with some position on
the left periphery, analogous perhaps to some Topic or Focus projection. However,
this is merely an artefact of the use of tree diagrams to illustrate the concepts
used. Recall that trees give representations of meaning, not words and phrases,
so the representation of the node to the left or the right is immaterial. More
importantly, unfixed nodes do not inhabit any determinate position in the tree
and, technically, the information associated with the unfixed structure (i.e. the
declarative unit that describes that structure), is checked against each partial tree
as successive pairs of daughter nodes are introduced. So in effect, the information
from such nodes is passed down the tree, step by step, until a fixed position for
the node can be established. Merge may then take place at any stage where the
information on the unfixed node is compatible with that on the fixed position. The
development shown in (2.53) might thus be more accurately shown as (2.56), where
the information associated with the unfixed node is carried down the tree along
with the pointer. The information associated with the unfixed node is shown inside
the dashed boxes .
(2.56) Parsing Joan, Hilary upset
a. *Adjunction 7→
T n(0), ?T y(t),
h↑∗ iT n(0), ?T y(e), ?∃x.T n(x), ♦
b. Parsing Joan
T n(0), ?T y(t),
h↑∗ iT n(0), T y(e), F o(Joan′ ), ?∃x.T n(x), [↓]⊥, ♦
56 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
Note that at each of these intermediate nodes, the decorations on the unfixed
node are checked for consistency with that intermediate node. And at any one
of these nodes T n(a), the unfixed node can be described as h↑∗ iT n(a) because,
according to this concept of ‘be dominated by’, the relation holds between two
nodes if there is any sequence of daughter relations between the two nodes in
question, including the empty one, hence if the property holds at that node.32
At this point we need to be a bit more precise as to what the process of merge
involves. Quite simply, all it does is to unify two node descriptions, here referred
to as DU, DU′ (a pair of DUs) as indicated in the formal rule in (2.57).
(2.57) Merge
{. . . {. . . DU, DU′ . . . } . . . }
{. . . {. . . DU ⊔ DU′ . . . } . . . }
♦ ∈ DU′
The only constraints on this completely general rule are that: (a) the pointer is
one of the decorations on one of the DUs (the fixed node) and (b) that the two
32 In this way, the concept of the unfixed node shows some similarity with the SLASH passing
mechanisms of HPSG (Sag and Wasow 1999). A significant difference, however, is that, once
discharged through merge, no record remains of the fact that the left peripheral expression was
ever unfixed.
2.4. LEFT DISLOCATION STRUCTURES 57
2.5 Anaphora
We have so far been considering the processing of sentences pretty much as though
in isolation. We now have to rectify this, and take up the challenge put out in
chapter 1 to provide a unitary characterisation of pronominal anaphora. To do this,
we assume that a pronoun gives rise to a place-holder for some logical expression
which has been constructed within the context of utterance. Antecedents, though
they may be given by previous words, cannot be the words themselves as this gives
quite the wrong result over and over again. Presuming that the antecedent of the
pronoun in (2.58a) is the quantifying expression wrongly fails to predict what is
the appropriate interpretation of (2.58a), and wrongly predicts that it should have
the same interpretation as (2.58b).
(2.58) a. Every child thinks that he should get a prize.
b. Every child thinks that every child should get a prize.
Assuming the general stance that words provide lexical actions in building up rep-
resentations of content in context, in contrast, gets us on the right track. We can
thus say that the pronoun may pick out some logical term if that term is provided
in the discourse context, whether it is a full logical name or a variable introduced by
some quantifying expression, and so on. We consider only the simplest cases here,
leaving a discussion of the quantificational cases until the next chapter, but the
effect of such an assumption, together with the adoption of the epsilon calculus to
provide an account of quantification (see chapter 3) is to capture both the fact that
pronouns contribute very differently to interpretation depending on the antecedent
that they have, and that a pronoun is nevertheless not lexically ambiguous in the
sense of having a number of quite different interpretations defined in the lexicon.
To achieve the simple notion that pronouns pick out some logical term from the
discourse context, we again have recourse to underspecification, in this case to the
underspecification of content, rather than underspecification of position, as with
our account of left dislocation. So we extend the vocabulary of our Formula values
to allow for placeholders for specific values. These we call metavariables in the
logical language and represent as boldface capitals U, V and so on. A pronoun
then projects one such metavariable as the F o value given by its lexical actions. As
a metavariable is just a placeholder for some contentful value, it is associated with
a requirement to establish such a value, ?∃x.F o(x), just as unfixed nodes with the
underspecified modality h↑∗ iT n(n) are associated with a requirement to find a value
for its treenode label, ?∃x.T n(x). Following the general pattern that all require-
ments have to be eliminated in any well-formed derivation, the formula requirement
ensures that metavariables will be replaced by a term in the F ormula language as
part of the construction process. Such replacement is established through a prag-
matically driven process of substitution which applies as part of this construction
process.
As an illustration, in processing an example such as (2.60), uttered in the context
of having just parsed an utterance of (2.59), we assume the steps of interpretation
in processing the subject and object expressions shown in (2.61).34
(2.59) John ignored Mary.
(2.60) He upset her.
34 The trees are shown schematically; and types and other requirements, once established, are
F o(U), T y(e),
?T y(e → t)
?∃x.F o(x), ♦
b. Substitution
?T y(t)
T y(e),
?T y(e → t), ♦
F o(John′ )
c. Parsing He upset
T ns(P ast), ?T y(t)
T y(e),
?T y(e → t)
F o(John′ )
T y(e → (e → t)),
?T y(e), ♦
F o(Upset′ )
T y(e),
?T y(e → t)
F o(John′ )
F o(V), ♦,
T y(e → (e → t)),
T y(e),
F o(Upset′ )
?∃x.F o(x)
2.5. ANAPHORA 61
e. Substitution
T ns(P ast), ?T y(t)
T y(e),
?T y(e → t)
F o(John′ )
It is important to notice that the metavariables are replaced, at steps (2.61b), and
at step (2.61e), so there is no record thereafter of there having been a pronominal
form of input. The tree resulting from the parse of John upset Mary and that of
He upset her, as uttered in the context of having just parsed (2.59), is effectively
identical in the two cases, the only difference being the occurrence of meta-variables
redundantly decorating the node once Substitution has taken place in the con-
strual of the string containing pronouns.
The lexical specification for the pronoun he can now be given. It must express
both the need for the pronoun to have a value established in context, and specify
whatever other constraints the pronoun imposes. So, for example, he is associ-
ated with a constraint on substitution that whatever term is substituted for the
metavariable projected by the pronoun is describable (in context) as having the
property M ale′ . This sort of constraint (a presupposition) we show as a subscript
on the metavariable.35 Additionally, the case of a pronoun constrains which node
in a tree the pronoun may decorate. For nominative pronouns in English, where the
morphological specification of case is very restricted, we take nominative case spec-
ification to take the form of an output filter: a requirement that the immediately
dominated node be decorated with type t. In English, of course, there is a further
constraint that nominative pronouns only appear in finite subject positions, so that
the actual constraint is shown as a requirement to be immediately dominated by a
propositional node with a tense decoration.36
(2.62) he
IF ?T y(e)
THEN put(T y(e); Type statement
F o(UMale’ ); Metavariable and Presupposition
?∃x.F o(x); Formula Requirement
?h↑i(T y(t) ∧ ∃x.T ns(x)); Case Condition
↓ [⊥]) Bottom Restriction
ELSE Abort
Other pronouns and other case-forms project analogous information. So, for
example, the first and second pronouns project presuppositions that whatever is
substituted for the metavariable must be the speaker or the hearer/addressee, re-
spectively (i.e. USpeaker’ , UHearer’ ). Notice that, just as with third person pronouns,
the metavariable projected by I/me or you will be replaced by some logical term,
picking out the speaker or hearer, and so the output propositional form ceases to
35 We do not, in this book, go into presuppositional effects or the formal analysis of such things,
3, there is more than one way to express case restrictions. This will be taken up in more detail in
chapter 6, where we shall see that in Japanese, nominative case specification is arguably different
from other case specifications.
62 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
SUBJ ?T y(e → t)
What this does is to exclude all local arguments as possible substituends for any
metavariable. Additionally, it also excludes as a potential substituend a formula
decorating an unfixed node. This is because, as we saw above, the unfixed node is
evaluated at each node along the path between the node where it is introduced and
the node where it merges. The decorations on the node, including F o(α), are eval-
uated for whether they hold at that very node, and this entails h↑0 ih↑∗1 ih↓0 iF o(α).
Hence, an unfixed node is always local in the sense intended. With this approach,
then, neither Hilary likes her nor Hilary, she likes can ever be associated with the
propositional formula Like′ (Hilary′ )(Hilary′ ). The tree-description vocabulary can
thus capture the type of generalisations that other frameworks rely on such as fa-
miliar concepts of locality (indeed rather more, since the characterisation of partial
trees is the central motivation for turning to a formal tree-description language).
This substitution process is assumed to be defined over a context of structures
from which putative antecedents are selected. At this stage, we rely on a purely
intuitive concept of context, with context construed as an arbitrary sequence of
trees, including the partial tree under construction as the parsing process unfolds
incrementally (see chapter 9 for a more detailed discussion of context). So we
presume that the context relative to the parsing of John thinks that he is clever
at the stage of parsing the pronoun includes the partial tree under construction,
hence making available F o(John′ ) as a putative antecedent. This is so because John
decorates the matrix subject which is not a co-argument of the node decorated by
he, the modality between the latter node and the former is mediated by a second
argument node: h↑0 ih↑0 ih↑1 ih↓0 i.38
in the most common circumstances. However, Substitution is not involved here but the lexical
actions associated with a reflexive identify a local formula and use that as a substitute as part of
the parsing process directly. A word like herself is thus associated with a lexical entry like:
IF ?T y(e)
THEN IF h↑0 i?T y(t)
THEN Abort
ELSE IF h↑0 ih↑∗1 ih↓0 iF o(α)
herself
THEN put(T y(e), F o(α), [↓]⊥)
ELSE Abort
ELSE Abort
ELSE Abort
See chapter 6 for a discussion of the anaphor zibunzisin in Japanese.
39 The terminology in this area is extremely confusing, with ‘topicalisation’ as one standard term
for long-distance dependencies that are characteristically used for the focus effect of isolating one
term as update to some remaining structure.
64 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
material. Such an effect can be seen in the clitic doubling structures evident in lan-
guages such as Modern Greek, as illustrated in (2.67). The phrasal expression ti
Maria we take to project internal structure (that of an iota expression, a position
we take up for all proper names from chapter 3) and hence necessarily developing
the treenode decorated by the clitic tin.
(2.67) ti Maria tin ksero
theACC Maria herACC I know
‘I know Maria.’
We go into such constructions in more detail later in the book, but essentially the
clitic tin in (2.67) projects a metavariable but does not decorate that node with
the bottom restriction. This allows the unfixed node decorated by ti Maria to
merge with that node, yielding a structure identical to that derived by parsing the
non-dislocated string ksero ti Maria.
The example in (2.67) also shows another common property of natural lan-
guages: pro-drop. This is the phenomenon in many languages of licensing the
occurrence of a verb without any independent expression providing the argument
for the predicate, presuming on its identification from context. The dissociation of
a metavariable from the bottom restriction illustrated by clitic doubling, however,
also provides a straightforward way to analyse such languages.
As we have seen in this chapter, subject nodes in English are introduced by
the construction rules Introduction and Prediction while the lexical actions
associated with verbs are triggered by a predicate requirement, ?T y(e → t). As
such, transitive verbs do not decorate their subject nodes and overt subjects are
thus necessary to guarantee the well-formedness of strings because, without one,
the pointer can never reach the triggering predicate node. However, certain lan-
guages are best treated as having the parsing of verbs triggered by a propositional
requirement, rather than a predicate node. So, for example, in a VSO language
like Modern Irish, we may analyse (non-copular) verbs as being triggered by a type
t requirement and providing a full propositional template with the pointer left at
the subject node for development next. Thus, the verb chonaic ‘saw’ in (2.68) may
be given the lexical entry in (2.69) which has the effect shown in Figure 2.70. The
fact that the pointer is at the subject node allows the immediate parsing of the
first person pronoun mé in (2.68) and the pointer then travels down the tree using
anticipation (twice) to allow the internal object to be parsed.40
(2.68) Chonaic mé an cú
saw I the dog
‘I saw the dog.’
(2.69) chonaic
IF ?T y(t)
THEN put(T ns(P AST )); Tense
make(h↓1 i); go(h↓1 i); put(?T y(e → t)); Predicate Node
make(h↓1 i); go(h↓1 i);
put(F o(Chon′ ), T y(e → (e → t)), [↓]⊥; ) Main Functor
go(h↑1 i); make(h↓0 i); go(h↓0 i); put(?T y(e)); Internal Argument
go(h↑0 ih↑1 i); make(h↓0 i); go(h↓0 i); put(?T y(e)) Subject
ELSE Abort
40 Here and below, we represent the concepts expressed by content words in some language in
terms of primed versions of the stems of those words. So we expression the content of chonaic
in (2.69) as Chon′ , not in terms of their translations into English such as See′ . We do this in
order to sidestep the issue of whether lexicalised concepts can be realised fully in terms of simple
translation equivalents.
2.5. ANAPHORA 65
?T y(e), ♦ ?T y(e → t)
F o(Chon′ ), [↓]⊥,
?T y(e)
T y(e → (e → t))
There is, of course, much more to say about the analysis of Modern Irish, but this
brief sketch of the parsing of main verbs shows one of the ways in which word order
variation can be achieved within Dynamic Syntax: through the manipulation of
triggers (between propositional, main predicate and n-place predicate requirements)
and where the pointer remains after the lexical actions have occurred. It also
provides a means of accounting for subject pro drop languages such as Modern
Greek. While in Modern Irish the subject node is decorated by a type requirement
(at least for analytic forms such as chonaic), indicating the need for an overt subject,
in Modern Greek we may analyse verbs as decorating their subject nodes with
metavariables. Such metavariables, like those projected by pronouns in the way
we have seen, are associated with a formula requirement and so must be updated
with some contentful expression. This is achieved by lexical actions such as those
associated with the verb ksero ‘I know’ shown in (2.72). (2.73) shows the parse of
the sentence in (2.71) with substitution of the subject metavariable being made on
the assumption that Stavros utters the string.41 The resulting proposition is, as
expected:42 F o(Kser′ (Maria′ )(Stavros′ )).
b. Parsing ksero:
?T y(t)
F o(Kser′ ), [↓]⊥,
?T y(e)
T y(e → (e → t))
c. Substitution:
?T y(t)
F o(Kser′ ), [↓]⊥,
?T y(e), ♦
T y(e → e → t)
d. Parsing ti Maria:
?T y(t)
?T y(t)
T y(e → (e → t)),
?T y(e)
F o(Kser′ ), [↓]⊥
internal structure of proper names which in chapter 3 will be rectified when they are treated as
terms which have internal structure, as a form of quantifying expression.
68 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
The important thing to notice about this description is that lines b and c, giv-
ing the descriptions of the initial and introduced subject nodes, describe the very
same node, the argument node immediately dominated by the top node. The full
description of that node should thus be that in (2.77a) with the two descriptions
unified, which itself reduces to (2.77b) once fulfilled requirements and uninformative
information (i.e. the metavariable) are removed.
(2.77) a. {h↓0 iT n(0), T y(e), F o(Stavros′ ), [↓]⊥, F o(U), ?∃x.F o(x), ♦}
b. {h↓0 iT n(0), T y(e), F o(Stavros′ ), [↓]⊥, ♦}
The tree in (2.78b) is thus equivalent to that in (2.78a). This example thus shows
the usefulness of the tree description language which highlights directly the number
and type of nodes in a tree which may be obscured by apparently contradictory
sets of computational and lexical actions.
(2.78) Parsing o Stavros kseri with fixed initial term
a. Introduction and Prediction for subject and predicate:
?T y(t)
T y(e), F o(Stavros′ ),
?T y(e → t)
[↓]⊥, ♦
b. Parsing kseri
?T y(t)
T y(e → (e → t)),
?T y(e)
F o(Kser′ ), [↓]⊥
c. The result:
?T y(t)
T y(e), [↓]⊥,
?T y(e → t)
F o(Stavros′ ), ♦
T y(e → (e → t)),
?T y(e)
F o(Kser′ ), [↓]⊥
an output is achieved and, as we shall see later in this book, different parsing
strategies (possibly signalled by variations in prosodic information) may give rise
to different pragmatic effects that may be exploited to convey non-truth conditional
information.
Before turning to a more detailed discussion of well-formedness and ungram-
maticality it is useful to note, in concluding this section, that the basis for our
account of linguistic variation is in differences in lexical actions associated with
classes of words. So, in Greek, unlike Irish and English, the parse of a verb induces
a structure in which the type-assignment of the subject node is already satisfied
by the projection of a metavariable, thus giving rise to pro-drop effects. In both
Greek and Irish, on the other hand, unlike English, the parse of a verb projects
a full propositional template, giving rise to VSO orders that are excluded in En-
glish.44 In the chapters that follow, we will use different sets of lexical actions for
verbs and pronouns along lines suggested in this section in accounting for different
grammatical properties, in particular in accounting for the interaction of dislocated
or scrambled expressions with anaphora.
F o(John′ ) F o(Upset′ )
F o(Mary′ ) F o(Upset′ )
Given what the framework provides, why is it not possible for John to be construed
as subject and Mary as object in parsing John, Mary upset ? Several possible
derivations might spring to mind. For example, Mary might be taken to decorate
an object node which is introduced first on developing the predicate-requiring node
or, alternatively, to be taken to decorate an unfixed node introduced after the
subject node is established and decorated, which is then merged with the object
node. Why are these not licit alternatives to the parse that achieves the correct
result?
The answer is simple enough, though a bit laborious to establish. As so far
set out, there are two choices at the outset of the construction process. Either
*Adjunction applies and the word John is taken to decorate an unfixed node
with F o(John′ ); or Introduction and Prediction are applied and John is taken
to decorate the subject node. The former strategy gives us what we want, so we
can leave that on one side as unproblematic. Suppose to the contrary that John,
as the first word in the string, is taken to decorate the subject node as in (2.82).
46 This is, of course, not a problem specific to Dynamic Syntax, but is actually true of all major
theories of syntax.
2.6. WELL-FORMEDNESS AND UNGRAMMATICALITY 71
At this point, we immediately face a difficulty in parsing the second noun phrase
Mary, for once the subject node is parsed, the pointer goes back to the top node,
by completion and moves onto the predicate-requiring node, via anticipation.
*Adjunction cannot now apply: it is defined to apply only when the pointer
is at the node decorated with ?T y(t) (i.e. not from the predicate-requiring node).
*Adjunction also cannot apply more than once from any one node: this is ensured
by the requirement that it apply only when there are no other nodes in the (sub)tree
of which it is the top. This is the significance of the double brackets ({{) in the
input tree description condition in the definition of *Adjunction in (2.51):
As soon as either one application of *Adjunction has taken place, or one appli-
cation of Prediction, this condition will no longer apply. With *Adjunction
inapplicable, given the assumption that John has been taken to decorate a sub-
ject node, then the only option available is to develop the predicate goal.47 But
if Prediction indeed non-trivially applies yielding a node to be decorated with the
predicate requirement, ?T y(e → t), then the condition for carrying out the actions
of the name Mary cannot be applied – its trigger of ?T y(e) has not been provided
and no parse can proceed.
Applying the actions of the verb could of course be carried out if the verb
were next in the sequence, as this requirement is the appropriate condition for its
sequence of update actions. However, it is Mary that lies next in the sequence
of words. But with Introduction and Prediction only applying in English
at the level of expanding a propositional node, i.e. with ?T y(t) as the condition
on their application, the trigger for the actions provided by the word Mary is
not provided and the sequence aborts. Hence, the impossibility of building an
interpretation ‘John upset Mary’ from the sequence John, Mary upset. The only
way for a successful derivation to take place, once the decoration on the subject
node has been established, is for the update action of the verb to be carried out
first. And to enable this to take place, the verb has to be positioned before the
noun phrase to be construed as object.
This explanation turns on what may seem to be an ad hoc stipulation that the
only form of Introduction and Prediction that is available in English is the
one that leads to the formation of subject and predicate nodes. If, instead of this
stipulation, Introduction and Prediction are taken to apply in the general,
type-neutral, forms given in the rules in (2.14) and (2.16), then they could apply
in principle at the predicate-requiring node to yield the (schematic) tree in (2.83)
with the pointer at the object node.48
47 In principle, another daughter node could be introduced, but any such move would be trivial,
as any such node being already introduced would immediately satisfy the requirement ?T y(e)
removing the requirement. By a very general principle of tree growth, such trivial updates are
debarred.
48 Recall that Prediction puts the pointer at the open argument node.
72 CHAPTER 2. THE DYNAMICS OF INTERPRETATION
inferences over these. We leave this aspect of context entirely to one side.
73
74 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
The context also, of course, includes the current tree under construction, allowing
substitution of some term, already constructed, for a metavariable projected later in
the parse process, as in (3.2), without any recourse to other trees in the context.
(3.2) Mary thought she was overworked.
In order to characterise relative clauses,2 we extend our notion of context to
allow one partial tree to be built which provides the context in which some other
structure is then built. The process of completing the first tree then takes place
in the context of having constructed the intermediate tree as a side-routine. In
this way, we allow the processor to build structures in tandem, which provide
contexts of interpretation for each other. This is what we propose happens in
relative clause construal, which we take as the proto-typical case of such joint
structure-building. In this way, our analysis treats a major subpart of natural-
language recursion as the progressive projection of individually simple structures
which share a context. The methodology, as before, is to closely follow the dynamics
of time-linear processing, showing how complex interpretations can be established
through the use of underspecification interacting with tree growth processes.3
with some additions, in particular the discussion of non-restrictive relative clause construal.
4 We take up the analysis of restrictive relatives once we have introduced quantification: see
section 3.2.
3.1. LINKED STRUCTURES 75
• and then a subsequent completion of the main clause through a parse of the
verb smokes.
This is exactly the process we use to account for relative clause construal in
general. We take the process of projecting a second propositional structure as
involving the notion of a “link” between one completed node of type e in a partial
tree and the top-node of another tree decorated by a propositional requirement
(?T y(t)). The word who, which we treat as a relative pronoun, following Jespersen
(1927) and many another traditional grammarian, provides a copy of the formula
decorating the node from which the link is projected (the “head”). The parse of the
remaining string then proceeds in the way that we have seen for other left dislocation
structures in chapter 2, to give an output formula value F o(Like′ (John′ )(Sue′ ))
just as in a parse of the string John, Sue likes. The initial tree is then completed
through a parse of the verb to yield a propositional formula F o(Smoke′ (John′ )),
with the whole structure then being interpreted in terms of conjunction of the two
propositions decorating the two trees. We show the output structure schematically
in the figure in (3.5) where the “link” relation is shown by the thick black arrow.
(3.5) The representation of (3.3):
T y(t), F o(Smoke′ (John′ ))
that L is a valid argument of the treenode predicate T n in addition to 0 and 1, the address of a
node which is inversely “LINKed” to a head with address T n(a) being given as T n(aL). So we
76 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
head
z }| {
{. . . {T n(a), F o(α), T y(e), ♦} . . . }
{. . . {T n(a), F o(α), T y(e)} . . . }, {hL−1 iT n(a), ?T y(t), ?h↓∗ iF o(α) , ♦}
| {z } | {z }
head formula requirement
| {z }
LINKed node
Firstly, notice that, in requiring a copy of the formula of the head to be found
within the LINK structure, this rule encapsulates the idea that the latter tree is
constructed in the context provided by the first partial tree. The rule thus cannot
operate on a type-incomplete node and ensures that both structures share a term.
Secondly, the requirement to find the copy of the shared term (F o(John′ ) in this
case) is modal in form, indicating that the copy should be found somewhere in the
unfolding propositional tree but without specifying where in that tree it should oc-
cur. This, recall, is the means by which left-dislocation structures are characterised
as having an initially unfixed node. Expressed here as a requirement, it embodies
the restriction that somewhere in the ensuing construction of the tree from this
node, there must be a copy of the formula decorating the head. This restriction
is notably transparent, stated in this modal logic vocabulary. The structure thus
develops for English with an application of *Adjunction to provide an unfixed
node in anticipation of securing that copy, giving rise to the tree in (3.8).
tree displays.
3.1. LINKED STRUCTURES 77
T n(0), ?T y(t)
h↑0 iT n(0),
?T y(e → t)
T y(e), F o(John′ )
This requirement need not be satisfied immediately: all that ?h↓∗ iF o(John′ )
requires is its satisfaction at some point before the LINKed tree is completed.
When we turn to other languages, this will be significant, as this combination of
requirement and modal statement provides a natural means of expressing a range
of non-local discontinuities between either the head or some clause-initial comple-
mentiser and a pronoun within that clause functioning resumptively. At no point
does there need to be adjacency in the structure between the term providing the
requisite formula as head and the term realising the requirement for a copy of it:
the modality in the formula specification captures all that is needed to express
long-distance dependencies of this form. In English, as it happens, the requirement
imposed by the construction of a LINK transition is met immediately, a restriction,
as noted, which is induced by the lexical actions of the relativisers who, which, and
that. These are defined as rule-governed anaphoric devices that decorate an unfixed
node within a LINKed structure, as shown by the lexical actions associated with
who given in (3.9).
IF ?T y(e), ?∃x.T n(x), h↑∗ ihL−1 iF o(x)
(3.9) whorel THEN put(F o(x), T y(e), [↓]⊥)
ELSE Abort
The context for parsing the word is complex in order to enforce its strict distribu-
tion. What the conditional action says is that if from an unfixed node, there is the
LINK relation defined to some node decorated by some formula F o(x), then a copy
of that formula should be provided for the current node. This action is notably like
that of a pronoun, even to the decoration of the node with a bottom restriction,
except that the value is fully determined as being that decorating the head from
which the LINK transition is defined. The effect of these actions in parsing John,
who is shown in (3.10), the copy requirement now being obviously satisfied.
78 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
T n(0), ?T y(t)
h↑0 iT n(0),
?T y(e → t)
T y(e), F o(John′ )
h↑0 iT n(0),
T y(e), ?T y(e → t)
F o(John′ )
T y(e → (e → t))
?T y(e), ♦
F o(Like′ )
Once the merge step has taken place, the linked structure can be completed
by steps of completion, thinning, and evaluation, all as before, until the top
of the LINKed tree is reached. At this point, however, we can get no further as
there is currently no rule to move the pointer back to the head node to allow a
completion of the parse. However, we may revise the rule of completion from the
last chapter as in (3.12) to move the pointer from a type-complete node across any
daughter or inverse LINK relation.
(3.12) Completion (Revised):
{. . . {T n(n) . . . }, {hµ−1 iT n(n), . . . , T y(X), . . . , ♦} . . . }
{. . . {T n(n), . . . , hµiT y(X), . . . , ♦}, {hµ−1iT n(n), . . . , T y(X), . . . , } . . . }
µ−1 ∈ {↑0 , ↑1 , ↑∗ , L−1 } µ ∈ {↓0 , ↓1 , ↓∗ , L}
3.1. LINKED STRUCTURES 79
With this small revision in place, the matrix proposition can be completed, essen-
tially to yield the tree in (3.5).
There is, however, one further step to take and that is to provide a means of
interpreting such a tree. As noted above, non-restrictive relative clauses give rise
semantically to a conjoined propositional structure. We therefore introduce the first
in a series of what we call LINK evaluation rules, as given in (3.13). This rather
complex looking rule takes a completed propositional tree with treenode address
T n(a) that has embedded in it somewhere a propositional structure LINKed to
some node dominated by T n(a) and returns a conjunction of the two formula values
decorating the rootnodes of those propositional trees. We end up, therefore, with
the tree in (3.14).
(3.13) LINK Evaluation 1 (Non-restrictive construal):
T y(e → t),
T y(e), F o(Sue′ )
F o(Like′ (John′ ))
Notice what this account of relative clauses involves and how this differs from
most other accounts. In the first place, the whole account rests on two concepts of
underspecification, of position within a tree and of content, both of which are used
to characterise other phenomena. Secondly, and most importantly, the story of
relative clause construal is fully anaphoric, involving no concepts of quantification
or binding. Relative pronouns are analysed in the same way as we have analysed
personal pronouns: as expressions that have underspecified content. The differ-
ence between the two types of pronoun is that personal pronouns may get their
content through a pragmatic process of substitution (fairly) freely from the con-
text, while relative pronouns directly copy some term specifically identified in the
unfolding structure, i.e. that provided by the head. Because of this, and because
of the properties of merge which merely unifies the information associated with
two constructed treenodes, there is no record in the output that the string that
gives rise to the structure contains a relative pronoun. Indeed, as we shall in later
chapters, the structure given in (3.14) is exactly the same as the output that would
80 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
result in a successful parse of (3.15), where the LINK structure is projected by the
conjunction and and he is construed as John.
(3.15) John, and Sue likes him, smokes.
We will be exploring the consequences of this account of relative clauses through-
out much of the rest of this book, but before we do so, it is worth noticing a further
property of the analysis, associated with the concept of LINK. This has to do with
our account of left dislocation. The modality introduced for unfixed nodes by *Ad-
junction is, as we have seen, h↑∗ i (and its inverse h↓∗ i). This modality ranges over
the closure of ‘dominated by’, ↑ (or ‘dominate’ ↓) relations. It does not range over
LINK relations hL−1 i or hLi. This means that no unfixed node of this sort can be
merged with a position that is contained within some structure LINKed to the tree
into which it is introduced, as schematically shown in (3.16).
(3.16) Illicit merge:
∗T n(a), ?T y(t)
This provides us with a straightforward account of strong islands and precludes the
derivation of strings like those in (3.17), which exhibit violations of Ross (1967)’s
Complex NP Constraint.
(3.17) a. *Who did John, who likes, smoke?
b. *Sue, John, who likes, smokes.
3.1.4 Crossover
In (3.18) the word he cannot be construed as picking out the same individual as
the expression John, while, in (3.19), it can:8
(3.18) John, who Sue is certain he said would be at home, is in the surgery.
(3.19) John, who Sue is certain said he would be at home, is in the surgery.
When first observed by Postal (Postal 1970), this phenomenon was called crossover
to reflect the fact that analysing word order variation in terms of moving expressions
from one position to another, the wh expression is moved, ”crossing over” a pro-
noun, an effect which debars interpretation of wh, pronoun and gap as co-referring.
In more recent work such as Postal (1993), this restriction is still classified as a
mystery, and remains a major challenge to any linguistic theory.
The problem is that the data have to be seen, as in the original Postal analysis,
as splitting into at least two different phenomena, called strong and weak crossover,
and each is yet further subdivided into two further subcategories: ‘extended strong’
and ‘weakest’ crossover. In Lasnik and Stowell’s (1991) account, a discrete concept
of gap called a ‘null epithet’ is posited in addition to the array of empty categories
already posited at the time. There remains no satisfactory integrated explanation
within movement frameworks. Indeed it has become standard to treat these phe-
nomena entirely separately, all requiring very different analyses, as though such a
proliferation of discrete categories is not problematic (see e.g. Boeckx (2003), de
Cat (2002), for recent discussion). There have been a number of relatively recent
analyses in non-movement frameworks attempting to reduce the plethora of facts
to a single phenomenon, but it remains very generally intransigent in any account
which fails to reflect the linearity intrinsic to the distribution.9
The extent of the puzzle can be traced directly to the static methodology of the
jigsaw view of language. The problem arises in that type of framework, because the
observed patterns are analysed exclusively in terms of the hierarchical relationship
between a pronoun and a posited empty position, ‘the gap’. All reference to the
dynamics of left-right processing is, by standard assumptions, debarred in syntactic
explanations of linguistic data (see chapter 1). According to this standard perspec-
tive, if pronoun and gap are both arguments and the pronoun c-commands the gap
(as (3.18) would standardly be analysed), an interpretation in which the pronoun
and wh-expression are taken to denote the same entity is not possible given binding
principle C. This is because that principle debars an empty category from being
interpreted as referring to the same entity as any c-commanding argument expres-
sion whatever, the gap supposedly having name-like properties which require it to
be free.10
8 We cite these here without any indexing indicating the licensed interpretation, contrary to nor-
mal methodology, as these sentences are fully well-formed. It is simply that some interpretations
are precluded.
9 See Hepple (1990), which requires an account of all cross-sentential anaphora as entirely
distinct; and Safir (1996), which the author himself notes fails to extend appropriately cross-
linguistically. Dalrymple et al. (2001), Kempson and Gabbay (1998), Kempson and Meyer-Viol
(2002), Shan and Barker (forthcoming), by contrast all invoke concepts of linearity as an integral
part of the explanation.
10 Incidentally, this observation of unique binding of the gap by the wh expression, which is
construed as an operator, guarantees that, despite the adoption of predicate-logic forms of binding
as the working metaphor for LF representations, nevertheless the “binding” of the gap by the
associated wh has to be distinct from a quantifier-variable binding operation because quantifiers
can bind an arbitrary number of variables, hence the specially defined concept of a “syntactic
operator” (Chomsky 1981, Koopman and Sportiche 1982).
82 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
This principle-C style of analysis will not apply if the pronoun precedes but does
not c-command the gap, because the filter will not apply. Crossover environments in
which the pronoun is contained in a complex determiner are accordingly analysed as
a distinct phenomenon “weakest crossover”, since under some circumstances these
are able to be construed as co-referring as in (3.20).11
(3.20) Johni , whoi Sue said hisi mother worries about ei , has stopped working.
The surprise for this form of analysis, in the standard terminology, is that if the
pronoun c-commands the gap but the binder is complex, then judgements of gram-
maticality do not coincide with strong crossover cases as might be expected. In-
stead, they pattern with those observed of weak crossover in allowing co-referring
interpretations, despite their classification with the much stronger ‘strong crossover’
restriction, the so-called ‘extended strong crossover’ restriction (3.21):12
(3.21) Johni , whosei motherj hei worries about ej far too much, has stopped
working.
The parallelism in the acceptability judgements of examples such as (3.20) and
(3.21) is entirely unexplained. It is problems such as these that led Lasnik and
Stowell to posit the “null epithet” as a further form of empty category needed for
what is itemised as ei in (3.20). This behaves, not like a name as the principle C
account would expect, but like a pronoun, hence subject to principle B. But positing
an additional type of empty category just adds to the puzzle (see Safir 1996, 1999).
Whatever the merits of the analysis in providing new theoretical constructs that
are compatible with the data, it is clear that this does not constitute anything like
an explanation of how anaphoric processes and long-distance dependency interact;
and indeed the phenomenon has never been addressed by syntacticians at this level
of generality.13
With a shift into a methodology which pays close attention to the way in which
interpretation is progressively built up within a given structural context, the ac-
count is, by comparison, strikingly simple. Using steps introduced in chapter 2 and
section 1 of this chapter, the emergent structure of (3.18), which we now take in
detail, can be seen as built up following a combination of computational and lexical
actions. The expression John is taken to decorate a fixed subject node, and, from
that, a LINKed structure is projected by LINK Adjunction which imposes the
requirement to find a copy of F o(John′ ) in this structure, as we have seen. An un-
fixed node is constructed within that newly introduced structure via *Adjunction
which is duly decorated with that formula by the lexical actions associated with
parsing the relative pronoun. Subsequent parse steps then lead in succession to the
setting up of a subject node, and its decoration with F o(Sue′ ); the introduction of
a predicate-requiring node (decorated with ?T y(e → t)); the construction of a func-
tor node (decorated with F o(Certain′ )) and its attendant second-argument node
11 In this and the following examples, we revert to the standard methodology of co-indexing to
indicate the relevant possible interpretation. In particular, we use the trace notation without any
commitment to such entities to indicate the position in the interpretation process at which the
unfixed node associated with the parsing of who will be resolved.
12 The debate is often discussed in connection with wh-question data, but here we restrict our
attention to crossover phenomena in relatives. See Kempson et al. (2001) for a fuller discussion
which includes wh-questions.
13 Boeckx (2003) and Asudeh (2004) are recent advocates of an integrated account of what they
both identify as resumptive pronouns, in the Asudeh analysis purporting to nest this as a mere
sub-case of pronominal anaphora; but, as things turn out, many of the relevant phenomena, in
particular the use of resumptive pronouns in English, they both set aside as involving a distinct
form of pronoun, “an intrusive pronoun”, thereby denying the possibility of the complete account
that is claimed.
3.1. LINKED STRUCTURES 83
(decorated with ?T y(t)). This leads to the construction of the embedded subject
node, with the lexical actions of the pronoun providing a metavariable F o(U) of
type e as decoration. The structure at this point in the parse is shown in the tree
display in (3.22).
(3.22) Parsing John, who Sue is certain he:
T n(0), ?T y(t)
h↑0 iT n(0),
T y(e), ?T y(e → t)
F o(John′ )
T y(e),
?T y(e → t)
F o(Sue′ )
T y(t → (e → t)),
?T y(t)
F o(Certain′ )
Given that all Kleene star relations include the empty relation, this description
applies to any formula decoration that is at the node itself (i.e. h↑0 ih↓0 i). So
substituting the formula F o(U) by F o(John′ ) as value is precluded, even though
the node from which some other copy of that formula has been established might
be in a suitably non-local relation (such as the matrix subject node in (3.22)).
Hence, merge is the only means of securing F o(John′ ) as the selected update
to the metavariable projected by the pronoun in the embedded subject position.
Yet this also turns out to be a precluded choice of strategy. This is because, if
merge is adopted at the point of having processed the pronoun, things go wrong
thereafter, because the unfixed node, being now fixed, is no longer available for
updating some later partial structure. In parsing John, who Sue is certain he
said would be at home, is in the surgery, the crunch point comes after parsing
the verb said. The word in the string that follows this is another verb requiring
(in English) a predicate goal to trigger its lexical actions. However, what follows
said is a propositional structure in which introduction and prediction apply to
derive subject and predicate goals, but with the pointer on the subject node. This
is decorated with the requirement ?T y(e). If no merge of the unfixed node has
previously occurred, then it can provide the necessary update, satisfying the type
requirement and the parse can continue. However, if the unfixed node has been
fixed earlier, there is no possible update to satisfy the type requirement and the
parse will abort.
Since substitution is precluded and merge of the unfixed node with the
embedded subject node in (3.22) leads inevitably to a failed parse, he in (3.18)
cannot be construed as being co-referential with John. The strong crossover effect
is thereby accounted for straightforwardly in terms in terms of the interaction of
updating different types of underspecification: tree position and content.
3.1. LINKED STRUCTURES 85
Notice how the difficulty here arises solely because the pronoun is construed as
picking out the doctor in question. If however, he had been interpreted as ‘Tom’,
or ‘Dick’, or ‘Harry’, etc., indeed anything other than the name picking out the
indicated individual John, then there would be no problem at any juncture in the
parsing process. The unfixed node with a term picking out John would still be
described as unfixed at the point at which the embedded predicate is reached, and
so at this point, this unfixed node could be updated appropriately. Hence it is only
interpretations of the pronoun as anyone other than John that lead to a well-formed
result in (3.18).
In (3.19), no such problem arises. This is because the process which merges
the unfixed node with the first embedded subject node occurs before the pronoun
is processed. So when the interpretation process reaches that point, there is no
problem interpreting the pronoun. It can be interpreted as John because there is
no local node now decorated with F o(John′ ), because the information associated
with the unfixed node, now fixed, is no longer being passed down the tree. The
tree display in (3.23) illustrates this process schematically up to the point where
substitution applies to the metavariable (shown by the double uparrow, ⇑, as
regularly from now on).14
(3.23) Parsing John, who Sue is certain said he:
T n(0), ?T y(t)
h↑0 iT n(0),
T y(e), ?T y(e → t)
F o(John′ )
T y(e),
?T y(e → t)
F o(Sue′ )
T y(t → (e → t)),
?T y(t)
F o(Certain′ )
T y(e), F o(U), ♦
⇑ ?T y(e → t)
F o(John′ )
The answer to the question as to why the pronoun can be identified with the
wh relativiser in both (3.20) and (3.21) now reduces to the question of whether the
14 Notice that the figure is purely schematic, and is intended to show the main steps in the
derivation, up to the point at which the pronoun is parsed. It does not show strictly just the
structure at the point at which substitution takes place, since by this time the unfixed node
would be fixed properly in the relevant position.
86 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
node decorated by the pronoun is relevant to identifying the fixed tree position for
the unfixed node. The simple answer is that it is not, in either case. In (3.24), if the
pronoun, he, is identified as having a formula value identical to the wh expression,
whose, by virtue of being interpreted as a copy of the head, this has no consequences
for identifying the tree position of the unfixed node. This follows because the wh
expression projects merely a subterm for the formula decorating the unfixed node.
That is, construal of he as ‘John’ cannot be the result of establishing the tree
position for the node annotated by a formula constructed from whose mother.
(3.24) Johni , whosei motherj Sue says hei worries about ej far too much, is at
the hospital.
Not only will consistency of node annotations rule out any application of merge;
so too will the terminal node restriction imposed by the pronoun. To similar effect,
in the case of weak crossover, no node internal to a determiner can provide an
update for the unfixed node projected by the relative pronoun who. In this case,
with the wh expression not being complex, the account turns on the analysis of
genitive pronouns such as his. These might be analysed as defining a complex
operator introducing a metavariable as part of its restrictor without assigning it
any independent node, in which case the terminal node restriction imposed by
the wh pronoun will fail to be met. Alternatively, his might be analysed along
with more complex genitive constructions as projecting a distinct linked structure
whose internal arguments cannot provide an update for a node unfixed within the
structure initiated by the relativising particle.15 Either way, as with extended
strong crossover effects, there will be no interaction between the construal of the
wh expression and the pronominal, and so an interpretation of the pronoun as
picking out the same individual as is picked out by the head is available:16
(3.25) Johni , whoi Sue said hisi mother was very worried about ei has stopped
working.
The short version of this rather long story about crossover is that we get
crossover effects whenever there is interference between the updating of an unfixed
node and the updating the underspecified term provided by a pronoun. When-
ever no such interference is possible, for whatever reason, the interpretation of
the pronoun is free. We thus have the beginnings of an account of crossover data
that spans strong, weak and extended strong crossover data without any grammar-
internal stipulations particular to these sub-cases: the different forms of interpre-
tation available emerge solely from the interaction of the process of building up
structure for the construal of the relative and the process of establishing a value
for a pronoun.
This kind of feeding relation between processes of construing strings displaying
long-distance dependency and the construal of a pronoun is extremely surprising
if the problem of pronoun construal is taken as different in kind from the problem
posed by long-distance dependency (the “imperfection” problem – see chapter 1).
15 See Kempson et al. (2001: 144-148).
16 The analysis of genitives as ‘implicit relatives’ is well attested in the literature (Baker 1996,
Kayne 1994, etc). Notice that this form of analysis provides a basis for characterising their island-
like status: dependencies are in general not available between a pair of terms, one external to some
structure projected from a genitive marker and one within that structure. An alternative is to
analyse genitives as locally unfixed with respect to the node introduced from a ?T y(e) decoration.
This characterisation of a genitive as decorating an unfixed node will prevent application of some
unfixed node as putative argument of a genitive imposed structure, as the appropriate tree update
will not be available (see chapter 6 for discussion of Local*Adjunction in connection with
Japanese).
3.2. QUANTIFIER PHRASES AND RESTRICTIVE RELATIVE CLAUSES 87
Indeed, this feeding relation can only be modelled directly if we assume that the
interpretation of both involves updating tree structures on a left to right basis
with decorations leading to constructed logical forms. So the explanation turns on
treating pronoun construal and long-distance dependency as two sides of the same
coin: both project aspects of underspecification in a structure building process.
It also provides our first case where explanation of a syntactic distribution has
invoked intermediate steps in the interpretation process. The distribution cannot be
defined over the input to the relative construal process (trivially, since the input is
on this assumption merely a single top-node requirement); but nor can it be defined
over the output structure. What is critical is the dynamics of how some interpre-
tation may be licensed only if a certain kind of choice is made at an intermediate
step.
The significance of this account is that there has not been any need to define
strong, weak, weakest, or extended strong crossover, as discrete sets of data, to
be analysed in terms of different structures requiring quite different analyses.17
This is quite unlike most analyses, in which weak crossover is uniformly seen as a
separate phenomenon from strong crossover, not even requiring a related basis of
explanation; and the generality of the underlying explanation as a consequence of
the feeding relation between two simple and independently motivated processes is
missed altogether.18
T y(cn → e)
T y(cn)
QUANTIFIER
This tree shows how quantified noun phrases, although taken to project terms of
type e, are nevertheless associated with internal structure, indeed, more or less
the structure that one would expect on any standard account of quantified noun
phrases. Given our perspective that what is built by syntactic processes is semantic
structure for a resulting logical form, this may seem surprising. How can this be
what is wanted for quantifying expressions?
To see what is going on here, we need to start from the stance that others
adopt about quantification, and the internal structural properties of noun phrases,
in order to see to what extent the Dynamic Syntax account is different. The point
17 Wehave not discussed crossover in WH-questions: see Kempson et al. (2001: 213-216).
18 This
is not to deny that the challenge of reducing the plethora of crossover types to a single
phenomenon is not recognised by some (Hepple 1990, Dalrymple et al. 2001), but without the
dynamics of intermediate stages in the update process, no fully integrative approach is possible.
88 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
of departure for all formalisms is that predicate logic, in some sense to be made
precise, provides an appropriate underpinning for representing the semantic prop-
erties of natural language quantification. The existential and universal quantifiers
of predicate logic give the right truth conditions, at least for the analogous natural
language quantifiers, every and the indefinite singular a/some. However there is
an immediate problem, because, at least on the face of it, the syntax of predicate
logic and the syntax of natural languages are not alike. In predicate logic, the
quantifiers are propositional operators: they take open propositional formulae, and
bind variables in them to yield a closed propositional formula. Moreover, the open
formulae are complex and require different connectives depending on the quantifier
selected. So, the existential quantifier is associated with conjunction (∧) and the
universal with material implication (→). The predicate logic translations of (3.27a)
and (3.28a) are thus (3.27b) and (3.28b), respectively.
(3.27) a. Every student laughed.
b. ∀x(Student′ (x) → Laugh′ (x))
(3.28) a. A lecturer cried.
b. ∃y(Lecturer′ (x) ∧ Cry′ (x))
Considered in terms of its combinatorial properties, in other words its logical
type, the quantifier ∀ is accordingly of type t → t, an expression that maps proposi-
tional formulae into propositional formulae. This makes it structurally quite unlike
quantified expressions in natural languages, which fill argument positions just like
other NP expressions.
The move made in Montague semantics and all formal semantic formalisms fol-
lowing that tradition (Montague 1974, Dowty et al. 1981, Carpenter 1998, Morrill
1994, etc.) is to retain the insight that the predicate logic formula for a univer-
sally quantified statement expresses the right truth conditions for a sentence such
as Every student laughed, but recognise that the words of English are not mapped
onto the predicate logic formula directly. Montague’s proposed solution is to define
the types of every, student and laughed so as to make sure that the determiner
may combine first with the noun, and then with the predicate (as its argument) to
yield a propositional formula. Since student and smoke are (one-place) predicates
according to standard assumptions of predicate logic, this means defining every as
having a high type ((e → t) → ((e → t) → t)). When such a type combines with
a noun the result is an expression of type ((e → t) → t) which may combine with
a (one-place) predicate, expressed by some verb phrase, to yield a propositional
formula (of type t). The derivation of the formula in (3.27b) proceeds as follows:
Expression Type Translation
every ((e → t) → λQλP ∀x.Q(x) → P (x)
((e → t) → t))
student (e → t) Student′
every student ((e → t) → t) (λQλP ∀x.Q(x) → P (x))(Student′ )
≡ λP ∀x.Student′ (x) → P (x)
laughed (e → t) Laugh′
every student laughed t (λP ∀x.Student′ (x) → P (x))(Laugh′ )
≡ ∀x.Student′ (x) → Laugh(x))
The details are not in fact important here.19 A significant consequence, how-
ever, is that the noun phrase subject in particular is no longer the last argument
19 See the introductory semantics textbooks for discussion, Dowty et al. (1981), Gamut (1991),
that the predicate combines with to yield a propositional formula. Instead, it is the
verb phrase which provides the argument: the functor/argument roles of subject
and predicate are flipped from what they might be expected to be. An advantage
of this move is that the semantics of universally quantified sentences of English can
be preserved as being that of the corresponding predicate logic formula, while pre-
serving the parallelism with syntax that all noun phrases can be analysed as having
a single type. However, its disadvantage is that the argument-predicate roles in the
case of the subject to some verb are different from those of all its other arguments
(unless the type of verb, and everything else accordingly is driven up to increased
levels of complexity of type specification).20 The general problem underpinning
this is the methodology it presumes, which might well be called “preparing for the
worst”. That is, the analysis is being driven by the example which displays the
need for greatest complexity.
However, there is an alternative point of departure for looking at NPs in natural
languages, and that is to look not at predicate-logic formulae, but at predicate-logic
proofs. In particular, given the goal of modelling the way in which information is
incrementally established in natural language, it is instructive to look at the clos-
est analogue to natural language reasoning. This is the natural deduction proof
method, in which valid inference is defined through the step-by-step process by
which some conclusion is established from a set of premises. Natural deduction
proofs for predicate logic all follow the same pattern: there are licensed steps for
getting rid of the quantifiers, replacing them with so-called arbitrary names sub-
stituted in place of the variables. The body of the proof is then carried out with
these names as illustrated in (3.29).21
20 There have always been problems in manipulating the higher type ((e → t) → t) to non-
subject noun phrases, as the predicate with which it is defined to combine is never available. The
problem of type-raising is further compounded if things like plurality are properly incorporated
in the system.
21 There is a regular objection to the move to use arbitrary names for natural-language se-
mantics. It is that quantifiers such as most are essentially generalized quantifiers, for which no
proof-theoretic account is available. Only the generalized quantifier analysis has the appropriate
generality to be applicable to all noun phrases. This, however, is not an appropriate form of
criticism to the proposed analysis. In the DS system, it is the progressive compilation of the
logical form which requires all terms projected by noun phrases to be of type e. (This style of
natural deduction proof is due to Prawitz (1965). For an introduction to logic in terms of such
natural deduction, see Lemmon (1965).) The logical forms that result are then subject to a step
of evaluation, as we shall see shortly; and in this step, one might define a step onto a generalized
quantifier formula, e.g. in cases involving most. The system as it stands is restricted by the
expressive power of the selected calculus, but in principle the assumption that computation of
semantically relevant structure is via the working assumption of type e terms has more general
application.
90 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
it is this we are going to adopt (Hilbert and Bernays 1939). For a recent study, see Meyer-Viol
(1995).
3.2. QUANTIFIER PHRASES AND RESTRICTIVE RELATIVE CLAUSES 91
of such terms and, in particular, the scopal relations between terms. The latter
involves collecting scope statements at a propositional node and then evaluating
the propositional formula of that node with respect to these. This is the topic of
section 3.3. Here we are concerned with the process of building up the logical forms
of the terms from the words.
As noted above, what we need as a projection of structure for a noun phrase is
a complex form of name, of type e but containing within its structure a quantifier
node to carry the information about the kind of quantification to be projected, a
node to be decorated with the restrictor predicate, Man′ , Book′ , etc., and another
node for a variable, itself also of type e, as set out in (3.26).
In the process of constructing such a tree, a determiner projects the variable-
binding term-creating operator, in the case of the indefinite, a promissory note for
an epsilon term (what this is we shall take up shortly), shown as a lambda term
binding a variable P of type cn. There is also a classification feature Indef (+).
This is shown in the lexical actions associated with the indefinite article in (3.30).
The tree growth associated with this is shown in (3.31) which begins a parse of A
lecturer cried:24
IF ?T y(e)
THEN put(Indef (+)); make(h↓1 i); go(h↓1 i);
(3.30) a put(F o(λP.(ǫ, P )), T y(cn → e), [↓]⊥);
go(h↑1 i); make(h↓0 i); go(h↓0 i); put(?T y(cn))
ELSE Abort
(3.31) Parsing a:
T n(0), ?T y(t) 7→ T n(0), ?T y(t)
T y(cn → e),
?T y(cn), ♦
F o(λP.(ǫ, P )), [↓]⊥
distinguishes between indefinites which are relatively unconstrained, and non-indefinites, which
follow linear order more closely, see section 3.3. Such features are used as sparingly as possible
here and are, given our commitment to the hypothesis that syntactic processes create semantic
structures, merely promissory notes for a semantic characterisation that needs to be made explicit.
25 The sequence of actions that achieves the ‘freshput’ instruction is a rather primitive device
scanning the set of variables already associated with a scope statement, and aborting the lexical
action for each one that corresponds to such a variable. For a full account, see Kempson et al.
(2001) ch.7.
92 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
IF ?T y(cn)
THEN go(h↑0 i); put(?SC(x));
go(h↓0 i); make(h↓1 i); go(h↓1 i);
(3.32) lecturer
put(F o(λy(y, Lecturer′ (y))), T y(e → cn), [↓]⊥);
go(h↑1 i); make(h↓0 i); freshput(x, F o(x)); put(T y(e));
ELSE Abort
There are a number of things to notice about this complex of actions. In the first
place, it imposes on the dominating ?T y(e) node a new requirement ?SC(x) which
is satisfied just in case a scope statement is constructed at some point in a parse.
(We will discuss this aspect later in section 3.3.) Secondly, the content of the
common noun, lecturer, is shown as a complex lambda term λy(y, Lecturer′ (y)),
which binds the fresh variable that it introduces on the internal type e node. The
effect of parsing the first two words of A lecturer cried is thus as shown in (3.33).
T y(cn → e),
?T y(cn)
F o(λP.(ǫ, P )), [↓]⊥
Steps of Completion and Evaluation (see chapter 2) will apply from the
bottom of the tree in (3.33) to yield a F ormula decoration at the cn node, of
F o(x, Lecturer′ (x)), and again to yield the F ormula decoration F o(ǫ, x, Lecturer′ (x))
at the top type e node. The verb can now be parsed and, assuming a step of scope
creation to be discussed below, the final output tree in our example is that given
in (3.34).
(3.34) Parsing A lecturer cried:
T n(0), S < x, T y(t), F o(Cry′ (ǫ, x, Lecturer′ (x))), T ns(P AST ), ♦
T y(cn → e),
T y(cn), F o(x, Lecturer′ (x))
F o(λP.(ǫ, P )), [↓]⊥
The formula, F o(ǫ, x, Lecturer′ (x)), is called an ‘epsilon’ term which stands
for some arbitrary ‘witness’ of the set denoted by the restrictor, here Lecturer′ .
Although we will discuss the evaluation of such terms in more detail later in this
3.2. QUANTIFIER PHRASES AND RESTRICTIVE RELATIVE CLAUSES 93
T y(e → cn),
T y(e), F o(x)
F o(λy.(y, John’(y))
8. See also Kempson et al. (2001) where such a specification is taken to be a constraint on model-
theoretic evaluation of the resulting term. See also Cann (forthcoming b) for an interpretation of
such formulae as involving LINK structures.
29 Though in the case of names, with which we started, we simply presumed that the logical
the internal node equally to be available. This provides a natural basis for re-
strictive relative construal, since we have to hand a means of articulating further
propositional structures containing a copy of the variable.
Notice that our characterisation of the lexical actions associated with common
nouns, such as in (3.32), leaves the pointer on the node decorated by the variable
it introduces. There is nothing to prevent LINK Adjunction from applying at
this point to launch a propositional LINKed structure containing a requirement
to find a copy of this variable somewhere inside it. In parsing a string like that
in (3.39), there is a transition from the tree constructed after parsing a man and
before parsing who that gives the tree in (3.40).
(3.39) A man who Sue likes smokes.
(3.40) Parsing A man with LINK Adjunction:
T n(0), ?T y(t)
?T y(e) ?T y(e → t)
T y(cn → e),
?T y(cn)
F o(λP.(ǫ, P )), [↓]⊥
This rule of LINK Adjunction thus applies exactly as for non-restrictive forms
of construal. As before, what the rule does is to introduce a new LINKed structure
built from the node decorated by the variable as ‘head’, and at the top node of that
LINKed structure it puts a requirement for the occurrence of a copy of the very
variable projected by the noun.
There is then an intermediate step of *Adjunction, which then provides the
condition which enables the relative pronoun to provide the required copy (3.41),
the latter being indifferent to what formula decorates the head. There is thus no
need to define any ambiguity for the relative pronoun between restrictive and non-
restrictive uses.
96 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
T n(0), ?T y(t)
?T y(e) ?T y(e → t)
T y(cn → e),
?T y(cn)
F o(λP.(ǫ, P )), [↓]⊥
From this juncture on, the process of parsing the string is exactly as in simple
clauses. Nodes are introduced by the rules of Introduction and Prediction or
by lexical actions given by the verb, and a step of Merge duly takes place unifying
the unfixed node that had been decorated by the relative pronoun with some open
node, in a parse (3.39), the object node projected by parsing likes.
The only additional rule needed to complete a full parse of (3.39) is a second
rule of LINK evaluation for restrictive relative clauses that puts together the
information provided by the completed LINKed structure with the nominal predi-
cate to yield a complex restrictor for the variable at the cn node. The somewhat
complicated rule in (3.42) actually has a simple effect: it takes the propositional
formula (φ) decorating the LINK structure and conjoins it with part of the formula
decorating the T y(e → cn) node (ψ) applied to the variable, x, decorating the in-
ternal type e node (ψ(x)). This yields a complex restrictor of the variable because
the formula, φ, derived from the LINK structure contains a copy of the variable.
The final step, then, is to ‘abstract’ the variable to give a well-formed common
noun representation F o(x, φ(x) ∧ ψ(x)).30
(3.42) LINK Evaluation 2 (Restrictive construal)
struals, because the resulting decoration is of a T y(cn) node, and not a propositional one. Never-
theless, it involves conjunction of two propositions, one in the matrix tree and one derived from
the LINKed structure, thus sharing common properties.
3.2. QUANTIFIER PHRASES AND RESTRICTIVE RELATIVE CLAUSES 97
The effect of this rule in a parse of the subject noun phrase A man who Sue likes
is shown in the tree display in (3.44) (with much irrelevant detail omitted). Once
this nominal restrictor is compiled, here as the complex conjunction, the compound
type cn formula can be combined with the quantifying formula to provide the
specified epsilon term: F o(ǫ, x, Man′ (x) ∧ Like′ (x)(ι, z, Sue′ (z))), an epsilon term
that picks a witness from the intersection of the set of men with the set of things Sue
likes, as expected. The rest of the parse of A man who Sue likes smokes continues
as expected and we ultimately derive the propositional formula in (3.43)
(3.43) F o(Smoke′ (ǫ, x, Man′ (x) ∧ Like′ (x)(ι, z, Sue′ (z))))
?T y(e) ?T y(e → t)
F o(x) F o(Like′ )
ing from the movement of the wh expression to some (Spec)-CP position. See Chomsky (1981)
and many references thereafter. In Minimalist analyses, this movement of the wh expression it-
self may not be necessary, since in some analyses, movement is restricted to feature-movement.
Nonetheless, the underlying variable-binding operator analysis of relative clauses is retained, as it
is in essence in all other generative frameworks that we are aware of (Sag et al. 2003, Dalrymple
2001, Morrill 1994).
98 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
while English allows restrictive relative clauses without any overt relativiser this is
not true of non-restrictive relative clauses.32
(3.45) a. The student Bill admires won the class prize.
b. *The student, Bill admires, won the class prize.
The availability of restrictive relative construal without any relative pronoun is very
simple to define. We have to allow the analogue of a null complementiser, a phono-
logical free-ride that is defined to allow the free creation of a copy at some unfixed
node (introduced by successive steps of LINK Adjunction and *Adjunction)
as long as the node which constitutes the head is itself immediately dominated by a
cn-requiring node. The lexical characterisation in (3.46) is identical to that of the
relative pronoun which except for the extra condition that the head formula be in-
ternal to the type e sub-structure, so even in this null complementiser construction,
we sustain an anaphoric style of analysis.
IF ?T y(e), ?∃x.T n(x),
h↑∗ ihL−1 iF o(x)
THEN IF h↑∗ ihL−1 ih↑0 i?T y(cn)
(3.46) ∅relcomp
THEN put(F o(x), T y(e), [↓]⊥)
ELSE Abort
ELSE Abort
mismatch, given the display of semantic co-ordination properties, but syntactic subordination.
34 Relative clauses formed with that have an intermediate status. They are often reported to
T n(00),
F o(ǫ, x, Man′ (x)∧ ?T y(e → t)
Dislike′ (x)(ι, y, Sue′ (y)))
T n(000)
F o(λy.(y, Man′ (y)))
F o(x)
F o(x) F o(Like′ )
T nhL−1 iT n(00), ?T y(t), ?h↓∗ iF o(ǫ, x, Man′ (x) ∧ Dislike(x)(ι, y, Sue′ (y))), ♦
As can be seen from this tree, the projection of a LINK structure that provides a
non-restrictive construal involves building that structure from the higher of the two
T y(e) nodes in the term. Hence, this can only be introduced once this higher type
e node has been completed and decorated with a F o value. But this satisfaction of
the type requirement on this node only takes place if the determiner node and cn
node have been fully decorated, with their type requirements fulfilled. Of necessity
35 Scope information is not specified here, but in fact there would be a scope statement associated
with the type e term immediately prior to the introduction of this second LINKEd tree. See section
3.3
100 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
then, this step must follow any process which involves constructing and completing
some LINKed structure introduced as a transition defined on some T y(e) node
dominated by a cn node and decorated by a variable.
The pointer can never return to some daughter of the Ty(cn) node from the top
T y(e) node, so no construal of a relative clause restrictively can be achieved after
some clause that has a non-restrictive construal. Indeed, even if it could, the process
of constructing a LINKed structure at that later stage would involve retracting the
decorations on the cn and higher type e nodes, in order to introduce the newly
added modification from this latterly introduced LINKed structure. But this is
independently precluded: all building up of interpretation is monotonic, involving
progressive enrichment only. The only decoration that can ever be removed from a
node is a requirement, once that has been satisfied – nothing else.
Given that this linear-order restriction emerges as a consequence of pointer
movement, and not from any idiosyncratic decoration on the individual nodes, we
would expect that it would apply equally to all noun phrases, indefinite and non-
indefinite alike. And indeed it does:
(3.49) a. Every interviewer you disliked, who I was on good terms with, liked
our cv’s.
b. *Every interviewer, who I was on good terms with, you disliked,
liked our cv’s.
Finally, contrary to a conventional implicature analysis, on the present analysis,
we expect that non-restrictively construed relative clauses can contain antecedents
for subsequent pronominals:36
(3.50) I saw a man, who ignored a friend of mine. When she hit him, he
continued to ignore her.
The data present no more of a problem than regular cross-sentential identification
of a pronoun with some epsilon term as antecedent, leading to the construction
of a new extended epsilon term in the logical form constructed for the sentence
containing the pronoun. As we would expect, this pattern may also occur within a
single sentence:
(3.51) A parrot, which was loudly singing a song, appeared to understand it.
So, given an analysis of all noun phrases as type e terms, and the processing of
building up paired linked structures, the distinction between restrictive and non-
restrictive forms of construal is remarkably straightforward to reflect. There are
no hidden mysteries of a discrete level of structure, in some sense beyond logical
form, no structures which have to be assigned some unexpected status as filters
on content, and no radical difference in computation between the two forms of
construal.
definition a conventional implicature is a filter on content, and not part of it. Yet for the pronoun-
antecedent relation to be established, the content of the conventional implicature must have be-
come part of the context, requiring a specific accommodation process.
3.3. QUANTIFIER SCOPE AND ITS EFFECTS 101
which collects scope-relevant information during the construction process, and the
subsequent scope-evaluation process (see Kempson et al. (2001) ch.7 for details).
Scope information takes the form of statements of the form x < y, which is short-
hand for “a term with variable x has scope over term with variable y”. These
statements are collected at the local node requiring a formula of type t as they are
made available by words and their actions during the parsing process.37 Once a
propositional formula of type t is derived at the top node of some local tree, this
node will have a pair of (i) a scope statement of the form Scope(Si < x < . . . )
and (ii) a formula of type t. (i) and (ii) together will then be subject to a scope
evaluation algorithm which spells out what that pairing amounts to. We take this
lexically driven route because there is lexical variation in how much scope freedom
the terms projected have, which comes primarily from the determiners (but may
also come from verbs). We begin with a discussion of the major problem case:
the indefinites, noun phrases containing determiners include such as a, some, all
numeral expressions, and many.
This asymmetry between individual quantifiers is a puzzle for all standard ac-
counts of quantification. On the basis of the freedom with which indefinites can
be construed, it has been assumed that this will have to be a general process:
since quantifier scoping has to be defined over a propositional domain, it cannot be
treated as a lexical property. All quantifiers, accordingly, are said to be subject to
such scope variation, with a general process of either quantifier raising (in the syn-
tax, e.g. May (1985)) or quantifier storage (in the semantics, e.g. Cooper (1983)).
But this then leaves as a complete mystery as to why there should be variation
between individual quantifiers in the first place.
One response to this is to suggest that indefinites are ambiguous between a
regular quantifier and some kind of name, referring to a particular individual (or
group) taken to be picked out by the speaker. Such an analysis has the advantage
of leaving undisturbed a generalised quantifier analysis of a sub-class of indefinites,
analysing the others as a specialised naming device. On such an analysis, there
would be no need of a general process of quantifier-storage. However, as observed
by a number of people over the years (Farkas 1981, Cormack and Kempson 1991,
Abusch 1994, Winter 1997, Reinhart 1997), such an ambiguity account fails to
capture cases in which the indefinite can be construed as taking intermediate scope,
with scope over the clause or noun phrase within which the indefinite is contained
but nevertheless still interpreted as within the scope of some preceding quantifier:
(3.56) a. Every professor insisted every student of theirs read a recent MIT
thesis.
b. Each student has to come up with three arguments that show that
some condition proposed by Chomsky is wrong.
Each of the examples in (3.56) allows an interpretation, as one amongst several, in
which the indefinite in the embedded clause can be understood as taking broader
scope than the nearer of the two quantified expressions preceding it, but narrower
scope than the subject quantifier. For example (3.56a) allows as one possible inter-
pretation: ‘For each professor there is a recent MIT thesis that the professor insisted
that their students read’. So the scope relation is every professor < a recent MIT
thesis < every student: one MIT thesis per professor for all their students, but not
the same thesis. So for these cases, in any event, indefinite expressions must appar-
ently be analysed as expressions that function as quantifiers taking scope over some
arbitrarily larger domain. Analysing a sub-case of interpretations of indefinites as
name-like does not solve the problem.
If, however, we look at this phenomenon from a processing perspective, then
there is an alternative account in terms of interpreting the indefinite relative to
its context, which is naturally expressible only if we analyse quantified expressions
as a form of name, initially underspecified. We can get at this alternative way of
looking at the problem by asking how ambiguity displayed by (3.56a) arises. How,
for example, does it differ from a simpler structure such as (3.57)?
(3.57) Every student is reading a recent MIT thesis.
One obvious answer is that (3.56a) has three quantified expressions, (3.57) only
two. The clue, we suggest, lies exactly in this.
The ambiguities in (3.57) are straightforward enough. Either the second quan-
tifier is interpreted relative to the first, or the first quantifier is interpreted relative
to the second. This is the standard way of describing this phenomenon. However,
there is another way of expressing this observation, by expressing the dependence
solely in terms of the indefinite. Either the expression a recent MIT thesis in (3.57)
3.3. QUANTIFIER SCOPE AND ITS EFFECTS 103
ing the construal of individual members of the set quantified over independently of the island in
which the indefinite expression is contained. However, in (i), given the disambiguation provided
by the presented elaboration, wide scope potential for the indefinite seems possible ((ii) is Ruys’
example):
(i) If two of my sons ever help me do the washing up I buy them a beer by way of thanks. If they
both do, I get out the champagne.
(ii) If two of my uncles give me a house, I shall receive a fortune.
104 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
It might seem surprising that choice of scope should be said to parallel choice
of pronoun construal, because cataphoric interpretations for pronouns in English
are generally not available in English, yet scope inverted interpretations surely
textitare.39 We can interpret (3.61) as meaning that all the patients got interviewed,
but not necessarily all by the same nurse because of what we know about the
burdens of the national health service:40
(3.61) A nurse interviewed every patient.
Yet he in (3.62) cannot be interpreted as picking up on its interpretation from the
following occurrence of John:
(3.62) He knows that John is clever.
This is certainly true; but this is not the end of the matter. There are the expletive
pronouns, which are interpreted in precisely a forward-looking manner:
(3.63) It is likely that I’m wrong.
The it in (3.63) is construed as identical to the interpretation assigned to the fol-
lowing clausal expression I’m wrong. In the literature, such pronouns are analysed
as completely different phenomena, the so-called expletive pronouns, unrelated to
their anaphoric counterparts. However, there is reason to think that they are nev-
ertheless but a subcase of the pronoun it, a matter we come back to later (chapters
5 and 8). Here it is sufficient to know that cataphoric processes of anaphora are
available subject to certain locality constraints.41
There is further evidence that allowing indefinites to depend on some quantifier
introduced later in a sequence is the right move to make. Analysing the apparent
reversal of scope of a pair of quantifiers when the first is indefinite and the second
non-indefinite provides a solution to what is otherwise a baffling form of variation
in the scope construal of non-indefinites poses. The puzzle is that quantifiers such
as most appear to give rise to wide-scope construal when they follow an indefinite
noun phrase, but not when they follow a non-indefinite noun phrase. In (3.53), for
example, we observed that the expression most good scripts may take wide scope
over two examiners but nevertheless not wide scope over everyone:
(3.53) Everyone agreed that two examiners marked most good scripts.
Beside that, however, the interpretation of (3.64) generally follows the linear order
in which the quantified expressions occur. The only natural interpretation of (3.64)
is as an assertion about all examiners that each marked most scripts:
(3.64) All examiners marked most scripts.
39 Apart from expletives, cataphoric effects may be expected if there are grounds for analysing the
node the pronoun decorates as unfixed, since there is a later stage at which the needed substitution
operation can apply. But this is not the operative consideration in simple SVO sequences.
40 These scope inverted interpretations are regularly said not to be preferred, but nevertheless
agreed to be available.
41 There are also cases of cataphora involving initial adjunct clauses, often generically interpreted
And, conversely, the only natural interpretation of (3.65) is an assertion about each
member of some larger proportion of scripts that all examiners checked it:
(3.65) Most scripts, all examiners marked.
If we analyse all quantified expressions as having a wide-scope potential, these
facts are mysterious, for sometimes quantified expressions containing most seem to
allow interpretations in which they take scope over other quantified expressions,
but sometimes not. On the other hand, the puzzle is resolved if we analyse all
scope inversion for pairs of quantified expressions as an idiosyncracy of indefinites:
that they can be interpreted as taking scope narrower than some term, and that
in some contexts this can be constructed from an expression following them in
the sequence of words. Most itself provides no potential for being interpreted
outside the immediate structural context within which it is processed. It is only the
indefinite which provides an underspecified term, with a choice of scope dependency
that ranges over any term made available during the construction process.42
In any case, the analysis that is being proposed is not that indefinites and pro-
nouns are identical in their process of interpretation, so the expectation is not for
absolute parallelism in their mode of interpretation. The claim is simply, more
weakly, that both have a context-dependent element to their interpretation: for
pronouns, it is the context that provides the value of the metavariable which these
pronouns project, for indefinites, it is the context of the proposition under con-
struction that provides a discrete term relative to which the relative scope of the
indefinite can be identified.
choice go hand in hand in all languages. If the language is very sensitive to linearity in its choice
of pronouns, then so is its choice of scope construal for indefinites. For detailed demonstration of
this see Kempson and Meyer-Viol (2004) and Kempson et al. (2001).
106 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
different to the other requirements that we have seen because, although it is non-
modal in form, it is satisfied only when a scope statement is established on the
most local propositional node. It should, therefore, more accurately be written as a
modal requirement to establish a relation in the Scope predicate with the variable
introduced by the common noun:43
?SC(x) =def ?h↑0 ih↑1∗ i∃y.Scope(y < x) ∨ Scope(x < y)
In general, we will continue to represent this complex requirement in its simpler
form, as ?SC(x). Once the information provided by the determiner and by the
common noun are put together, determiner-particular scope statements get added
to the local type-t-requiring node to satisfy this scope requirement. A sequence of
scope statements is thereby accumulated during the construction process at some
type node decorated by ?T y(t).
In general, quantifier scope follows linear order, so a quantifier processed first
on a left-right basis is presumed to have scope over a quantifying term that follows
it, with each scope statement successively added to the end of the sequence and
fixing its scope relative to what precedes. Indefinites, however, provide a systematic
exception to this general principle, and are lexically defined to have a more open
characterisation of scope effects, subject to choice.44 We can now express the
parallelism between indefinites and pronouns by defining an indefinite as adding to
the set of scope statements a relatively weak scope restriction that it be interpreted
as taking narrow scope with respect to some term to be chosen, represented by the
meta-variable U:
U < x
The value of a metavariable U as projected by the lexical specification of a pronoun,
is established in the structural context in which it occurs. Though these statements
are collected locally, the value of the metavariable U itself is not structurally re-
stricted, so a term may get entered into a scope statement for which there is no
corresponding occurrence in the formula itself. As a result the scope requirement
?SC(x) may be satisfied; but evaluation of the effect of that particular scope de-
pendency cannot be resolved until higher in the tree. In all such cases, this scope
statement gets progressively lifted up the tree until it arrives at a node where the
locally compiled formula contains an occurrence of this selected term.
In determining the value for U in a scope statement, there is one difference from
regular anaphoric processing in that there can be no possible indexical construal,
where the value for U is taken from some external context. The term under con-
struction is a quantifying term and must thus ultimately be subject to evaluation as
part of the overall proposition being constructed, as we shall see below. However,
apart from the constraint that the choice of value to be made for the metavari-
able must range only over other terms used in the interpretation of the string, it is
otherwise structurally unrestricted. This gives us a basis for characterising the ar-
bitrarily wide scope properties of indefinites, if we make the additional assumption
that representations of time are also taken as a term in the language.
43 Notice the modality h↑ ih↑1 i which is more constrained than h↑ i in pointing to a node that
0 ∗ ∗
can be reached by going up one argument node and zero or more functor nodes. See the locality
defined for Substitution in chapter 2, (2.64) on page 62.
44 Another exception is when the terms decorate a node which is not yet fixed in the structure.
Some languages allow quantified expressions to decorate such nodes, others do not. Those that
license such left-peripheral occurrence will allow scope statements to be entered, subsequent to
the step of merge which assigns the unfixed node, a definitive position in the tree. There is some
indication here of sensitivity to whether the process allows separation of the quantifying term
from that position across a propositional boundary. See chapter 7.
3.3. QUANTIFIER SCOPE AND ITS EFFECTS 107
Suppose that every formula of type t is of the form Si : ψ, with Si a term denot-
ing the time at which the formula ψ is said to hold.45 This is the one entry in the
scope statement which is assumed to be fixed independently. To reflect this, we also
modify the starting point ?T y(t) so that it also contains one term in the attendant
scope statement, namely Si , the constructed index of evaluation. Morphological
tense (or adjuncts) then add predicates on Si restricting its construal, so that what
we have been writing as T ns(P AST ) may be construed as T ns(Si < Snow ). With
this in place, the relatively unrestricted range of scope effects for indefinites fol-
lows immediately. If the value selected for the first argument of the scope relation
is some variable associated with a discrete quantifying determiner, we get narrow
scope interpretations, if it is chosen to be a term of the form Si which is itself not
construed as taking narrow scope with respect to such quantifying expressions, then
the indefinite will be construed as able to take wide scope relative to those quanti-
fying expressions.46 In (3.66), there is only one choice, this being that the indefinite
is interpreted relative to Si , the term representing the index of evaluation:
(3.66) A student fainted.
Notice the mode of interpretation for indefinites and pronouns do not project
identical forms of dependence on context. To start with, indefinites cannot be in-
terpreted as dependent on some term outside the process of construction. They are
in any case distinct forms of input: pronouns are not scope-inducing terms. Any
idiosyncracy distinguishing them is thus easily expressible in the distinct sets of
lexical actions.47 By contrast the parallelism between anaphora and indefinites is
inexpressible on accounts of quantification in model-theoretic terms, as generalised
quantifiers. There is no room to express under-specification of a generalised quanti-
fier. So the very smoothness with which the anaphoric-style of account of indefinites
and the analysis of all quantifying expressions as names goes together provides fur-
ther evidence for this analysis of the compilation of quantification structures in
type-e terms.48
To illustrate the process of scope construction, we will go through the relevant
steps in parsing (3.67) with narrow scope for the indefinite.
(3.67) A student read every book.
As we saw in section 3.2.1, parsing the determiner introduces the node for the
binding operator itself and the cn-requiring node, adding to the type-e-requiring
node the feature Indef (+). The actions associated with parsing a common noun
noun then introduce the two daughter nodes of the ?T y(cn) node: a T y(e → cn)
node decorated with the formula that expresses the concept associated with the
noun; and an internal type e node decorated with a fresh variable. It also adds
to the higher type-e-requiring node, as we have seen, the requirement for a scope
statement for that variable ?SC(x) interpreted as a modal requirement for the
variable to appear as part of the Scope predicate on some dominating node, without
45 Throughout this book, we make no attempt to formally address tense or mood construal. See
Perrett (2000) for an preliminary DS characterisation.
46 Unlike DRT, where wide scope effects for indefinites have been said to involve movement of
the discourse referent to the top box (Kamp and Reyle 1993), the assumption that scope of an
indefinite is a relatively unrestricted choice allows for intermediate interpretations in addition.
47 See Kempson et al. (2001) for full lexical specifications of the determiners every, a.
48 This lexicalised approach to projection of scope effects allows idiosyncratic specifications for
other determiners. For example, no should arguably be analysed as projecting a composite se-
quence of actions, the projection of a falsity operator at the closest type t node, and an epsilon
term defined to take narrow scope with respect to this operator. Since we do not address problems
of negation in this book, we do not define this here.
108 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
which the term cannot be completed. The non-terminal nodes on this subtree then
get accumulated in the normal way by Completion, Thinning and Evaluation
and the result at this point is (3.68):
(3.68) Parsing A student:
T y(cn), T y(cn → e)
F o(x, Student′ (x)) F o(λP.(ǫ, P ))
With the combination of properties accumulated at the higher T y(e) node through
parsing the determiner and he common noun, i.e. Indef (+) and ?SC(x), a set
of actions is now initiated that causes the pointer to move to the most locally
dominating node decorated with ?T y(t) and, because the term is marked as indef-
inite, adds the statement U < x to the Scope predicate at the top node to yield
Scope(Si , U < x).49
Because of the underspecification associated with the use of the metavariable,
the choice of which term the variable is dependent on is determined by pragmatic
factors. However, this choice is not as free as the analogous choice for pronouns:
what is being constructed is one of a number of terms all of which have to be inter-
preted relative to the particular propositional formula. So what the metavariable
ranges over is other fully determined terms in the construction process, at the point
in the construction process at which the choice is made, where “fully determined”
means a term with a fixed scope-dependency relation. For a wide-scope reading
of the indefinite, its scope relation can be fixed relative to the index of evaluation,
Scope(Si < x), immediately upon processing the expression itself. However, like all
other processes, this substitution process is optional, and so the scope relation can
be left open at this point, allowing it to be determined after some later term is intro-
duced, when the pointer is able to get back to the node where the scope statement
is by licensed moves, i.e. by moves associated with Completion, Elimination
and Anticipation.
Despite the fact that the scope requirement ?SC(x) is not yet satisfied, the
provision of a type e specification and a formula value has sufficiently completed
the parse of the indefinite expression for the pointer to move on (Completion only
needs a specification of a type value and is blind to other requirements that may
sit on the node, unlike Elimination). So the pointer can move to the predicate
node and proceed with the parse of the verb phrase, i.e. read every book. This is
shown in (3.69), again up to the point at which elimination applies to yield the
universal term as formula at the higher of the two type e nodes (notice where the
pointer is in (3.69):50
49 The details of how these scope actions are formulated is not going to be needed later, so these
we leave on one side. See Chapter 7 of Kempson et al. (2001) for formal details.
50 Recall that universal quantification is indicated by the tau (τ ) operator.
3.3. QUANTIFIER SCOPE AND ITS EFFECTS 109
T y(e), ?SC(y),
T y(cn), T y(cn → e)
Indef (−), F o(Read′ )
F o(x, Student′ (x)) F o(λP.(ǫ, P ))
F o(τ, y, Book′ (y)), ♦
Just like the indefinite, the created quantifying term has a feature determining
what kind of determiner it is, Indef (−), and also a scope requirement. The scope
requirement on this higher type-e node now needs to be satisfied, and this takes
place by the creation of a scope statement on the propositional node, thereby de-
termining the relation of the the term binding the variable y to the remainder of
the formula under construction. As the term is non-indefinite, its associated scope
action simply adds the variable y to the last term that has been entered into the
scope statement with a fixed scope dependency relation. This is the reflection of
the fact that the scope of non-indefinites is determined strictly linearly. However,
here is where the incompleteness of the scope specification for the previously parsed
indefinite makes a difference. Had the choice been taken previously for the indef-
inite to have wide scope over any later introduced term, the term projected from
the indefinite would have been construction-wise complete (with no requirements
outstanding), and this would have given rise to the scope statements:
Scope(Si < x, x < y) (= Scope(Si < x < y))
However, in the derivation leading to the inverse scope interpretation, the scope
of the indefinite was left undetermined so the specification of that term is not yet
complete. This means that there is only one possibility for fixing the scope of the
universal term; and that is to enter the variable y projected by the expression every
book into the scope statement as dependent on the one term in the sequence of scope
statements that does have a fixed value, and that is Si , the temporal variable. So
the scope statement associated with the term τ, y, Book′ (y) is Si < y. With a spec-
ified scope statement now on the local propositional node, the requirement ?SC(y)
(interpreted in the way specified above as a local modal requirement over the Scope
predicate and the variable y) is now satisfied, completing the specification of all re-
quirements on this node. Since there are no requirements outstanding on the object
node, Elimination can apply to to yield the predicate F o(Read′ (τ, y, Book′ (y))).
Completion moves the pointer back onto the propositional node and the value of
the metavariable U in the scope statement at the top node can be satisfied through
being identified as y, since the term containing y is at this juncture one of those
with a fixed scope statement. With this sequence of actions, the effect is wider
scope for the universal:
Scope(Si < y, y < x) (= Scope(Si < y < x))
110 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
which all occurrences of the expression under evaluation are replaced by the con-
structed name. The name itself then has exactly the same internal structure as this
compound formula (reflecting the epsilon/predicate-logic equivalence), except that
the name is replaced by the variable and the whole is prefixed by the appropriate
operator.
The formal definition of this algorithm is given in (3.76).
(3.76) Scope Evaluation Algorithm:
Formulae of the form:
φ(ν1 , x1 , ψ1 ), . . . , (νn , xn , ψn )
are evaluated relative to a scope statement:
hSi < x1 < . . . < xn , . . . , ti φ[ν, xn , ψn /xn ]
hSi < x1 < . . . < xn−1 , . . . , ti fνn ,xn ,ψn (φ),
where for x occurring free in φ and Si a (temporal) index, the values
fνxψ (φ), for ν ∈ {ǫ, τ, Q}, and fSi (φ) are defined by:
a. fτ xψ (φ) = (ψ[a/x] → φ[a/x])
where a = τ, x, (ψ → φ)
b. fǫxψ (φ) = (ψ[b/x] ∧ φ[b/x])
where b = ǫ, x, (ψ ∧ φ)
c. fιxψ (φ) = (ψ[c/x] ∧ φ[c/x])
where c = ι, x, (ψ ∧ φ)
d. fSi (φ) = (Si : φ)
In the case of the formula in (3.75), there is only a single possible scope statement
Si < x, so the algorithm proceeds in two steps. First we evaluate the epsilon
term (because x has narrower scope). By (3.76), we first construct (3.77b) from
(3.77a).
(3.77) a. hSi < xi Burn′ (ǫ, x, Cake′ (x))
b. hSi i fǫ,x,Cake′ (x) (Burn′ (x))
(3.76b) now applies to this to replace x in the argument position with the relevant
constructed term, at the same time conjoining this with the restrictor, again with its
instance of x replaced by the constructed term (3.78a) (= (3.75)). The evaluation
is completed by ‘discharging’ the index of evaluation to give the fully evaluated
propositional formula in (3.78a).
(3.78) a. hSi i Cake′ (a) ∧ Burn′ (a)
where a = (ǫ, x, Cake′ (x) ∧ Burn′ (x))
b. Si : Cake′ (a) ∧ Burn′ (a)
where a = (ǫ, x, Cake′ (x) ∧ Burn′ (x))
This example involves just one occurrence of the term under evaluation; but
in more complex cases the enriched term will replace all occurrences of the partial
term under evaluation. To see this, let us take a case of universal quantification
with a pronoun that is to be construed as bound by the quantifying expression.
A parse of the string in (3.79a) yields the propositional formula in (3.79b) with
two occurrences of the partial tau term, as it has been used as substituend of the
metavariable provided by the pronoun in the embedded clause.
3.3. QUANTIFIER SCOPE AND ITS EFFECTS 113
form of discourse-licensed parenthetical, merely adding incidental information about the object
described. See Fabb (1990), Safir (1996) who analyse them as involving some post-LF level of
LF’, a level whose status is not well understood, either formally or conceptually.
55 The observation that there is some form of semantic distinction between the two types of
construal is currently analysed using the only other conventional means of characterising content,
viz as a type of conventional implicature (Chierchia and McConnell-Ginet 1990 and Potts 2001).
On this analysis, the content of a non-restrictive relative is a filter on the projection of the primary
content, but not part of its resulting truth-conditions, an analysis which does not provide a basis
for explaining the interaction of anaphora and non-restrictive relative construal in many of the
data listed in this section.
3.3. QUANTIFIER SCOPE AND ITS EFFECTS 115
names:56
(3.84) a. Every referee, who I had personally selected, turned down my
research application.
b. Every parachutist, who the pilot had instructed meticulously, was
warned not to open his parachute too early.
c. Before I left, I managed to see most of my students, who I gave
something to read to discuss with me when I got back.
In these cases, it is the term under construction that is imposed as a requirement on
the linked structure, duly copied at the unfixed node by the actions of the relative
pronoun, and subsequently unified as an argument node in the predicate structure.
In (3.84b), for example, it is the tau term decorating the subject node that is
copied into the LINK structure, as shown schematically in (3.85).57
(3.85) Parsing (3.84b):
Si < x, ?T y(t)
T n(00), T y(e),
?T y(e → t)
F o(τ, x, Parachutist′ (x))
T y(e → (e → t)),
?T y(e), ♦
F o(Instruct′ )
Notice how in this case, the scope statement Si < x is assigned at the top node prior
to the construction of the LINK transition, since the type e term is provided. It is
then the combination of the LINK evaluation rule and a subsequent single process
of scope evaluation which ensures that the construal of the linked structure forms
a conjunction in the consequent of the conditional associated with the evaluation
of the tau term. The resulting logical form of (3.84b) is given in (3.86a) with its
fully evaluated form in (3.86b).58
(3.86) a. hSi < xi (Warned′ (τ, x, Parachutist′ (x)) ∧
(Instructed′ (τ, x, Parachutist′ (x))
b. Si : Parachutist′ (a) → (Warned′ (a) ∧ Instructed′ (a))
a = τ, x, Parachutist′ (x) → (Warned′ (x) ∧ Instructed′ (x))
56 On the assumption that wh-question words are place-holding devices (see Kempson et al.
2001), they will disallow non-restrictive modification, since the value to be assigned to the relative
pronoun must be some fixed Formula value, and not some open place-holder.
57 Much irrelevant structure is omitted from this tree display and the content of the definite the
warned not to open his parachute too early are simplified to Instructed′ and Warned′ , respectively,
for ease of exposition.
116 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
The LINK evaluation step for non-restrictive relative clauses thus has the effect
of a scope-extending device, despite being combined merely as the addition of a
conjoined formula. This is the result of having an incremental method of building
up scope dependencies as terms are constructed and an account of non-restrictive
relative construal as anaphorically inducing a copy of such terms into a LINKed
structure.
A further consequence of our analysis is that we expect that quantifiers in the
main clause should be able to bind pronouns in the relative clause (Safir 1996):
(3.87) a. Every nurse alerted the sister, who congratulated her on her prompt
reaction.
b. Every parrot sang a song, which it ruined.
In the processing of (3.87b), there are two terms under construction, (τ, x, Parrot′ (x)),
and (ǫ, y, Song′ (y)). The non-restrictive relative clause modifying the object thus
has a copy of the epsilon term within it. Nothing, however, precludes the tau term
from being identified as the antecedent to the pronoun, it in that relative clause
structure, as illustrated (again schematically) in (3.88) where the substitution
step is shown by ⇑.
(3.88) A possible parse for Every parrot sang a song which it:
T n(0), Si < x < y, ?T y(t)
T y(e),
?T y(e → t)
F o(τ, x, Parrot′ (x))
Hence, we predict the truth-conditional equivalence of the results (at least in the
singular indefinite case) despite the different procedures used to construct them.
We have, correctly, a processing ambiguity, but not one necessarily resulting in
denotational distinctiveness.
The two forms of subordination differ from co-ordination in that the latter is
constructed as two entirely independent structures, with, solely, possible anaphoric
links. The restrictive form of construal involves subordination in the sense of the
content derived from the LINKed structure being compiled in as a compound re-
strictor on the domain of the variable quantified over. And the non-restrictive form
of construal, as the intermediate case, involves subordination only in the sense of
the content derived from the LINKed structure being quantified over as a single
scope-binding domain, hence lacking the independence of quantification possible
with conjoined sentences.
3.4 Summary
Let us take stock of what we have achieved. We have an account of relative clauses
which provides a natural basis for distinguishing restrictive and non-restrictive rel-
ative clause construal, while nevertheless assuming a single rule for introducing the
structures which underpin these different types.59 Quantification is defined in terms
that allow maximum correspondence between the syntactic properties of quantified
noun phrases and the requisite scope-taking properties of quantifying expressions.
We have a preliminary account of crossover phenomena in relative clauses, which
shows how anaphoric and strict update processes can interact.60
59 Closely related to non-restrictive relative clause construal is the phenomenon of apposition:
of construal, as the data are said to be less clearcut with restrictive forms of construal. See
3.4. SUMMARY 119
In all cases set out so far, the syntactic properties of the structures have been
shown to follow from the process of tree growth. Long-distance dependency phe-
nomena, crossover phenomena, and restrictive and non-restrictive relative forms of
clause construal, have all been characterised exclusively in terms of the structure
representing the final logical form, the input provided by lexical specifications, and
the process of goal-directed tree growth that is defined over a sequence of update
actions. The different syntactic and semantic properties of expressions are seen to
emerge from the dynamics of how logical structures are built up in a time-linear
parsing perspective.
So, though incomplete in many respects, we have the first detailed evidence
that the formalism set out can be seen not just as a tool for modelling the process
of interpretation, but as a putative grammar formalism. We now turn to seeing
how these three strategies of building linked structures, building unfixed nodes, and
decorating nodes with underspecified metavariables can interact in different ways
to yield cross-linguistic generalisations.
Kempson et al. (2001) for a full discussion of crossover phenomena across relative clauses and
questions, and Cann et al. (forthcoming) for discussion of the problem of how best to characterise
borderline judgements of grammaticality.
120 CHAPTER 3. RELATIVE CLAUSE CONSTRUAL
Chapter 4
121
122 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
account, then, is to determine which forms of update are particular to human lan-
guage, which are particular to the individual language, and which are a consequence
of general cognitive constraints.
Our over-all aim in chapter 4 and 5 is to explain the interaction between
anaphoric construal and structure building processes, and left and right periph-
ery effects. The first step is relatively modest. We set out a preliminary partial
typology of restrictive relative clauses, focusing on head-initial relatives. We then
build on the partial parallelism between this and construal of left-peripheral expres-
sions to set out a typology of left-periphery effects, using the concepts of LINKed
structures, unfixed nodes and anaphora resolution. These areas have been subject
to intensive study, and, in this chapter, we aim merely to show that the DS tools
provide a natural basis for the gradient set of effects observable in left periphery con-
structions while retaining a unitary analysis of any individual pronoun. As things
turn out however, the feeding relations between the different tree-update processes
provide the basis for a more fine-grained characterisation of the data both in relative
clause and left-periphery constructions than a two-way distinction between struc-
tural and anaphoric forms of dependency is able to express. The principles remain
simple and general even though the interactions between the various constraints on
their implementation may give rise to overlapping tree-growth processes, giving the
appearance of complexity.
Then in the following chapter, we turn to right-periphery effects which, expletive
pronouns aside, have been the subject of much less study; and we shall show that
the same concepts of anaphorically pairing LINKed structures and building unfixed
nodes can be applied to rather different effect to capture generalisations about how
interpretation is built up at later stages of the construal process. This is a more
surprising result, as many right-periphery effects are subject to the much tighter
Right Roof Constraint, which in other frameworks remains a poorly understood
constraint, requiring structure-specific stipulation.
From there we shall turn to head-final languages and use the same tools all
over again, both to characterise word order effects in Japanese, and to capture the
various forms of relative clause construal. Thereafter, we shall turn to the Bantu
languages and look at the puzzle of co-ordination and mis-matching agreement data,
where again we shall see the influence of the dynamics of processing. The result
will be a rounding out of the typology of relative clauses to determine predictions
of both the limits on the type of processes that natural languages make available,
and on the type of variations to expect within that landscape. Right across the
different types of languages, asymmetries between left and right periphery effects
will be explained as consequences of the dynamics of on-line processing.
languages. Modern Irish glas does not map on to Modern English green, since part of its spectrum
includes grey (of animals).
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 123
Arabic, all related to the definite article. In the main, the Arabic we shall use as illustration is
Egyptian Arabic.
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 125
What these actions stipulate is that in the presence of a terminal type e node, a
LINK transition is constructed onto a new topnode with two requirements: one of
?T y(t) and an additional requirement for a copy of the formula decorating the head
node from which the LINK relation was built. This specification is notably like
the computational action of LINK Introduction as defined for English, but here
defined as a lexical action, triggered by the complementiser ?p illi. However, there
are differences. Firstly, this sequence of actions is restricted to applying to nodes
decorated only by variables, since the condition on action lists the bottom restric-
tion as a necessary condition, so it will only trigger restrictive forms of construal.
Secondly, this complementiser is restricted to occurring with definite noun phrases
and we stipulate a definite environment.3 Thirdly, the requirement for the copy
involves a modality not yet encountered. This is shown by the operator hDi which
ranges over both daughter, h↓i, and LINK relations, hLi, while its inverse, hU i,
conversely ranges over mother, h↑i and inverse LINK relations, hL−1 i. Because it
ranges over LINK relations this modality does not observe strong islands and thus
imposes no structural constraint whatever on where in the construction process the
copy head is to be found. Indeed, the required pronoun may occur across a relative
clause boundary, a classic island-restriction violation:4
(4.3) t
arrafna ?p ala l-muxrizh yalli laila sheefit l-masra#iyye
met1st.sg on the-director that laila saw3rd.sg.f em the-play
yalli huwwe ?p axraž-a [Lebanese Arabic]
that he directed3rd.sg.masc -it
‘We met the director that Laila saw the play that he directed it.’
The first consequence of defining ?p illi in this way is that, with no particular
lexical item defined in the language as projecting the required copy, the only way of
meeting it is to use the regular anaphoric process of the language, for this by defini-
tion involves a form which must be interpreted by some term taken from somewhere
else in the context. Hence, there must be some pronoun occurring at some point
in the subsequent string, and it must be interpreted as having an interpretation in
which its value is token-identical to that of the formula provided by the head node.
Any other interpretation of the pronoun (replacing the pronoun’s meta-variable
with some independently available term) will leave the LINKed structure with a
requirement outstanding, hence not well-formed. Any number of occurrences of a
pronoun may occur in the subsequent string: all that matters is that one of them be
interpreted as identical with the head from which the relative structure is induced.
The significance of this account is that unlike other characterisations of resump-
tive pronouns, this makes use of an entirely general account of pronouns. No special
resumptive form for the pronoun needs to be posited, and there is no invocation of
any “intrusive” pronoun either: the modal requirement does all the work. Despite
the emptiness of the locality restriction that this particular form of requirement im-
poses (using the least constraining hDi operator), the requirement still has the effect
of enforcing application of the general process of substitution during the construc-
tion process. In order to meet the well-formedness condition that no requirements
remain unsatisfied in the resulting logical form, the only possible sequences of tran-
sitions, given the non-anaphoric properties of the relativiser ?p illi, will be ones in
which at least one anaphoric device is assigned a construal which allows the modal
3 Notice again that this remains to be given a proper semantic characterisation, although if
tree in Arabic following upon the transition ensured through the use of ?p illi is no more than
a superficial puzzle, given the analysis of Arabic as a subject pro-drop language, with verbs
introducing a full propositional structure from a trigger ?T y(t), decorating the introduced subject
node with a metavariable. In cases in which an explicit subject expression is present in the string,
there may then be an application of *Adjunction – the subject value decorating the unfixed node
and merging with that given by the verb.
6 Demirdache (1991) and others have argued that there are crossover effects involving epithets.
However, it has been counter-argued, by Aoun and Choueiri (2000) that such cases constitute a
Principle C effect. We leave this issue on one side.
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 127
7 See Kempson et al. (2001) for a detailed, similar, specification of the required lexical and
computational actions.
128 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
F o(U), T y(e), ♦
F o(Darab′ ),
⇑
T y(e → (e → t))
F o(x)
It is notable in this regard that the same rule of LINK evaluation for restrictive
construals operates identically in both languages, yielding a construal of the noun
phrase in (4.5b) as (4.8).
(4.8) F o(ǫ, x, Mudarris′ (x) ∧ Darab′ (x)(ι, z, Magdi′ (z))).
they are often characterised as rescue devices (Sells 1984 etc.), but as these examples show, all
of which except (4.9b) are collected data, their occurrence is very far from being restricted to
such uses. Many of the examples quoted here are part of the collection made by Tami Kaplan
over a period of six years and we are grateful to her for allowing us to use them. See Cann et al.
(forthcoming) for detailed discussion.
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 129
acceptable if said with heavy stress on the pronoun, with the implication of contrast
between this term and some other term, which is not made explicit:
(4.11) a. John, who had to be the one who said we did very little teaching,
was roundly condemned.
b. ?John, who he had to be the one who said we did very little
teaching, was roundly condemned.
c. John, who he had to be the one who said we did very little teaching,
was roundly condemned.
But this is the extra implication which would warrant the use of a pronoun. Put
crudely, if you want to say something with contrastive implication, you need to
stress it, and you cannot stress silence. Under these circumstances, a pronoun
becomes fully acceptable, just as Relevance-theoretic assumptions lead us to expect.
Conversely, when subordinate structures have to be constructed, use of a pro-
noun seems more acceptable than their use in monoclausal sequences as in (4.9b),
(4.10b)), and, though there is difficulty in evaluating what constitutes an appropri-
ate level of difficulty that would warrant use of a pronoun, this might be taken to
indicate a level of complexity in parsing that makes the use of a pronoun more ac-
ceptable because of some clarificatory function. This too is along the lines that Rel-
evance Theory leads us to expect, this time the motivation being the minimisation
of effort by which the hearer is expected to establish the intended interpretation.9
Various other effects, both pragmatic and processual, may be involved in the
parsing of resumptive pronouns in English (see Cann et al. (forthcoming) for dis-
cussion). But this still leaves us with the puzzle of why speakers do not uniformly
judge them to be well-formed, if sentences such as these can indeed be accept-
ably used. We address the concepts of well-formedness and production in detail in
chapter 9, but in the mean time what is notable about these examples is that the
context which motivates a speaker’s use of such data and the context which the
hearer is able to bring to bear in interpreting the sentence may not be the same,
even though for both the string is processable. For example, a speaker may have a
particular entity in mind, hence in her own discourse context, for which the hearer
can only construct the appropriate term having gone through the parsing process,
so not having the analogous term in his own discourse context. As we shall see in
chapter 9, successful communication involves the parser and the producer building
structures that are in exact correspondence. This suggests that informants’ judge-
ments of lack of well-formedness are due to a recognition of mismatch between the
contexts assumed by speaker and hearer, hence an assessment that the conditions
for successful communication are not fulfilled.
Whatever the pragmatic/performance-based explanation of why these data are
peripheral in English, confirmation that the restriction on resumptive pronoun use
in English is indeed system-external and not a constraint to be encoded in the
grammar formalism comes, surprisingly, from construal of pronouns in subject po-
sition in Arabic. Despite the obligatory use of pronouns in non-subject position, in
subject position pronouns are generally avoided, and dispreferred. However, Ara-
bic displays a restricted freedom exactly similar to that more generally available in
9 Though we do not address the problem in detail, the fact that definite NPs can be used
resumptively is also not unexpected, since definite NPs can be defined, as suggested in chapter 3,
as a complex form of anaphoric device (see chapter 8):
(i) John came in. The poor dear was upset.
(ii) That friend of yours who the idiot had to be the one to admit that our teaching loads were
low, was roundly condemned.
(iii)John, who Sue tells me the poor dear is suffering dreadfully from overwork, is still at the office.
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 131
T y(cn → e),
T y(cn), F o(WH, Book′ (WH))
F o(λP.(ǫ, P ))
T y(e → cn),
T y(e), F o(WH)
F o(λy.(y, Book′ (y)))
discussion of ellipsis and Kempson et al. (2001) chapter 5 for a discussion of questions using a
slightly different set of assumptions.
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 133
the lack of resumptive pronouns in the language. In these, the explanation of the crossover phe-
nomenon will be due solely to the locality constraint imposed on possible values for the pronoun.
We are grateful to Gisbert Fanselow and Ad Neeleman for making us aware of these data.
134 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
which excludes a range of data incompatible with a specificity restriction that her analysis purports
to explain. She merely notes in a footnote that the specificity restriction holds under conditions
of normal stress assignment.
14 As McCloskey reports (1979, 1990, 2001, 2002), to whom all data are due, the distinctions
reported here are disappearing in all but the most conservative variants of Modern Irish, as it has
in Scots Gaelic, which uses a gap strategy.
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 135
locally to the position to be bound or at some intermediate clausal boundary between binder and
this position.
136 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
IF hU ihL−1 iF o(x)
recall that hU i is the weak upwards relation that ranges over dominance and LINK
relations. So this is a requirement that at some arbitrary previous point in the tree
development (even immediately before), a LINK transition has been defined from
some head node decorated with F o(x).
The actions associated with the two complementisers then differ according to
whether they induce the construction of some unfixed node and decorate it directly
with the formula x, as with aL (4.21), or merely impose a requirement for a copy
of that formula at some subsequent point in the tree development, as with aN
(4.22).16
16 We do not give here the distinct tense specifications with which these lenition forms are
associated, but the lexical definitions are defined with the pointer starting at some node requiring
?T y(t) so that the addition of such specifications is unproblematic, at least in principle.
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 137
IF ?T y(t)
THEN IF ↓∗ ⊤
THEN Abort
ELSE IF hU ihL−1 iF o(x)
(4.21) aL
THEN make(h↓∗ i); go(h↓∗ i)
put(T y(e), F o(x))
ELSE Abort
ELSE Abort
IF ?T y(t)
THEN IF ↓∗ ⊤
THEN Abort
(4.22) aN ELSE IF hU ihL−1 iF o(x)
THEN put(?hDiF o(x))
ELSE Abort
ELSE Abort
There is a lot more that needs to be said about such specifications. First, the
characterisation of aL does not force its presence at each boundary, although it is
compatible with such markings.17 The incorporation of underspecification into the
articulation of tree relations, using such expressions as
allows just the right amount of flexibility to be able to express the generalisations
needed in a relatively elegant way. Unlike accounts of relative complementisers as
providing a predicate-abstraction operator, which enforces a disjunctive specifica-
tion on all cases of iterated complementisers (since one must not bind, and the
other must), an anaphoric account merely needs to define the actions of the com-
plementiser as compatible with the incremental update of the tree on the basis of
the earlier specification.
We can assume that the first occurrence of aL in a sequence of the same com-
plementiser checks the availability of some antecedent term across a LINK relation,
and then induces the building of a decorated, unfixed node which then gets passed
down the tree. The second occurrence of this complementiser, characterised by
the very same underspecified tree description, again checks out the availability of
a LINK relation with some provided antecedent head, and then initiates the in-
troduction of a new unfixed node with its own decoration. With the unfixed node
from the first instance of aL getting passed down, the second will initiate a sec-
ond unfixed node. But this will be token-identical to the first, and the two will
harmlessly collapse with each other (indeed they cannot be distinguished). With
duplicated aN, on the other hand, no information is percolated down the tree, since
what parsing this imposes on the emergent LINKed tree is a requirement for a copy
of the head formula. Nevertheless if this requirement is reiterated at a lower node
in that subsequent derivation, it will be harmless, as both specifications will be sat-
isfied by the decorations provided by some subsequent parsed pronominal. These
requirement-specifications are not operators requiring discrete binding domains.
So, on the one hand, the problems that arise for predicate-abstraction accounts
of these complementisers (McCloskey 2002) do not arise in the anaphoric style of
account.
17 McCloskey (2001) notes that the effect of lenition is often not audible, so we leave this under-
specified. Forcing its specification would need a definition of locality in terms of there being no
closer representation. See chapter 6 where such a characterisation is provided for anaphor-marking
in Japanese.
138 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
strategy being used in simple clauses where there is no further embedding in the relative; the
resumptive pronoun strategy being used where the long-distance dependency is across from one
propositional structure to another. Defining these strategies to be mutually exclusive is easy
enough, but since this appears not to be a hard and fast restriction, we leave these as equally
available options. Whether or not the gap-requiring strategy is due to an analysis of še along
German or English lines remains to be established, though, given its morphological simplicity, an
analysis along English lines seems preferable.
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 139
of še to a single unambiguous particle remains, postulated ambiguity being a common property
of analyses of this element (see Shlonsky 1992). We have not given any lexical characterisation of
še here, so this challenge remains completely open.
140 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
where the copy provided by the pronominal should be established. This is indeed
what we find in the variation displayed by Romance languages.
The Romance languages, like the Semitic languages, vary as to whether resump-
tive pronouns are required. On the one hand, there is Italian, which is like English
in not requiring the use of a pronoun in establishing relative clause construals, and
dispreferring their use.20 On the other hand, there is Romanian, which patterns
exactly like Arabic, except that there is an island restriction constraining the avail-
ability of such resumptively used pronouns. Resumptive pronouns, which always
take the phonologically-reduced clitic form but are otherwise perfectly regular pro-
nouns, are required in all non-subject positions; but, in addition, the copy of the
head required in the process of relative clause construal must be provided within a
single propositional tree.
Given that the pattern displayed by Arabic arises in virtue of a relatively loose
modal requirement imposed on the top node of the newly introduced LINKed struc-
ture, it is straightforward to define a modal requirement with a more stringent lo-
cality condition. We simply replace the Arabic form of restriction, ?hDi(F o(α)),
with the restriction ?h↓∗ i(F o(α)), ensuring that the copy will not be found in any
LINKed structure. Such a restriction is motivated for Romanian relative clauses as
introduced by the particle care, with its accompanying pe, a structure in which, as
in Arabic, resumptive pronouns are obligatory in all non-subject positions.21
(4.25) a. bǎiatul pe care l- am vǎzut [Romanian]
the boy pe which him have1.SIN G seen
‘the boy that I saw’
b. * bǎiatul pe care am vǎzut
the boy pe which have1.SIN G seen
‘the boy that I saw’
The only difference from Arabic is that such resumptive pronouns are required
to occur locally to the complementiser care in the sense that they project a copy
of the head formula in the same LINKed tree that the complementiser initiates. So
(4.26), unlike (4.25a) is ungrammatical:22
20 Cinque (1995) reports that resumptive use of pronouns is precluded in standard Italian. How-
ever he notes in a footnote that they are used in very informal speech, a comment reported
anecdotally by Spanish and Portuguese informants also.
21 Romanian and Italian both have more than one relative pronoun, one of which demands a
gap. These can be distinguished according to whether they project an annotation for an unfixed
node, possibly like German as an internally structured term of type e, or merely a (modal) require-
ment for the required copy, lexically distinguishing two variants, unlike Hebrew in which a single
morphological form has two divergent uses. Udo Klein has suggested (personal communication)
that it is pe which induces the constraint of requiring a pronoun, as witness:
bǎiatul cu care am venit
(i)
the boy with whom I have come
*bǎiatul cu care l’- am venit
(ii)
the boy with whom him I have come
We do not give an explicit lexical definition of pu care here as there is the issue to sort out as to
what information the pe and care individually project. The function of pe in particular is poorly
understood, but it appears to be associated with LINKed structures, inducing the requirement of
a copy of the head. What we would expect in both Romanian and Italian is that strong pronouns
can be acceptably used if stressed, as in English and Arabic, but this remains to be established.
22 The preverbal position of the clitic is not what is at issue at this juncture, but the inability
of the clitic to have its value established by an antecedent in any structure other than the one
currently under construction. Nevertheless, the position of the clitic is not uninteresting, and the
processes involved in such placement will be a matter of central concern in chapter 5, and again
within a diachronic perspective in chapter 9.
4.1. TOWARDS A RELATIVE CLAUSE TYPOLOGY 141
of related phenomena.
24 Asudeh (2004), as noted earlier, purports to incorporate resumptive pronoun use within a
general account of anaphora. Yet, his proposal constitutes a quite separate pronoun use. Pronouns
are defined as antecedent-seeking operators of logical type e → e; and to get the resumptive use,
Asudeh defines an abstract higher-type operator which applies to the pronoun in question binding
this antecedent-seeking property so that this is removed, leaving the resulting term to be bound
by the regular abstraction-operator associated with the relativising complementiser. The formal
operation, that is, removes the anaphoric property from the pronoun in these contexts, thereby
ensuring that it will be treated as a gap. But this is an ambiguity analysis in all but name, since a
distinct binding device removing the antecedent-seeking definitive of pronouns has to be specially
introduced.
142 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
are incrementally built up on a left-to-right basis, we have a fine structure for iden-
tifying distinct mechanisms while retaining a common semantic characterisation of
all restrictive relative forms of construal. The problems facing syntactic accounts
where expressions are directly related to such specifications of content (the syntax
providing input to a bottom-up compositionality of semantic content) simply do
not arise. Syntactic generalisations, to the contrary, are seen to be grounded in the
time linear projection of the requisite logical structure. The heart of syntax thus
lies in the meta-level task of building up a semantic representation.
formal, is a general label for some aspect of the information being conveyed which is taken as
background, to which other expressions may be anaphorically LINKed. However the term topical-
isation is used for long-distance dependency effects, which signally do not involve the discourse-
based notion of topic, rather inducing a focus effect communicating some type of contrast. To add
to the confusion, in LFG the term is used whether or not the foot of the dependency is realised
with a pronominal (Dalrymple 2001). In general, in Minimalism, the term refers to movement
dependencies, where there is no pronoun signalling the foot of the chain. We shall discuss the
concepts of topic and focus in section 5, but in the mean time, in so far as we use the term topic,
it will be used only informally, of those strings which are construed as associated with pairs of
structures requiring a shared term.
144 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
{T n(0), ?T y(e), ♦}
26 For a recent example, see Adger and Ramchand (2003).
146 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
we define a transition introducing an inverse LINK relation between this node and a
top node decorated by ?T y(e). The relevant rule is given in (4.36) and its associated
tree display in (4.37).
(4.36) Topic Structure Introduction (Rule) :
Notice that no requirement for a copy is imposed on the propositional tree. This is
because at the point at which this rule operates there is (and can be) no formula
decorating the type e node. There is, therefore, here a separation of the introduction
of the LINK relation from the imposition of a copy. This is done by a separate
rule Topic Structure Requirement in (4.38) which has the effect shown in
(4.39).27
(4.38) Topic Structure Requirement (Rule):
The use of the hDi operator encodes the weakest of all tree relations, a re-
quirement that there be a copy of the term just completed on the paired structure
somewhere in the development of this newly introduced top node, as required for
(4.40a). Coinciding with this separation between the two structures is a character-
istic intonation break between the left-peripheral NP in such structures, and the
invariable presence of a pronoun or other anaphoric expression. Constructing a
LINK transition, that is, is used to construct a tree as context for the subsequent
construction of the tree dominated by the root node, as part of the overall informa-
tion conveyed. Indeed, in English, the complex preposition as for arguably induces
the building of such a LINK transition from the lexicon, without need of any inde-
pendently defined computational action. However, the process is a very general one,
so we presume that such a set of moves is also licensed by general computational
action (4.40b). This is a stipulation in so far as the rules need definition; but the
concept of LINK itself as a tree relation associated with anaphorically driven shar-
ing of terms carries over unchanged from the account of relative clause construal,
with expectations of linguistic patterning that are immediately confirmed.
(4.40) a. As for Mary, John has met the man who says he’s going to marry her.
27 This separation of LINK construction and copy requirement will be the pattern explored in
T y(e),
F o(U), ♦ T y(e → (e → t)),
⇑ F o(Dislike′ )
F o(ι, x, Mary′ (x))
and those languages which do not. Where the relativiser does not itself induce
the required copy in the LINKed structure, as in languages such as Arabic and
Romanian, the account so far provided leads us to expect a strong parallelism
between topic structures and relative clauses. Both will require a suitably construed
pronoun in some position within a string, for neither have any expression encoding
the requisite anaphoric link. In languages or structures in which a relative pronoun
secures the presence of the copy of the formula at an unfixed node within the
introduced LINKed structure, there should be no such parallelism.
This asymmetry is indeed reflected in the data. In both Arabic and Romanian
(in relative structures using care), which we have already taken as illustrations
of obligatory resumptive use of pronouns in relative pronouns, the two structures
display parallel effects. In Arabic, for example, a suitably construed pronoun is
obligatory in all non-subject positions, as it is in Romanian:
(4.43) a. l-bint ?p illi
ali ?p a:bil-ha [Egyptian Arabic]
the girl that Ali met-her
‘the girl who Ali met’
b. nadja,
ali ?p a:bil-ha
Nadia, Ali met her
‘As for Nadia, Ali met her.’
(4.44) a. baiatul pe care l- am vazut [Romanian]
the boy pe which him have1.SIN G seen
‘the boy that I saw’
b. pe Ion l am ı̂ntı̂lnit anul trecut.
John him have1.SIN G met year last
‘John, him I met last year.’
Hebrew, as the mixed case, should, like Arabic, display parallelism between relative
clauses and topic structures, and it does (compare (4.24a,b) with (4.45a,b):
(4.45) a. šalom, ?p ani xošev še ?p alav ?p amarta še sara
Shalom, I think that about him you said that sara
katva šir [Hebrew]
wrote poem
‘Shalom, I think that about him you said Sara wrote a poem.’
b. šhalom, ?p alav ?p ani xošev še ?p amarta še sara
Shalom, about him I think that you said that sara
katva šir
wrote poem
‘Shalom, about him I think you said Sara wrote a poem.’
In English, however, there is no parallelism between hanging topic structures
and relative clause construal, for English has an anaphoric relative-pronoun as its
relative complementiser. So, it is only in topic structures that a suitably construed
pronoun is required, as in (4.35)-(4.41a). In relative clauses it is not, and is merely
an option associated with markedness effects, as we have already seen:
(4.46) ?That friend of yours who I found him wandering round the supermarket
seemed very upset.
So the exact parallelism of Hebrew and Arabic relative clauses and such topic
structures is, as we would expect, not matched in English.
4.2. TOWARDS A LEFT PERIPHERY TYPOLOGY 149
examples below the left dislocated nominal is in the nominative case, but paired
with an accusative pronominal in the matrix clause:30
(4.48) I Maria, xtes gnorisa ton andra pu tin
theN OM Maria yesterday I met the man who herACC
pantreftike [Greek]
married
‘As for Maria, yesterday I met the man who married her.’
(4.49) Sue Johninu avale ishtammanu. [Malayalam]
SueN OM JohnDat herAcc likes
‘Sue, John likes.’
However, given the modal form of the requirement, there is every reason to ex-
pect variants of this form of constraint through selection of other modal operators
from the range hDi, h↓∗ i, h↓0 i, again just as in relative clause construal, thereby
imposing more severe restrictions on where the copy is to be found in the second
tree. The imposition of the requirement using the h↓∗ i operator, where the imposed
structural restriction on the antecedent-anaphoric pairing mimics that of an unfixed
node, corresponds directly to so-called ‘Clitic Left Dislocation’ (CLLD) structures
(Cinque 1990), which we see in all the Romance languages. Despite their strong-
island sensitivity, these are obligatorily associated with an attendant pronoun in
the second structure, and separated from the remainder of the string by a sharp
intonational break. Accounting for such strings in terms of paired LINKed struc-
tures would suggest that intonation is playing a role in indicating the separation
of the structure to be projected from the first expression from the structure to be
assigned to the remainder.31
(4.50) el coche, Marı́a lo compró [Spanish]
the car Maria it bought
‘The car, Maria bought it.’
Such strings also display a further property, which is a specificity restriction. In
general, quantified noun phrases cannot occur at the left periphery with pronoun
doubling, with one exception: that of indefinite NPs. And these, if duplicated by a
pronoun, must be interpreted as taking wide scope with respect to the remainder of
the string, being interpreted quasi-referentially. (4.51) is reported as unacceptable,
and (4.33) has to be interpreted as indicating that Pedro is looking for a particular
individual:32
(4.51) ??un coche, Marı́a lo compró. [Spanish]
a car, Maria it bought
‘A car, Maria bought it.’
30 In Malayalam, as in many other languages, nominative case is associated with null morpho-
logical marking.
31 In the Romance, Greek and Bantu languages, among others, clitic pronouns may occur either
as prefixes (as a form of agreement), or as suffixes. The prefixal form requires a specific lexi-
cally induced action licensing the construction of an unfixed node which then merges with the
(object/indirect-object) node projected by the actions of the verb. We shall see in chapter 6, in
turning to Japanese the need for a localised variant of *Adjunction; and in chapter 9, we shall
argue that in the Romance languages the pre-verbal clitic positioning is a calcified lexical reflex
of free use of this localised *Adjunction in Latin. For the moment, the preverbal clitic position
and the decoration of the argument node which the actions of the clitic pronouns give rise to, will
have to be taken on trust.
32 In many analyses, such indefinites are analysed indeed as names (Fodor and Sag 1982, and
many thereafter: see references in chapter 3), but we have already seen reason to doubt any such
analysis of indefinites.
4.2. TOWARDS A LEFT PERIPHERY TYPOLOGY 151
building paired LINKed trees; and that of building an unfixed node within a single
structure. So there will be more than possible analysis for an individual string.
And, as we turn to consider *Adjunction, this is what we shall find.
English expletive it: see chapter 5 where we shall argue that the loss of bottom restriction is what
defines expletive pronouns.
4.2. TOWARDS A LEFT PERIPHERY TYPOLOGY 153
The analysis of this sentence up to the point of merge is shown in the tree in
(4.54). (Irrelevant details are omitted and the speaker is taken to be Maria.)
(4.54) Parsing (4.32a):
T n(0), ?T y(t)
T n(010), F o(U),
T y(cn), T y(cn → e), ?∃xF o(x), F o(Nostalg′ ),
F o(x, Petro′ (x)) F o(λP.(ι, P )) T y(e) T y(e → (e → t))
♦
The effect of the process is shown in (4.55), clearly showing the node originally
decorated by the clitic no longer being terminal in the resulting tree. In such
cases, the case requirement on the unfixed node is met upon unifying it with the
appropriate fixed node.
(4.55) Parsing (4.32a) after Merge:
T n(0), ?T y(t)
As we would expect, these cases need not display specificity effects. Spanish,
which unlike Greek does not have case morphology reflected on the noun phrase
itself, certainly freely allows quantifiers with clitic doubling in the case of dative
clitics:35
(4.56) a nadie le devolvió Marı́a su manuscrito [Spanish]
to no-one CLDAT returned Maria his manuscript.’
‘To no-one did Maria return his manuscript.’
(4.57) a familias de pocos medios les ofrecieron queso y leche
to families of small means to them offer3pl cheese and milk
To low-income families, they offered cheese and milk.’
35 Traditionallyseen as an animacy marker in accusative cases, for which it does not distinguish
between direct and indirect objects, this use of a is extending also to animates (Company 2002).
As we shall see in chapter 7, this is a pattern characteristic of Bantu object marking also, where
the data provide an application of an operation specific to the introduction of locally unfixed
nodes: see chapter 6,7.
154 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
To achieve this effect, and the asymmetry between dative noun phrases and the
rest, we need an analysis which allows two strategies of construal for the one type
of case, and one only for the remainder.37 This is what the analysis of the dative
as having lost its bottom restriction provides.
Furthermore, we now expect an ambiguity in the construal of subject pronouns
in Spanish. Given the lack of morphological marking of case on the subject, and
the pro-drop properties of the verb, we correctly anticipate ambiguity of construal
of Marı́a compro un coche (‘Maria bought a car’) as to whether the subject NP is
taken to decorate a LINKed structure (4.59) or an unfixed node (4.60).38
(4.59) Parsing Marı́a compró un coche with LINK:
hLiT n(0),
F o(ι, x, Maria′ (x)), T n(0), ?T y(t), ?hDiF o(ι, x, Maria′ (x))
T y(e)
T y(e),
F o(U), ♦
T y(e → t)
⇑
F o(ι, x, Maria′ (x))
Just as in the account of the dative clitic, we do not need to say that, in some
sentences, subject agreement requires analysis as some form of indexical, while in
other cases an analysis as an agreement device. The account of verbs in pro-drop
languages as decorating one or more argument nodes with a metavariable without
a bottom restriction gives just the flexibility we need.39
Another expected variation is a language which was originally pro-drop, with ar-
gument annotations provided by agreement morphology, has changed into one that
uses clitics, which have exactly the same properties as the agreement morphology
which they replace. This is arguably the case with conversational French, which
lost its pro-drop properties early on, and which invariably doubles the subject NP
with the presence of a clitic:40
37 The Porteño Spanish accusative clitic appears to be moving in the same direction in that this
intonational patterning of separation between it and its remainder has gone the way of the dative,
and got dropped. However it retains a specificity restriction on all doubling of the clitic. See
Suňer 1991 and elsewhere.
38 See Zubizaretta (1998), Belletti (1999) for arguments that in Spanish a lexical specified subject
Bresnan and Mchombo’s (1987) analysis of subject agreement in Chichewa, which they analyse
as lexically ambiguous between anaphoric and syntactic agreement (see chapter 7).
40 The same style of analysis arguably applies to the Northern Italian dialects, which share with
156 CHAPTER 4. TREE GROWTH AND LANGUAGE TYPOLOGIES
as the first to observe this. The example is taken from Alexopoulou and Kolliakou (2002).
42 The same phenomenon is noted in passing for Hebrew in Sharvit (1999) which pursues an
all languages wherever there is freedom of choice in constituent ordering and the
distribution can be seen to be founded in relevance-related constraints operating
on speakers in the production task of choosing words in a particular order. As
we shall see in chapter 9, a major difficulty in the production task is to choose
appropriate words to express some selected representation of content, and do so
incrementally. Using context-dependent expressions is a major means of reducing
this task to manageable proportions, as the very fact that such expressions are
context-dependent means that some matching term, and the words already used
to express it, will be available in the context, so there will not be any need for a
speaker to search in their general lexicon store to find them. Furthermore, if such
words are placed at the initial edge of what the speaker is trying to say, they will
be temporally as close to that established context as possible, so that even within
that context, the search for the appropriate term will have been made as small as
possible. We shall leave this here as a promissory note to which we shall return
when we consider the DS modelling of production, where this kind of constraint
will play an important role. But what this account suggests is that while some
specificity effects have led to encoded instructions to build LINK structures, others
have not, an apparent indication of a language system in transition.
4.3 Summary
Overall, this account of clitic-duplicated left-dislocation structures matches the
more familiar accounts which invoke a dichotomy between anaphoric and dislocation
bases for characterising the data. However, it brings out the parallelism between
left-periphery effects and processes of relative clause construal, distinguishing those
languages where there is full parallelism from those where the parallelism is only
partial. The intermediate cases also emerge as expected variants, either from the
interaction of independently motivated processes, or as expected variations along a
locality spectrum. In particular, the various anaphoricity effects that are achieved
do not need to be formalised with mechanisms distinct from regular processes.45
The only difference between pronouns which do allow update by node-merging pro-
cesses and those which do not, is in the loss of the terminal-node restriction. Strong
pronouns invariably retain such a restriction, behaving in all respect as regular lex-
ical items. Clitic pronouns may have lost this restriction, though by no means all
have done so. In all other respects, these pronouns function as regular anaphoric
expressions, the loss of the restriction merely ensuring that they have a wider distri-
bution. So despite the complexity of the data, the various options available emerge
from the interplay of general principles of tree growth, and the minimum of lexical
stipulation.
As with the characterisation of relatives, this left-periphery typology is only
partial in that there is no hint of how a LINK relation might be induced in the
latter stages of the construal process, a right-periphery effect, a matter which we
turn to in the next chapter.
159
160 CHAPTER 5. ON THE RIGHT PERIPHERY
be no return to any of its daughter nodes: the addition of any further decorations
on these nodes would lead to the modification of the decorations on the mother
node in the light of any such changes which is incompatible with the monotonicity
of the tree growth process, the centre pin of the concept of tree growth. We now
see this constraint applying in a number of different ways.
The first case we take up in the light of this is a recapitulation effect. All
languages permit a pronoun to double some NP on the right periphery of the clause,
the effect being a backgrounding of the postposed constituent, sometimes described
as a “background topic” (Herring 1994). It is even possible in some languages for
the right peripheral noun phrase to be an emphatic pronoun (5.6c).
(5.6) a. I think you should realise that it’s an impossible topic, right
dislocation.
b. He’s not very good, that new administrator.
c. I’m not happy, me.
In these structures, an anaphoric expression is necessarily identified as co-referential
with the formula annotating the right-peripheral structure, which is itself optional
and definite (5.7). Since it re-identifies the same element identified in the main
clause by the pronoun, we may refer to the right peripheral expression as a reca-
pitulation.
(5.7) a. She’s a fool, my mother.
b. She’s a fool.
c. *She’s a fool, my brother.
d. ??She’s a fool, a woman I saw yesterday.
This construction may be considered to be the analogue of Hanging Topic con-
structions on the left periphery, discussed in chapter 4. In left-dislocation struc-
tures, we postulated the construction of a LINK relation between a node of type
e, projected for the analysis of an initial noun phrase, and a node requiring type t,
providing the main proposition. The recapitulation case is naturally interpreted in
DS analogously, but inversely, with a transition from the rootnode of the proposi-
tional tree to some following structure requiring type e. The term decorating this
LINKed tree is required to be identical to some subterm of the just constructed
propositional structure, which must itself provide a complete formula of type t.
We thus have the construction rule in (5.8) which licenses the transition shown in
(5.9).2
(5.8) Recapitulation (Rule):
2 Note the use of the external underspecified modality ↑ . This indicates that the node decorated
∗
by F o(α) is dominated by node T n(0) but at a possibly fixed position. See the discussion in chapter
2 on page 53.
162 CHAPTER 5. ON THE RIGHT PERIPHERY
The fact that this rule projects from a completed tree of type t accounts di-
rectly for optionality (5.7b) while the requirement imposed on the LINK structure
accounts for co-referentiality (5.7a),(5.7c). The restriction of these right dislocated
expressions to referring expressions (5.7d) follows from the fact that the pronoun in
the preceding clause cannot be truly cataphoric. This is precluded by the constraint
that a mother node cannot be completed and eliminated unless both daughters
carry no outstanding requirements. Since a pronoun projects a metavariable with
its associated formula requirement, the propositional tree cannot be completed until
substitution has occurred. It is the assigned value derived from substitution that is
carried across as a requirement on the development of the LINKed structure. This
can then only be satisfied by some referential term which itself identifies the same
value in context (i.e. a proper name or a definite noun phrase). Thus, in parsing
(5.2a) He talks too fast, the new secretary, the metavariable projected by the pro-
noun is substituted in context with the term picking out the new secretary, some
individual named Bill, which is carried across the LINK relation as a requirement
to be satisfied by the formula value of the postverbal definite NP.
(5.10) Parsing He talks too fast.. in (5.2a) :
T n(0), T y(t),
hL−1 iT n(n), ?T y(e), ?F o(ι, x, Bill′ (x))
F o(Talk-Fast′ (ι, x, Bill′ (x)))
T y(e), F o(U)
T y(e → t),
⇑
′ F o(Talk-Fast′ )
F o(ι, x, Bill (x))
A bonus for this analysis, in contrast to the problematic apparent violation of
principle C which these data pose for binding theory accounts, is the naturalness
with which it reflects the fact that a right dislocation structure with this type of con-
strual, like the analogous left dislocation structure, is a root structure phenomenon,
unavailable within the confines of an individual tree.3
Despite the mirror-image of Hanging Topic Left Dislocation that this analy-
sis presents, there is nevertheless an asymmetry between right and left periphery
effects, which is a consequence of the dynamics of time-linear processing. A left
peripheral expression that is used to introduce a LINKed structure projected at the
outset (as a topic) cannot rely for its interpretation on any information projected
by the following clausal sequence. However, the projection of a pronoun within the
subsequent clausal string can, indeed must, take the preceding LINKed structure
as its context.
(5.11) Giovannii , li/∗j ’apprezzavamo [Italian]
Giovanni him, we appreciate
‘Giovanni, we appreciate him.’
3 The naturalness of this explanation is in sharp contrast to the setting aside of these data by
5.3 Late*Adjunction
Having seen how a right peripheral expression can be analysed using the same LINK
analysis as for Hanging Topic constructions at the left periphery, we now turn to
an exploration of the applicability of using some form of *Adjunction at a late
stage in a parse to analyse some right peripheral expression.
It might seem that this could not be motivated in Dynamic Syntax, since the
later stages of processing a string do not provide the same sort of underspecifica-
tion so manifestly provided at the left periphery. Once some structure has been
introduced, and all its terminal nodes decorated, it would appear that there could
be no node introduced later into the structure whose position was underspecified.
Such a view is forced in frameworks which do not allow for left/right asymmetry,
yet such a conclusion is too hasty for Dynamic Syntax. Indeed, the conclusion
would entail that left and right periphery processes are radically asymmetric, when
it is precisely their symmetry that is more commonly observed (see, for example,
Steedman (2000) for whom the symmetry is definitional). In any case, we have
left a margin of flexibility open through the analysis of anaphoric expressions as
projecting metavariables as their formula value, leaving a requirement for a fixed
formula value yet to be provided. Nothing forces substitution to apply at any point.
Like all other rules substitution is optional, and with the provision of a type spec-
ification, completion can apply, and hence the parse process can proceed, leaving
that requirement unfilled. Should substitution not have taken place as the pronoun
is parsed, however, the pointer will have to return to that node at a later juncture
if a successful derivation is to be achieved.
Consider, now, what the parse process involves in general terms. Structure
is progressively introduced in a top-down, goal-directed way, introducing either
unfixed nodes to be updated or a skeletal propositional structure, to which lexical
actions provide further update for terminal and nonterminal nodes. The tree is then
evaluated bottom-up by functional application applying only if all requirements on
the daughter nodes are satisfied. Pointer movement does no more than reflect
incremental development, moving first down the tree driven by computational and
lexical actions, and then successively back up it driven by functional application
over types to complete the tree, as shown in detail in chapter 2. In the closing
stages of interpreting a clausal string, once the requisite structure is introduced
and all words are parsed, successive steps of evaluation then determine the values
of nonterminal nodes on the tree whose value is not yet established.
5 Itis notable that in English, such repetitions may occur in such examples as (i)-(ii) with the
combination of NP plus auxiliary at the right periphery acting as a backgrounding device:
(i) He’s always a pain, he is.
(ii) He always says nice things, Donovan does.
We take up the anaphoric properties of expressions occurring with verbs such as be in chapter 8.
5.3. LATE*ADJUNCTION 165
There is nevertheless a potential role for *Adjunction even at this late stage.
Consider the situation in which there is some node in the tree whose character-
isation during the construction process has been partial, leaving some unfulfilled
requirement. In such a situation, a late application of *Adjunction might be
invoked to satisfy the outstanding requirement through Merge. This would be
the case, for example, if a metavariable failed to get a fixed value through Substi-
tution from the context (all rules being optional). In such circumstances, while
the node decorated by the metavariable is type-complete, it is still formula incom-
plete and there remains an outstanding requirement. Satisfying a type requirement
allows the pointer to move on in the construction process, but the open formula
requirement causes a problem in the tree evaluation process, as the mother node
cannot be evaluated by Elimination. Steps must therefore be taken to allow this
requirement to be met in order to allow the tree to be completed. To accomplish
this the pointer must be able to move back down to the relevant daughter node
in order to further specify it. In such a situation, we have a scenario in which
late application of *Adjunction can lead to a successful update of the tree, by
creating an unfixed node whose formula annotations will provide the satisfaction of
the outstanding requirement through the Merge process.
In order to make this type of analysis more concrete we will explore a num-
ber of constructions in which the hypothetical situation sketched above occurs: a
metavariable is projected by some pronominal element (overt or covert) but its
content is only identified at some later stage of the parse.
5.3.1 Extraposition
We begin by looking at a construction in English which may not be obviously
associated with the right periphery, but which involves a pronoun that allows a
contentive expression to appear post-verbally: sentence extraposition. In this con-
struction, the subject argument of a predicate is propositional and so we find clausal
strings appearing pre-verbally as syntactic subject (5.17a). We also find the clausal
expression in post-verbal position just in case there is an expletive pronoun before
the verb (5.17b).
(5.17) a. That we are wrong is possible (but not likely).
b. It’s possible that we are wrong (but it’s not likely).
The pronoun it in (5.17b) is not ‘referential’, taking its value from the context
in which the string is uttered, but expletive in that it takes its content from the
postverbal expression.
In nonpro-drop languages such as the Germanic languages, lexicalised expletives
are essential in this position. Without them, the parsing sequence breaks down
and cannot proceed because the verb does not decorate its subject position with
a metavariable, unlike Japanese (Chapter 6) and some of the Romance languages
(see chapter 2 and below). Thus, the string in (5.18) is simply unparsable as the
pointer cannot move on from the subject node without lexical input.
(5.18) *Is likely that Bill is confused.
The function of an expletive use of a pronoun, accordingly, is to keep the parsing
process alive: it first provides a metavariable as an interim value to some type re-
quirement associated with one node and then moves the pointer on to another node.
Because the pointer is moved on as part of the actions determined by parsing the
expletive pronoun, no substitution can take place and an open formula requirement
necessarily remains on the node decorated by the metavariable.
166 CHAPTER 5. ON THE RIGHT PERIPHERY
The (provisional) definition of these actions is given in (5.19) which assumes that
T y(t) can decorate a subject node, and that certain predicates project a formula
of type t → t.6 Note the extra condition, h↑i⊥, which checks whether the current
node is the topnode in a tree and aborts the parse if it is, thus preventing it from
being the sole expression in a sentence.
IF ?T y(t)
THEN IF h↑i⊥
THEN Abort
(5.19) itexpl
ELSE put(F o(U), T y(t), ?∃xF o(x));
go(h↑0 ih↓1 i)
ELSE Abort
T y(t), F o(U),
?T y(t), ♦ ?T y(t → t) ?T y(t → t), ♦
?∃x.F o(x)
Once the predicate node is decorated, the pointer moves to the mother node in
order to complete the propositional type requirement. However, because the subject
node still carries an unsatisfied formula requirement no evaluation can proceed; and
the pointer must move back down to the subject daughter in order to complete the
requirements on this node. Since the node is type-complete, however, it looks
as if the parse is doomed to failure. But it is at this point that a variant of
*Adjunction, which we dub Late *Adjunction, can apply to provide an open
type requirement allowing the parse of new material to take place.
Unlike the versions of *Adjunction discussed in earlier chapters, Late *Ad-
junction projects an unfixed node with a requirement for the same type as the
node from which it is projected. Since no further direct development of the fixed
node is possible, this version of *Adjunction thus defines directly the structural
context to which Merge applies, i.e. the unfixed node and the fixed node from
which it is projected. We define this rule in (5.21) and show the effects from some
arbitrary node in (5.22).7
(5.21) Late*Adjunction (Rule):
6 Noting the possibility of (i)-(ii), we do not provide any tense restriction on the propositional
This is the only form the rule could take, on the assumption that downward un-
folding rules have provided a complete skeletal propositional structure; as it is
only a development of one of these already introduced nodes which remains to be
completed.
Applying Late*Adjunction to the subject node in a parse of It is possible
yields the configuration in (5.23).8 This permits the parse of the post-verbal string
and the completion of the unfixed propositional tree immediately feeds an appli-
cation of Merge, as shown in (5.24), which yields a complete subject node and a
final formula value F o(Possible′ (Wrong′ (CKM′ ))) as desired.
(5.23) Parsing It is possible :
T n(0), ?T y(t)
T n(00), T y(t),
T y(t → t), F o(Possible′ )
F o(U), ?∃x.F o(x)
T n(00), T y(t),
T y(t → t), F o(Possible′ )
F o(U), ?∃x.F o(x), ♦
F o(CKM′ ) F o(Wrong′ )
This analysis has the bonus of providing a much tighter restriction on late ap-
plications of adjunction characteristic of all rightward dependencies, encapsulating
8 Ignoring the contribution of the copula, including tense. See Chapter 8.
168 CHAPTER 5. ON THE RIGHT PERIPHERY
the Right Roof Constraint.9 The application of Late*Adjunction can only arise
in a context in which one of two daughter nodes lacks a F ormula value, so no
successful annotation of the mother node is as yet possible. The string in (5.25a)
will never emerge as well-formed, as the projection of an annotation for the embed-
ded subject cannot be successfully achieved without some later addition of a node
providing the necessary update for the metavariable projected by the expletive pro-
noun it. (5.25b), on the other hand, is fully well-formed, precisely because such a
value is provided prior to the construction of the main predicate.
(5.25) a. *That it is certain is unfortunate that we are wrong.
b. That it is certain that we are wrong is unfortunate.
Note further that the effect of parsing the expletive pronoun in moving the pointer
straight to a sister node means that examples like (5.26a) are precluded, given the
lexical definition of the expletive. The predicate must be established before the
pointer can return to the subject node.10 At the same time, left dislocation of a
clause is awkward, if not ungrammatical, in the presence of the expletive (5.26b).
Here, the fixing of the adjoined tree should occur as soon as the pointer is on the
subject node (see chapter 2) which prevents the parsing of the expletive.
(5.26) a. *It that I’m wrong is possible.
b. ??That we are wrong, it is possible.
An important thing to notice at this point is that for Merge to successfully
take place in (5.24) the node decorated by the expletive pronoun must not project
a terminal-node restriction, [↓]⊥. This is because any node initially decorated by
a pronoun whose final formula value is only successfully provided by a step of
Late*Adjunction and subsequent Merge will, in the end result, be a nonter-
minal node. Indeed, we take the lack of this restriction to be a definitive prop-
erty of expletive expressions of all kinds, as discussed at the end of the last chap-
ter. Despite the fact that Late*Adjunction and subsequent Merge only lead
to a successful outcome if the formula which decorates the node which is input
to Late*Adjunction contains a metavariable, no special provision needs to be
made to reflect this. This is a consequence of the fact that conflicting formula values
on a single node lead to ill-formedness, unless one of those values is a metavari-
able. Hence, the restriction follows without stipulation from the definition of well-
formedness for a node.
We return to a further discussion of expletive expressions in English in Chapter
8, but for now what we have established is that the analysis of expletive pronouns
may require a late application of *Adjunction to determine their content.
b. ha telefonato Maria
has phoned Maria
‘S/he phoned Maria.’
Given that an overt subject expression is optional in Italian and similar languages,
parsing the verb must construct a subject node which is decorated by a metavari-
able, exactly as though a pronoun were present, as we saw for Greek in chapter 2.
This is unlike English where subject nodes are constructed by general transition
rules and are initially decorated only with a type requirement. It is, furthermore,
unlike Japanese in that it is only the subject node which the verb annotates with a
metavariable: non-subject argument nodes are specified as open type requirement,
as in English. A verb such as sembra ‘seems’ will, therefore, project a full propo-
sitional template, decorating its propositional subject node with a metavariable as
illustrated in (5.28).
(5.28) Parsing sembra :
T n(0), ?T y(t), ♦
Since the verb is intransitive, the pointer moves to the topnode via Completion
from the predicate node, as shown. Elimination cannot, however, apply until
the formula requirement on the subject node is satisfied, which can be done by
an application of Late*Adjunction, exactly as in English (5.29). The only dif-
ference between English and Italian in this respect is thus the actions associated
with parsing verbs. There is nothing at all mysterious about the lack of expletive
pronouns in such constructions in pro-drop languages: it follows as a consequence
of the analysis.
(5.29) Parsing sembra che
T n(0), ?T y(t)
Such subject pro-drop languages also permit the inversion of a subject to the end
of the clause (in simple clauses) as exemplified by the examples in (5.4) repeated
below.
(5.4) a. ha telefonato Beatrice [Italian]
has telephoned Beatrice
‘Beatrice has telephoned.’
b. e’ arrivato uno studente
is arrived one student.
‘A student has arrived.’
170 CHAPTER 5. ON THE RIGHT PERIPHERY
Again this property is not strange from the DS perspective and again follows from
the actions associated with parsing verbs. We saw, in chapter 2, two cases where
the parsing of verbs result in the projection in full propositional structure. In Irish
and Greek, we assumed, at the end of the lexical actions associated with verbs,
the pointer is at the open subject node, in Irish decorated merely with a type
requirement and in Greek with a metavariable and completed type. the effect in
Irish is to force VSO ordering: the subject must be parsed immediately after the
verb (at least in finite contexts). Greek, also, shows VSO properties (although not
fully) and Late*Adjunction gives us the means for explaining this and at the
same time accounting for inversion in Italian.
Consider first the Greek example in (5.30), in which the subject occurs imme-
diately after the verb:
(5.30) kseri o Stavros ti Maria he knows [Greek]
theN OM Stavros theACC Maria
‘Stavros knows Maria.’
We associate a parse of the verb kseri ‘knows’ with the actions in (5.31).
(5.31) kseri [Greek]
IF ?T y(t)
THEN put(T ns(P RES));
make(h↓1 i); go(h↓1 i); put(?T y(e → t));
make(h↓1 i); go(h↓1 i);
put(F o(Kser′ ), T y(e → (e → t)), [↓]⊥)
go(h↑1 i); make(h↓0 i); go(h↓0 i); put(?T y(e));
go(h↑0 ih↑1 i); make(h↓0 i); go(h↓0 i);
put(T y(e), F o(U), ?∃x.F o(x))
ELSE Abort
As noted, the pointer is left at the subject node and the metavariable can either
be substituted by some accessible term from the context (as in kseri ti Maria
‘S/he knows Maria’), merged with some previously projected unfixed node (as in
o Stavros kseri ti Maria, see (2.75) on page 67) or give rise to an application of
Late*Adjunction, as illustrated in (5.32) which provides a means of parsing the
first three words in (5.30) (irrelevant detail omitted).
(5.32) Parsing kseri o Stavros :
T n(0), ?T y(t)
T n(00), T y(e),
?T y(e → t)
F o(U), ?∃x.F o(x), ♦
The same processes apply in parsing the intransitive Italian examples in (5.4),11
but in transitive contexts things are not the same. Italian does not show VSO
ordering and Subject Inversion with transitive verbs has the subject following the
object, as in (5.33).
?T y(t)
T y(e), F o(U),
T y(e → t), F o(Parl′ (Inglese′ ))
?∃x.F o(x), ♦
11 Although it should be noted that (5.4a) is, in fact, ambiguous between an intransitive ‘Beatrice
(i) Kim thinks that the Vice Principal is insane. It’s possible.
172 CHAPTER 5. ON THE RIGHT PERIPHERY
?T y(t)
T y(e), F o(U),
T y(e → t), F o(Parl′ (Inglese′ ))
?∃x.F o(x), ♦
T y(e → t),
T y(e), F o(p22 )
F o(Oi′ (ι, x, Paca′ (x)))
(5.39) Parsing (5.37a) with the dative clitic and quantifying expression:
?T y(t)
T y(e),
?T y(e → (e → t))
F o(ǫ, x, Queso′ (x))
T y(e → (e → (e → t))),
T y(e), F o(U), ♦
F o(Ofrecier′ )
We have now seen that the concepts of formula underspecification, LINKed struc-
tures and positional underspecification do operate in the analysis of right peripheral
14 We do not provide an account of the requisite actions here. See chapter 9 section 9 for
effects, as well as those on the left. They allow analyses to be given of apparently
quite disparate constructions such as pronoun doubling, sentence extraposition,
subject postposing and clitic doubling. The fact that the same concepts apply to
both the left and right peripheries provide striking support for the form of gram-
mar proposed in this book, while the time-linearity of the parsing process accounts
nicely for observed asymmetries between the two peripheries. A particular bonus is
the explanation of the Right Roof Constraint as a consequence of compositionality
of the tree growth process: in other frameworks, this restriction remains a mystery
- able to be expressed by some form of stipulation, but not subject to a principled
explanation.
hence outside the remit of the grammar formalism, strictly speaking.16 If this is the
right stance, such data would be intransigent also for a framework like DS, since
processes of update are exclusively defined over partial semantic structures, and not
over (structure defined over) strings. The challenge for Dynamic Syntax is whether
the combination of building independent but anaphorically LINKed structures and
introducing unfixed nodes within an individual tree are sufficient to reflect these
notoriously problematic properties.
We have so far seen the separate applicability of projecting LINKed structures
and unfixed nodes in characterising right peripheral constituents. Nothing pre-
cludes the interaction of these processes, and we expect there to be strings whose
well-formedness turns on an analysis requiring projection of both LINK relations
and Late*Adjunction. More specifically, we might expect the occurrence of
strings whose interpretation is induced as a pair of LINKed structures of which
the projection of the second involves application of Late*Adjunction, yielding a
composite right-dislocation effect. This, we suggest, is what underpins the analysis
of Right Node Raising.
Completing the parse of John came in and Mary fainted yields two LINKed
propositional trees, the first with formula value Come-in′ (ι, x, John′ (x)) and the
second with Faint′ (ι, y, Mary′ (y)). In chapter 3, we have presumed on the relation
between LINK structures being assigned the weakest possible interpretation, that
of conjunction; and the process of LINK Evaluation defined there for non-restrictive
relative clause evaluation (3.13) on page 79 applies equally to the evaluation of a
pair of LINKed structures constructed in the construal of sentence co-ordination.
The completion of the construction process from parsing Jane came in and Mary
fainted will thus yield a propositional formula for the whole structure:17
such as or require different means of interpreting the two LINK structures. In Cann et al. (2005),
we generalise the process of interpreting LINKed structures, introducing an EV AL predicate
which takes as value some (possibly logical) connective which determines the appropriate semantic
relation between the two linked trees. We do not pursue this elaboration here, as it is not directly
relevant to the task of characterising RNR.
178 CHAPTER 5. ON THE RIGHT PERIPHERY
This induces a structure like that in (5.57) after a parse of the first two words in
(5.53a) with the tree that results after the parse of the conjunct shown in (5.58).
The completed tree (not shown) is evaluated to give the formula value:
F o(¬Get′ (ǫ, x, Award′ (x))(ι, y, John′ (y)) ∧ Clever′ (ι, y, John′ (y)))
T n(0), ?T y(t)
T n(00), T y(e),
?T y(e → t)
F o(ι, y, John′ (y))
T n(0), ?T y(t)
T n(00), T y(e),
?T y(e → t)
F o(ι, y, John′ (y))
5.4.2 An Example
We are still not quite at the point where we can provide a full characterisation of
Right Node Raising. The DS account of this construction requires one assumption
additional to the mechanisms already in place: we allow there to be lexically defined
updates without any overt morphological form, for which only intonation provides
the clue as to what structure is to be built. Specifics of the acoustic stream is
5.4. RIGHT NODE RAISING 179
one aspect of the input which we have so far ignored altogether, and indeed the
analysis of prosodic information within the DS system remains an open question.
However, in such a system, with an explicit parsing-oriented perspective, sensitivity
to intonation is entirely expected: intonation forms part of the phonetic signal and
is thus available to induce procedures of interpretation during the course of a parse.
We suppose, then, that intonation can have the effect of signalling the ad hoc
construction of a metavariable as an interim F ormula value, indicating that the
containing structure is left incomplete at the current stage of the interpretation
process. This is not sufficient, however, as strings like those in (5.59a,b) are ill-
formed even with RNR-style intonation: the intonation apparently signals that the
string is incomplete and requires completion by some other expression.
(5.59) a. *John eagerly anticipated.
b. *John eagerly anticipated, but Mary dreaded.
c. John eagerly anticipated, but Mary dreaded, meeting the new
ambassador.
The insertion of the metavariable by the proposed rule must thus trigger one of
two distinct types of action: either to induce the building of a LINK transition, or
the building of an unfixed node which completes the structure. Such actions both
cut short the construction of the current node and prevent pragmatic substitution
of the metavariable, thus ensuring the incompleteness effect of the prosodic signal
and further development of the tree.
With these assumptions concerning co-ordination and the possibility of a metavari-
able being inserted without a morphological trigger,18 we are now in a position to
show how Right Node Raising may be characterised within Dynamic Syntax. To
illustrate, we take the example in (5.5a) and show informally how the incremental
processing of this string leads to a well-formed propositional structure with the
expected interpretation.
(5.5a) Mary wrote, and I reviewed, a paper on resumptive pronouns.
The parse of the first two words proceeds as we would expect. The the sub-
ject and predicate nodes are introduced by Introduction and Prediction in the
normal way, with the initial noun phrase being parsed to provide the content of
the former, and the verb providing a formula annotation for a two-place predicate
and an internal argument node awaiting development. Given the signal of incom-
pleteness by intonation (or punctuation), a metavariable is provided as annotation
to this node, satisfying its type requirement while at the same time projecting ad-
ditional structure, in this case a LINK structure with a requirement to find within
it a copy of the metavariable decorating the host node, as illustrated in (5.60).19
T n(0), ?T y(t)
T y(e),
?T y(e → t)
F o(ι, x, Mary′ (x))
The analysis then proceeds with the parse of the subject and verb of the second
conjunct as usual until the pointer rests at the internal argument node of the
predicate. At this point, again signalled by intonation, a further metavariable may
be introduced. This variable is taken to be identical to that used to construct the
structure for the first conjunct, anticipating a means of securing the fulfilment of
the requirement imposed by the LINK transition. It is then Late*Adjunction
which, this time, provides a node with an open requirement of type e and allows
the parse of the right dislocated expression, as shown in (5.61).
(5.61) Parsing Mary wrote, and I reviewed,
T n(0), ?T y(t)
T y(e),
?T y(e → t)
F o(ι, x, Mary′ (x))
T y(e),
?T y(e → t)
F o(ι, x, John′ (x))
Once the unfixed node is decorated and completed, it can Merge with the
argument node in the structure assigned to the second conjunct, and the inter-
pretation of the whole is duly compiled. In the construal of (5.5a), this yields an
interpretation for the second conjunct as (5.62).20
(5.62) F o(Review′ (ǫ, z, Paper′ (z))(ι, y, John′ (y))), T y(t).
20 The term constructed from a paper on resumptive pronouns is represented as ǫ, z, Paper′ (x)
With the propositional formula established for the LINKed structure, the pointer
moves back along the LINK relation to the terminal node with its provisional
metavariable characterisation, which it updates with the value to the variable that
was established in interpreting the second conjunct; and the interpretation for the
first conjunct can at last be completed and finally combined with that of the second.
The resulting formula value is given in (5.5a’)
(5.5a’) F o(Write′ (ǫ, z, Paper′ (z))(ι, x, Mary′ (x)) ∧
Review′ (ǫ, z, Paper′ (z))(ι, y, John′ (y)))
chapter 3, (3.46) on p.98. However, this set of actions is associated with very restricted contexts
and does not satisfy an open type requirement.
182 CHAPTER 5. ON THE RIGHT PERIPHERY
(5.66) is precluded on two counts. On the one hand, in the processing of the
second conjunct, obligatory movement of the pointer away from the node decorated
by the meta-variable precludes application of Merge directly. Equally, Merge
cannot apply to the output of Late*Adjunction unifying the node decorated
by that woman with the object node of the predicate F o(Dislike′ ), because the
node introduced by Late*Adjunction is not itself fixed. On the other hand, if no
application of variable insertion takes place in parsing the second conjunct, Merge
will apply directly to the object node, leaving the requirement for an update to the
variable from the first structure imposed by the LINK transition unsatisfied, and
no provision for it provided by the construction process as initially indicated. With
no possible derivation of a logical form, (5.66) is not well-formed.
This characterisation of LMVI, as it stands, has two unwanted effects. In the
first place, since it constructs a LINK structure, it will permit examples like that
in (5.67), where no conjunction occurs.
(5.67) ?*Mary criticised, John consoled, the weeping student.
This could be rectified using the EV AL predicate, which was proposed in Cann et
al. (2005) as a basis for generalising over connectives that there must be at least
one occurrence of the connective in some sequence of LINKed structures in order
to trigger the appropriate evaluation (see footnote 17). By adding a requirement
to the node to which the LINK structure is identified that specifies that some
EV AL value be found (?∃x.EV AL(x)), the apparent need for a conjunction will
be ensured.22
The second, possibly unwanted, effect of this version of LMVI is that conjunc-
tions will also need to be effectively ‘expletive’ in certain contexts. In other words,
instead of projecting LINK structures directly as indicated in (5.56), a conjunction
may be parsed in a context in which a LINK structure is already defined, simply
adding an EV AL value to the structure. As we wish to keep the characterisation
of co-ordination as simple as possible at this point for the discussion in Chapter 7,
we will not go into this further. What is important at this point is that the rule of
LMVI directly encodes what the intonational cue intuitively signals, which is that
the string is not yet complete, hence the projection of structure that needs to be
determined.
Although not problematic from the point of view of the grammaticality of the
output, postulating such “free-ride” processes in the lexicon without phonologi-
cal input does pose problems for the psychological parsing process as it substan-
tially increases the set of choices at any point during the parse. We have simply
stipulated in our analysis what it means to break off from a routine parse pro-
cess, lexically enforcing applications of the procedures of LINK-construction and
Late*Adjunction which, as computational actions, are optional. This is where
the characteristic prosody of Right Node Raising becomes significant: by its use,
the speaker signals to the hearer the incompleteness of the proposition under con-
struction, through the modification of the normal prosodic contour. In other words,
the speaker makes manifest to the hearer the possibility that a non-canonical op-
eration must be performed to yield a well-formed final representation. It is in this
sense that we consider the intonation to license the introduction of a metavariable
without lexical input. We assume, that is, that prosody does not give rise to spe-
cific parsing actions, and that intonational contours are not directly associated with
lexical actions (following Ladd 1996: 98 ff. amongst others). However, intonation
may nevertheless be used to bring out parsing information which might otherwise
not be recoverable.
22 Values of this predicate are, at least, the logical conjunctions ∧ and ∨, there may be others.
5.4. RIGHT NODE RAISING 183
will arise in any case where a lexical item has conditions which trigger the update
actions that it provides that are not reflected in that output. In all such cases,
it is only in the second conjunct that the triggering condition will be met. This
property is met by a negative polarity item such as any:
(5.69) John has read, but he hasn’t understood, any of my books.
By hypothesis, we assume that any is a negative polarity item which projects
into the structure a regular indefinite (epsilon) term as Fo value, but it does so only
in the presence of a negative (or ‘affective’) feature decorating its locally dominating
propositional type node.23
IF ?Ty(e)
THEN IF ↑∗ N EG
THEN make(h↓1 i); go(h↓1 i);
(5.70) any put(F o(λP.(ǫ, P )), T y(cn → e)); go(h↑1 i);
make(h↓0 i); go(h↓0 i); put(?T y(cn))
ELSE Abort
ELSE Abort
The negative polarity condition must be met by the structure projected from the
second conjunct in (5.69), hence the update is licensed, and a successful action of
Merge takes place. The object node associated with the first conjunct of (5.69),
on the other hand, being decorated with the same metavariable as in the second
conjunct, then gets updated with whatever value is assigned to that second occur-
rence, this being the indefinite term projected by any. The presence of this term
is thus duly licensed in the structure projected from the first conjunct despite the
lack of negation.
Sensitivity to the presence of negation is not required for the indefinite term it-
self: it is merely a condition on the tree in which the lexical item any is to provide
an update. The analysis leads immediately to a second prediction of asymmetry.
We expect correctly that negation only in the first conjunct will lead to ungram-
maticality, as is indeed the case.
(5.71) *John hasn’t understood, but he has read, any of my books.
This observation is generalisable to other clashes in the selectional properties
between the predicates in the two conjuncts. Such clashes will be more tolerated
if resolved solely with respect to the second conjunct but not if resolved solely
with the first. Hence, (5.72a) is preferable to (5.72b), as (5.73a) is preferable to
(5.73b).24
(5.72) a. ?John prefers but Sue would rather not eat meat.
b. *John prefers but Sue would rather not to eat meat.
(5.73) a. ?John intended to, but Sue prevented him from, submitting a paper
to Linguistic Inquiry.
23 This feature is shown as N EG. Again this is a promissory note for an account of downward
Not all speakers judge (5.72) and (5.73) to be fully well-formed. What is at issue, in the analysis,
is whether the action induced by the verbs includes some reflex of their particular non-finite form
in the required logical-form output, or whether they merely contribute different triggers for the
required update action. Leaving this issue unresolved, the difference in acceptability between
(5.72a) and (5.72b) on the one hand and between (5.73a) and (5.73b) on the other is nevertheless
uniformly agreed upon by all speakers.
5.4. RIGHT NODE RAISING 185
b. *John intended to, but Sue prevented him from, submit a paper to
Linguistic Inquiry.
These data showing asymmetry translate straightforwardly into many other
languages, and we cite here comparable negative polarity data from Hindi and
Malayalam:
(5.74) a. John-ne parhaa lekin voh samjhaa nahĩ meri koi
JohnERG read but he understand-past not my any
kitaabẽ [Hindi]
books
‘John read but hasn’t understood any of my books.’
b. *John-ne samjhaa nahĩ lekin voh parhaa meri koi
JohnERG understood not but he read-past my any
kitaabẽ
books
‘John has not understood but has read any of my books.’
under our analysis, required to match the case requirement of the verb in the second
conjunct, while licensing a mismatch with the case requirement of the verb in the
first. Contrarily, a subject noun phrase marked to match the verb in the first, but
to mismatch with that of the second, should not be well-formed. This is exactly
what we find, as illustrated in (5.75).
(5.75) a. us aurat-ko ignore kiyaa aur abhi usko samadhan
that womanDAT ignore did and now sheDAT reassure
karnaa parhega John-ko
do will have to JohnDAT
‘John ignored that woman and will now have to reassure her.’
b. *us aurat-ko ignore kiyaa aur abhi usko samadhan
that womanDAT ignore did and now sheDAT reassure
karnaa parhega John-ne
do will have to JohnERG
‘John ignored that woman and will now have to reassure her.’
c. us aurat-ko samaadhan kiyaa aur abhi usko ignore
that womanDAT reassure did and now sheDAT ignore
kareega John
will do JohnN OM
‘John reassured that woman and will now ignore her.’
d. *us aurat-ko samaadhan kiyaa aur abhi usko ignore
that womanDAT reassure did and now sheDAT ignore
kareega John-ne
will do JohnERG
‘John reassured that woman and will now ignore her.’
Exactly comparable data occur in German.
We thus have welcome independent evidence of an analysis initially proposed
solely on the basis of the English data. In striking contrast to the way in which
the data match the DS style of analysis, it is not at all clear how in situ, CCG, or
movement analyses of RNR could be extended to these data. In particular, the in
situ style of analysis is committed to there being copies of the dislocated expression
in the two sites, with whatever restrictions the expression imposes being matched
symmetrically in both contexts (see e.g. Hartmann 1998). It is therefore far from
obvious how the wide-ranging asymmetries observed here between first and second
conjuncts can be expressed.
movement back to its subject node once the predicate node is constructed and
decorated, which will already be decorated with a metavariable as Formula value.
A LINK transition is an available option and will, if applied to the metavariable,
project a new LINKed structure, which must be developed to have a copy of that
variable, which is provided by the second expletive. In the second structure, too,
there is no need to invoke any special device, other than the fact that the choice
of metavariable in the second conjunct must determine that the imposed LINK
requirement is met. Late*Adjunction can apply in a way that is standard for
all expletive constructions, applying at the late stage of the construction process.
Thereafter, all the processes are entirely regular. The result of processing (5.76) is
shown schematically in (5.78) with an interpretation equivalent to (5.4.4.3).
(5.77) That our analysis will fail is likely, and that our analysis will fail is not
unreasonable.
(5.78) Parsing (5.76):
?T n(0), T y(t)
T y(t),
F o(Fail′ (ǫ, x, Analysis′ (CKM′ )(x)))
conjunct that copy is established through a second application of LMVI. The lack of
restriction on where in the LINKed structure the copy of the constructed variable
should occur is determined by the weakness of the modality of the requirement
?hDiF o(U) which ranges over both dominance and LINK relations.
An unfixed node is then projected from the node decorated with the copy of the
metavariable in the first conjunct to provide the means of identifying the content
of that copy. As we have seen, because the unfixed node is projected locally to the
node in the LINK structure that is decorated with the copy, it is necessarily the
case that the right dislocated expression is interpreted as local to the propositional
structure containing the metavariable. There can thus be no dependency into a non-
local propositional structure. Compare (5.80a) with (5.80b) (with the intonational
breaks indicated by the commas).
(5.80) a. *Kim likes, and everyone who knows is bound to like Sue, the new
secretary.
b. Kim likes, and Sandy thinks that everyone who knows her is bound
to like, the new secretary.
Thus, such apparent constraint-free long-distance dependencies are in fact licensed
by Merge applying locally within some T y(t) subtree. The DS analogue of the
Right Roof Constraint is maintained and dependencies into islands are licensed by
the pronominal nature of the analysis.
We have now established that RNR can be given an analysis in terms of LINK
and *Adjunction that accounts for observed asymmetries between left and right
dislocation structures in terms of the time linearity of the parsing process.27
5.5 Summary
In this chapter, we have provided an account of a number of right peripheral con-
structions, largely focusing on English data. Pronoun Doubling (Recapitulation)
constructions have been analysed as involving a LINK transition from a completed
T y(t) to a type e tree, an analysis of which the backgrounding topic construal
is a consequence. It Extraposition and Subject Inversion have been analysed in
terms of an unfixed node introduced late in the parsing process whose resulting for-
mula replaces the metavariable projected by the subject anaphoric expression. The
obligatory clitic doubling of Spanish has been analysed in terms of the potential
availability of both LINK transitions and the late construction of an unfixed node.
Finally, Right Node Raising has been modelled using a combination of building
LINKed structures and unfixed nodes.
27 There are a number of problematic properties of the construction that we have not touched
on, however. One in particular is the observation that the right peripheral sequence may comprise
more than one constituent:
(i) ?John criticised and Mary refused to employ a friend of mine for drinking too much.
This has been denied in Postal (1998), with a critique in Levine (2000) providing a wealth of
apparently conflicting data. In fact such sequences are relatively straightforward to handle in this
framework, but they involve defining subvariants of *Adjunction, licensed relative to a range of
locality constraints, then justifying an analysis of Heavy NP Shift as an application of the locally-
restricted form of *Adjunction, which applies within the predicate sub-structure. We take up
these variants of *Adjunction in chapter 6. Here we leave these data on one side, merely noting
that they are analogous to the multiple scrambling phenomena of verb-final languages in which
more than one constituent appears to be able to occur at the left periphery, external to its own
clause (see Kempson and Kiaer (2004) and Kiaer (in preparation) for an account of such multiple
long-distance scrambling).
5.5. SUMMARY 189
Almost all these data are intransigent for most frameworks. The characteri-
sation of expletives in Minimalism involves special stipulation, with the insertion
of expletive pronouns often taken to be an operation of Last Resort to save an
otherwise irredeemable derivation. While there are notable attempts to provide
substance to such accounts (see, for example, Lasnik 1995, Holmberg 2000), they
remain both theoretically problematic (with respect to concepts of economy) and
singularly unexplanatory.
The analysis of right dislocation effects of any sort is problematic for most
theories of syntax. Within minimalist approaches which adopt Kayne (1994)’s
Linear Correspondence Axiom or some analogue thereof, all such constructions must
be accounted for in terms of leftward movement of any material which is normally
linearised to the right of the dislocated expression. Such movement requires the
proliferation of otherwise unmotivated functional projections (such as Topic and
Focus projections below the VP projection), and is required solely to preserve the
asymmetry hypothesis of no rightward movement, resulting in an account which is
little more than a description of the data with a great deal of movement machinery
(see Buering and Hartmann 1997).
Furthermore such analyses singularly fail to provide any account of the Right
Roof Constraint. This mysterious condition is quite generally problematic for all,
LFG, HPSG and CCG included, as it does not follow from any configurational
principle associating the source site with the discontinuous sequence that provides
its interpretation: it simply has to be defined as a property of these particular
structural dependencies. The DS account, however, provides a straightforward and
intuitively satisfying account of this difference between left and right dislocation,
as it is based solely on differences in the opening stages of a tree-construction
process, where trees are partial and incomplete, and later stages of the process when
the outline propositional structure is complete and its nonterminal decorations are
being compiled.
Right Node Raising has long been recognised as problematic for movement
explanations. Its combination of the display of total insensitivity to islands but
nevertheless locality of the right-peripheral expression(s) with respect to the right
conjunct is particularly hard to express. Though categorial grammar formalisms
often laud the type-lifting mechanisms defined for the ease with which they allow
the ad-hoc constituents needed to analyse Right Node Raising as a form of con-
stituent co-ordination (Steedman 1996, 2000), in practice the left-right asymmetry
effects we have observed in Right Node Raising are extremely problematic for such
accounts. This is because, given that left and right implication are basic connec-
tives, for any type-lifting operation that can be defined to license left-extraction,
the formalism will equally allow the analogous operation licensing right-extraction.
There is then also the anaphoric doubling processes at the right periphery, with
the particular twist of obligatory strong-pronoun doubling. These remain problem-
atic in minimalist accounts, often set aside (see Cecchetto 1999); and, in categorial
grammar formalisms too, this type of interaction between anaphora and structural
processes remains unaddressed. So far as we know, no current analyses of these data
can offer principled explanations of the various asymmetries set out, in Right Node
Raising, or more generally between left and right periphery effects. In Dynamic
Syntax terms however, the systematic variation between left- and right-periphery
effects is expressible through application of the same tree-growth processes with
the differences seen as a consequence of early vs late stages in the interpretation
process.
190 CHAPTER 5. ON THE RIGHT PERIPHERY
are there as promissory notes pending proper semantic characterisation (possibly in terms of some
form of evaluation).
29 Although HPSG is not explicitly multi-level, its use of discrete attributes for syntax, semantics
and pragmatics, with different principles applying to the way the information in these structures
is encoded, makes it implicitly multi-level.
5.6. DYNAMIC SYNTAX: SOME COMPARISONS 191
The main thrust of our story so far has been that the dynamics of parsing is the basis
for everything there is to say about structural properties of languages. The partial
typologies established in chapters 4 and 5 provide some preliminary confirmation
of this, with their sketches of cross-linguistic similarities in relative clause and left
and right periphery effects across a broad range of languages as illustration of
this. However, there are singular gaps in these typologies; and despite the cross-
linguistic similarities we have so far drawn attention to, there are nevertheless
major differences between different language families which seem to threaten the
extremely strong claims made. In verb-final languages, in particular, from the
starting point provided by English, everything seems to be the other way round.
First the verb generally appears last in the clause. So while in English the canonical
order is subject–verb–direct-object–indirect-object (6.1), in Japanese it
is subject–indirect-object–direct-object–verb (6.2).
(6.1) John peeled an apple for Mary.
(6.2) Hiroto-ga Akiko-ni ringo-o muita
HirotoN OM AkikoDAT appleACC peeled
‘Hiroto peeled an apple for Mary.’
In addition, unlike in English, the relative ordering within any sequence of NPs in
an individual clause is very free. Any order will do:1
(6.3) a. Hiroto-ga Akiko-ni ringo-o muita
HirotoN OM AkikoDAT appleACC peeled
‘Hiroto peeled an apple for Akiko.’
b. Ringo-o Hiroto-ga Akiko-ni muita
appleACC HirotoN OM AkikoDAT peeled
‘Hiroto peeled an apple for Akiko.’
c. Akiko-ni Ringo-o Hiroto-ga muita
AkikoDAT appleACC HirotoN OM peeled
‘Hiroto peeled an apple for Akiko.’
1 None of these sentences is the most natural way of expressing their shared content, as there
is overwhelming preference to use the topic marker -wa to indicate surface subject. But with
appropriate intonation, these are fully well-formed (and if embedded in a nominal, e.g. preceding
koto (‘fact’) i.e. as presenting a translation of a factive nominal the fact that these are perfectly
normal. So we follow the convention of recent discussions of Japanese scrambling in which, fol-
lowing Saito (1985), all sequences are given with -ga, except where discussion of properties of -wa
are directly relevant (see section 6.4).
193
194 CHAPTER 6. THE CHALLENGE OF JAPANESE
When it comes to embedding one clause within another, the freedom of noun phrases
within the clause repeats itself (though with some constraints as we shall see later),
but the verb now must be final in its clause, for both clauses, and the effect is of
one verb following the other. So the ordering can be any one of:2
subject1 subject2 object2 verb2 verb1 (6.4a)
subject1 object2 subject2 verb2 verb1 (6.4b)
object2 subject1 subject2 verb2 verb1 (6.4c)
(6.4) a. Hiroto-ga Masa-ga ringo-o tabeta to itta
HirotoN OM MasaN OM appleACC ate comp said
‘Hiroto said Masa ate an apple.’
b. Hiroto-ga ringo-o Masa-ga tabeta to itta
HirotoN OM appleACC MasaN OM ate comp said
‘Hiroto said Masa ate an apple.’
c. ?Ringo-o Hiroto-ga Masa-ga tabeta to itta
appleACC HirotoN OM MasaN OM ate comp said
‘An apple, Hiroto said Masa ate.’
But there is no possible order:
*subject1 subject2 object2 verb1 verb2 (6.5)
(6.5) *Hiroto-ga Masa-ga ringo-o itta tabeta to
HirotoN OM MasaN OM appleACC said ate comp
‘Hiroto said Masa ate an apple.’
In short, a common pattern in Japanese is for there to be a sequence of noun phrases
followed by a sequence of verbs.
However, it is not only verbs which occur last in the structure they induce. So
too, canonically do nouns. That is, in relative clauses, the sense of everything being
the other way round repeats itself, for in Japanese the head noun always follows
the relative. What is said in English as (6.6) is said in Japanese as (6.7) with its
literal English construal in (6.8).
(6.6) An apple that Mary peeled was tasty.
(6.7) Hiroto-ga muita ringo-ga oisikatta
HirotoN OM peeled appleN OM tastyP AST
‘An apple which Hiroto peeled was tasty.’
(6.8) ‘Mary peeled apple tasty.’
There is no relative pronoun. It is simply that the clause to be interpreted as a
relative normally has one argument missing, which is provided by the head of the
relative, which follows.3 There is also an added complication that Japanese has no
determiner (notice the single word ringo translated as ‘an apple’).
The problem that this ordering poses is that in parsing a sequence of noun
phrases followed by a verb, a choice has to be made at some point as to which of
the noun phrases in the sequence are part of the relative clause, and which are not.
In (6.9), all the noun phrases are part of the relative clause:
2 Case-marked left-peripheral noun phrases that precede the expression taken to be the matrix
subject but yet are interpreted as in some subordinate structure are of uncertain status. See
section 6.3.
3 One of these arguments might be expressed with the topic marker -wa to avoid repetition,
the verbal form tabeta (‘ate’), this takes the form of projecting a full propositional
template with argument nodes decorated with metavariables. This indicates a more
radical form of pro-drop than is found in the Greek, Spanish and Italian examples
in previous chapters where only the subject node is type-complete. Otherwise, the
lexical entry for the verb tabe is very similar for that given for the Italian verb parla
in (5.34) on page 171.5
(6.12) tabe
IF ?T y(t)
THEN make(h↓0 i); go(h↓0 i); put(F o(U), T y(e), ?∃x.F o(x)); go(h↑0 i);
make(h↓1 i); go(h↓1 i); put(?T y(e → t));
make(h↓1 i); go(h↓1 i);
put(F o(T abe), T y(e → (e → t)), [↓]⊥); go(h↑1 i);
make(h↓0 i); go(h↓0 i); put(F o(V), T y(e), ?∃x.F o(x))
ELSE Abort
Noun phrases also are frequently composed of just one word, e.g. ringo-o in (6.2)
and many other examples. So, in like manner, nouns in Japanese are taken to
project a full term of type e, introducing a new variable, and an operator to bind
it. A noun like ringo will thus be associated with (at least) the set of actions
shown in (6.13) which effectively combine the actions associated with parsing the
two words an apple in English (see also section 6.5.1).6
(6.13) ringo
IF ?T y(e)
THEN make(h↓1 i); go(h↓1 i); put(T y(cn → e), F o(λP.(ǫ, P )), [↓]⊥);
go(h↑1 i); make(h↓0 i); go(h↓0 i); put(?T y(cn));
make(h↓1 i); go(h↓1 i); put(T y(e → cn), F o(Ringo′ ), [↓]⊥);
go(h↑1 i); make(h↓0 i); go(h↓0 i); freshput(x, F o(x))
ELSE Abort
So from a minimal sequence of words, (6.14) has just three, a full propositional
template may be induced complete with quantificational terms:
(6.14) Hiroto-ga ringo-o tabeta
HirotoN OM appleACC ate
‘Hiroto ate an apple.’
This means that there is a considerable gap between richness of morphological in-
formation and richness of structural information; but this separation of surface form
and decoration on the semantic tree is unproblematic in DS, given that the concept
of lexical content is not just the provision of some Formula value but a macro of
actions for building up emergent semantic structure. Indeed it is at least in part
the richness of information provided by individual words which underpins the very
great flexibility in the NP sequences allowed. The actions that lead to the requisite
logical structure do not turn on the subject and object-denoting expressions being
in any particular order, for full predicate-argument structure is provided by just
one word – the verb.
5 We return to the tense specification directly.
6 Thisanalysis is inconsistent with Chierchia (1998), who analyses quantification in languages
such as Japanese as semantically distinct from the basis for quantification in languages such as
English. In this framework, both types of analyses are expressible (see chapter 8), but in the
Japanese case we presume that the difference is merely one of how much of the containing logical
structure is projected by the word itself.
197
6.0.2 Scrambling
The flexibility in ordering of noun phrases is however very far from unproblem-
atic, and this phenomenon, called Scrambling (Ross 1967), has been the focus of
a great deal of attention over an extended period, at least within transformational
frameworks.7 Despite all this attention however, scrambling remains resistant to a
unified form of analysis in all current frameworks.
One of the cross-theoretic debates has been whether these languages are as
configurational in their structural properties as more familiar languages such as
English. The apparently flat sequence of noun phrases preceding the verb with
their multiple possible orders may be taken as evidence for the lack of any verb
phrase constituent. Can they, that is, be taken to project a VP node in any regular
sense, if the noun phrases can occur in an order which makes the assignment of
such a constituent impossible? LFG analyses propose that scrambled sentences are
analysed with a non-binary flat structure at the level of c-structure, while other
semantically related levels encode their thematic and semantic predicate-argument
properties (Bresnan 2001, Dalrymple 2001, Nordlinger 1998). HPSG analyses too
project such sequences as flat at the level of the string, separating out configu-
rational principles from linearity, with superimposed linearisation principles. For
example, Reape (1994) defines discrete domains with relations between domains
defined hierarchically, order internal to any one domain (roughly that of a clause)
being unspecified. Kathol (2000), in addition, defines a topological concept of fields,
based on the concepts of vorfeld and mittelfeld in traditional German grammar, in-
ternal to which idiosyncratic ordering statements are definable. Combinatory Cat-
egorial Grammar might seem best suited to languages with such free permutation,
since it allows very great flexibility in the creation of constituents (Hoffman 1995),
but this turns out to provide too great a flexibility, and the restriction on possible
permutations is very hard to state (Baldridge 2002).8 The optionality of all argu-
ments provides an additional problem, as given that expressions are typed as to the
number and type of expressions they combine with to form a clausal sequence, all
verbs will have to be assigned multiple types to match the full range of arguments
expressible.
Even within a movement-based framework, in which most work on Japanese
scrambling has been done, the analysis of these data remains controversial.9 The
problem is that the form of movement apparently needed to express the word order
variation displayed in scrambling data has never fitted well with assumptions of
movement that have been articulated in the theory (Mahajan 1990), and many
analyses invoke more than one process of scrambling. The problem has taken on
new urgency with the Minimalist assumption that movement operations should be
obligatory, for scrambling is transparently optional, with change of word order in
some cases, but only some cases, being associated with change of interpretation.10
7 Amongst movement analyses, see Hale (1983), Saito (1985, 1992), Speas (1990), Diesing
(1992), Bošković and Takahashi (1998), Saito and Fukui (1998), Karimi(ed.) (2003) and many
others.
8 See Kempson and Kiaer (2004) for an account of just why scrambling is so problematic for
the task of enforcing the requisite movement (Kiss 2003, Miyagawa 2003, Maki and Ochi 1998).
But invoking such features solely in order to trigger scrambling equally threatens the content of
the Minimalist claim; and analyses are also proposed in terms of base generation of the scram-
bled strings, with LF lowering (Bošković and Takahashi 1998). Yet others invoke concepts of
information-restructuring (Bailyn 2003) (though the status of such discourse-based explanations
within a grammar formalism which eschews all reference to phenomena of use is unclear).
198 CHAPTER 6. THE CHALLENGE OF JAPANESE
In the original wh data cited by Saito (1992), in which the wh expression can
be construed in the subordinate structure suitably marked by the -ka Q-marker,
there is no change of meaning and the correspondence between the two is said to
involve “radical reconstruction”, but in mixed quantification sentences, the order in
which expressions occur does matter. (6.15) is said to be unambiguous, but (6.16)
is ambiguous:11
(6.15) dareka-ga hotondo-no uta-o utatta
someoneN OM mostGEN songACC sang
‘ Someone sang most of the songs.’ (unambiguous: some < most)
(6.16) hotondo-no uta-o dareka-ga utatta
mostGEN songACC someoneN OM sang
‘Most of the songs, someone sang.’
(ambiguous: some < most/most < some)
The idiosyncratic property of indefinites in allowing wide scope effects that we saw
in chapter 3 repeats itself in Japanese, except that when the indefinite precedes an-
other quantifying expression, it appears that linear order is providing an additional
constraint.
A problem that scrambling poses for all movement accounts (as noted by Saito
1992) has turned out to be particularly troublesome for a Minimalist account. The
problem is the feeding relation between rules. In principle a complement noun
phrase can be dislocated out of some embedded clause, as in (6.17), and, equally,
a complement clause itself can be moved to the left of the matrix subject as in
(6.18):
the appropriate LF configuration (Saito 1985), rather than any radical reconstruction.
12 In this example, we provide indication of traces, as in movement analyses, in order to bring
out the problem for analyses of this type. There is no commitment to any such movement in the
DS account.
13 Saito (2003) defines a particular restriction on Merge to state the facts appropriately relative
What we have to hand is the concept of having unfixed nodes early on in the
parse process; but it is far from obvious that this provides sufficient expressive power
to express this heterogeneous set of facts. As so far defined, with the process of
*Adjunction introducing one unfixed node within an individual tree, the answer
is pretty clearly “No”. The phenomenon of very free ordering without any necessary
substantive difference in interpretation is not adequately reflected in the distinc-
tion between the building of linked structures vs the characterisation of one unfixed
node within an individual tree. A linked-structure analysis reflects an anaphorically
established relation between otherwise distinct structures; and *Adjunction reflects
the isolatability of one constituent across an indefinite structure within an individ-
ual tree from its point of construal. The phenomenon of locally free constituent
ordering is something else again. Yet, as it turns out, there is a natural extension
of the vocabulary we have already set up which does reflect the data very natu-
rally. This is to extend the concept of varying locality constraints on the resolution
of Formula underspecification (familiar from the Binding Principles distinguishing
reflexives, pronouns and names (Chomsky 1981)) to the resolution of underspeci-
fication of structure, defining an analogue of the binding principles for structural
underspecification. Dynamic Syntax is uniquely set up to explore this particular
form of parallelism between anaphora and long-distance dependency effects, as it is
only in this framework that such effects are expressed as a form of underspecifica-
tion. These in combination will make possible an account of scrambling. We start
from the regular process of *Adjunction, and modify it in two directions, one more
local, one less local.
But this is what the framework disallows, for exactly the reasons stated above for
*Adjunction. There is, fortunately, a simple solution to this problem, provided
by overt case marking. Case can be seen as fulfilling two distinct roles. On the one
hand, it can be used to define filters on output (see chapter 2), imposing require-
ments on a node which constrain subsequent development. This means imposing
a requirement, say for object marking, of decorating a node whose position in the
resulting tree is with a mother of predicate type, i.e. imposing on a term node the
requirement ?h↑0 iT y(e → t). The object-marking suffix -o could accordingly be
defined as in (6.23).15
IF T y(e)
THEN IF h↑∗ i(T n(a)∧?T y(t))
(6.23) -o THEN put(?h↑0 iT y(e → t))
ELSE Abort
ELSE Abort
Nothing forces this requirement to be met at any particular stage in the construc-
tion process: hence its use in earlier chapters in connection with long-distance
dependency. An unfixed node, in particular, can be decorated with such a case
specification to constrain where the node is to be merged. This case specifica-
tion does not have any encoded pointer-return process. This is instead ensured
by Completion, suitably generalised to cover the introduction of locally unfixed
nodes.16
On the other hand, case can also serve a more constructive role, in some sense
inducing a structure-building action as part of the incremental structure-building
process, as it appears to in these local scrambling structures. This is indeed the
form of analysis proposed in several different frameworks.17 It might seem in this
framework that such a move would have to be defined as a quite separate form
of action invoking distinct lexical ambiguity, but, as it turns out, these two roles
of case-marking, as a constraint on output and as structure building, can both be
served by the one output filter mechanism, given the particular locality restriction
defined by local*adjunction. Like the other adjunction processes, this is defined
using the Kleene* operator which covers a disjunction of possibilities, but because
the underspecified relation is defined as h↑0 ↑1∗ iT n(a), it makes available only three
possible subsequent forms of development given the restricted adicity of verbs in
natural-language.18 There is h↑0 iT n(a) (subject) where h↑1∗ i is satisfied by the
empty set of such relations,19 h↑0 ih↑1 iT n(a) (direct object), and h↑0 ih↑1 ih↑1 iT n(a)
(indirect object). This is where the role of the case specification comes in: the
effect of processing the case marker will automatically rule out all but one of these
possible developments. For example, the parsing of the accusative case-suffix -o,
having decorated a node introduced by Local*Adjunction immediately rules
out all further tree developments except those in which the relation of this node is
15 As noted in chapter 2 footnote 10, we do not in this book provide a full theory of case-marking.
In particular, we do not account for the so-called semantic uses of case and have nothing to say
about case alternations or quirky case. These are significant issues, but whatever solution is
found for them within DS, it should remain the case that case-marking of subjects and objects
will provide a means of linking arguments with their relevant argument slots, represented in DS
in terms of hierarchical tree relations. So, even if the precise characterisation of case given for -o
(and other case-markers) turns out not to be ultimately tenable, nevertheless what we say about
its use in the analysis Japanese scrambling should not be significantly affected.
16 I.e. with µ allowed to range over ↓1 ↓ and µ−1 over ↑ ↑1 in (3.12) on page 78.
∗ 0 0 ∗
17 See Nordlinger (1998) for a constructive account of case in LFG, and Kracht (2002).
18 If adjuncts constitute optional arguments, this list would be increased, but would still consti-
fixed as object, despite the case specification itself being only a filter on the output
tree.20 Hence the constructive effect of case in short-scrambling environments. The
process of parsing ringo-o in (6.20), repeated here, is shown in (6.24) (ignoring the
internal structure of the term projected by ringo):
20 We are grateful to Wilfried Meyer-Viol for pointing this out to us, and for the discussions
Nothing in the characterisation of any case suffix forces the node it decorates
to be introduced in a particular order; and, as the display in (6.26) shows, there is
no record on the tree of which node was constructed first. In every case, according
to this derivation, the node is introduced by constructing an unfixed node, and
the local relation to the dominating type t-requiring node is fixed following the
action dictated by the case specification. The subsequent projection by the verb
of a full propositional template of predicate node and array of argument nodes is
unproblematic. If any of the argument nodes of this template have already been
introduced, they will simply vacuously be duplicated by the associated argument
node of the predicate. Indeed it could not be otherwise, as two nodes with the
same tree-node address decorate one and the same node. This is the same phe-
nomenon as the problem of multiple unfixed nodes. Once the verb is introduced,
each argument node has an assigned fixed tree-node address (see chapter 2).21
No application of merge is required to be externally imposed. The two sets of
actions, the fixing of position through case markers and projecting a full proposi-
tional template by the verb, simply create one and the same tree-relation. Thus
in the parsing of (6.20), the tree-update actions provided by the verb induce a
propositional template of structure, essentially introducing the functor node which
it decorates with the formula F o(Tabe′ ), the other argument nodes having already
been constructed: notice in (6.27) the F o(U), F o(Hiroto′ ) decorating the subject
node and F o(V), F o(ǫ, x, Ringo′ (x)) decorating the object node.
(6.27) Parsing Ringo-o Hiroto-ga tabe:
T n(0), ?T y(t)
With these actions completing the outline of structure, decoration of all nonterminal
nodes duly takes place.
21 The case suffixes in Japanese are optional, and, if omitted, necessitate other ways of determin-
ing construal. One strategy is to use computational actions to induce subject predicate structure
(see Kempson et al. 2001), as SVO ordering, in any case generally taken as the canonical ordering,
will match such top down actions. Any variation from this without case-marking will rely solely
on pragmatic considerations or contingent knowledge of individuals and events described.
204 CHAPTER 6. THE CHALLENGE OF JAPANESE
As we shall see in addressing long-distance scrambling, things are not quite this
simple, as long-distance scrambling is apparently precluded for -ga-marked subject
expressions. Some additional specification of what -ga contributes is needed to
reflect this asymmetry. Nonetheless, the essential dynamics of short scrambling is
to introduce an unfixed node, decorate it, and then fix its tree relation to the locally
dominating type t-requiring node, all as a consequence of Local*Adjunction.22
details of all floating quantifiers, either preceding or following a constructed term, remain to be
explored, as does plural quantification in general.
24 See Perrett (2000), Kurosawa (2003) for discussion, and Shirai (2004) for a more detailed DS
Recall from chapter 3 that hU i is defined as the reflexive transitive closure of the
union of the inverse-LINK (hL−1 i) and mother (↑) relations, so hU iX holds at
some node n if somewhere along a sequence of relations, including either h↑i or
hL−1 i, X holds. Like the relation between h↑∗ iT n(a) and h↑0 ↑1∗ iT n(a), there is an
entailment relation between h↑∗ iT n(a) and hU iT n(a), though this time h↑∗ iT n(a)
is the more restricted relation. But, again, because a discrete operator is defined,
the two relations can be distinguished in partial trees containing both relations.
To introduce such a node, we define a process of Generalised Adjunction
whereby a node can be introduced that matches in type the node from which the
relation is induced, but which can hold across any arbitrary relation to the input
node. The rule is given in (6.31) and its associated display in (6.32). In tree-
diagrams representing this tree relation, we use a dotted line, distinguishing it from
the dashed line indicating application of *Adjunction.
(6.31) Generalised Adjunction (Rule):
{. . . {T n(a), . . . , ?T y(t), ♦} . . . }
{. . . {T n(a), . . . , ?T y(t)}, {hU iT n(a), ?∃x.T n(x), . . . , ?T y(t), ♦} . . . }
T n(a), ?T y(t)
This process is one which allows a node to be, as it were, pulled apart from the
place in the tree from which it was introduced for further modification. There are
at least two structure types in English which appear to motivate such a process,
the so-called preposed clausal adjuncts, and genitive constructions:
(6.33) a. Having once had a fright by drinking too much, I am sure Tom will
be careful not to do so at his party this time.
b. The King of England’s mother’s brother’s wife has disappeared.
Though these are very different constructions, they both pose the problem that the
local projection of structure apparently needs to be nested at arbitrary levels of
embedding with respect to the root.
In Japanese, as we shall see, structure is quite generally developed without any
indication of its contribution to the overall structure:27
(6.34) a. Hiroto-ga muita to itta
HirotoN OM peeled comp said
‘Hiroto (or someone else) said that he peeled it.’
27 Given the full pro-drop nature of the language, in these and subsequent examples, other
as a LINKed structure, the safe option at the outset of every parse would be to presume on a
step of Generalised Adjunction. However, again as we shall see, the existence of the option
of building a LINKed structure is paired with the very general use of topic-marking to indicate
surface subject, so, as things turn out, Japanese processing does not require any such default
assumption of complexity.
29 As we shall see in due course, the topic marker -wa is proto-typically used to indicate the
matrix subject, so without such marking a common assumption is that the structure under con-
struction is subordinate, an assumed strategy for analysis which would be confirmed by the -ga
marking.
30 In this and subsequent trees, the internal structure of the terms will not be shown where this
is not relevant to the discussion. Types are also omitted where not relevant.
208 CHAPTER 6. THE CHALLENGE OF JAPANESE
T n(0), ♦
With no use of -wa to mark a surface subject, the first morphological sign of
anything other than that this string is a matrix clause comes with the parsing of
the suffix -to. In the construction of the requisite structure, the complementiser
has a clear role to play as the next step needed to induce structure for (6.35): it
marks the completion of a propositional sub-structure. The nesting it creates can
be achieved in one of two ways, as reflected in the disjunctive set of lexical actions
(6.39): either by making use of structure already induced and returning the pointer
there, or by adding a further intermediate node locally dominating the node just
completed:31
IF hU iT n(a), F o(α), T y(t)
THEN (put(h↑0 ih↑1 iT n(a));
go(h↑0 ih↑1 i)
(6.39) -to ∨
(make(h↑∗ i); go(h↑∗ i);
put(?T y(t), h↓1 ih↓0 iF o(α));
ELSE Abort
The actions of -to, that is, dictate how or whether the pointer returns from the
completed propositional structure in (6.38) to the root.
Following the regular pattern of suffixes, a completed type specification, here
a type t formula, is the condition necessary for the update given by -to to be
carried out. What -to imposes, as a result, is obligatory local subordination. Thus,
pursuing the parse of (6.37), we need only the first alternative of returning to the
root, in so doing determining the local relation between it and the node initially
introduced by Generalised Adjunction as shown in (6.40):32
pressing dialectal variation in the use of -to. In some dialects, e.g. the Osaka dialect, use of -to,
like case, is optional. On this analysis, the dialect difference lies in whether the updating of the
tree is by lexical or computational action. In standard Japanese, there is no generalised convention
of return of the pointer from any node: such a computational action is only applicable to nodes of
T y(e). In the Osaka dialect, this is generalised to apply to nodes with formulae of type t. In fact,
as we shall see shortly the action of using a node already introduced and enriching the relation in
question is very general in Japanese, but here we keep the disjunction, as with -to, like -ga, the
effects are strictly encoded.
32 We leave the propositional formula here in its unevaluated form for simplicity, ignoring the
T n(0), ♦
h↑1 iT n(0)
The result of carrying out the actions induced by -to, on either alternative,
is that the pointer will be at a node from which the subsequent verb itta will
be able to project its propositional template. From this node, the propositional
template of structure provided by the verb itta is constructed (with metavariables
F o(U), F o(V) decorating its argument nodes), in the way we have already seen. In
this derivation, the already partially constructed propositional structure induced
from the parsing of the initial clausal sequence will collapse with its object argu-
ment, the two nodes being non-distinct, as shown in (6.41).
(6.41) Parsing Hiroto-ga ringo-o tabeta to itta:
T n(0), ?T y(t)
F o(U),
?∃x.F o(x), ?T y(e → t), ♦
T y(e)
term F o(ι, y, Hiroto′ (y)) on the tree once constructed would not be appropriate. The DS analogue
210 CHAPTER 6. THE CHALLENGE OF JAPANESE
non-root nodes completed, the topnode of the tree can finally be decorated with
the formula (suppressing tense specifications):
Notice that we are building up semantic structure, subpart by subpart, with each
suffix, whether case, tense, or complementiser, determining the full compilation of
semantic interpretation that is possible at that stage in the interpretation process
prior to the subsequent structure-building step.
This derivation, though providing a natural interpretation, is by no means the
only possible interpretation that can be assigned to (6.37). An analysis closer to
that assumed in current syntactic accounts might be to assume as a first hypothesis
that the subject-marked noun phrase is to be projected as the matrix subject at
the early stage at which the noun phrase itself is parsed. Such choices are always
available. At each step, there is choice as to whether to interpret all noun phrases
in the sequence as arguments of the subordinate clause, or whether to interpret the
presented expressions as providing arguments of the matrix predicate, by making
alternative selections of the nested arguments from some independent context. In
Japanese, any argument of a predicate may be identifiable from context. (6.42) in
a context of identifying who has eaten some cake might well mean that Akiko said
to Halimah that some contextually identified person had eaten the cake:
(6.42) Akiko-ga Halimah-ni tabeta to itta
AkikoN OM HalimahDAT ate comp said
‘Akiko said to Halimah that Tom ate the cake.’
If Generalised Adjunction is taken to have applied following the processing of
the first -ga marked expression in (6.37), the following object-marked node would
then be constructed relative to a lower level of embedding, and the subordinate
subject be identified anaphorically. Given that -wa is characteristically used to
indicate which expression is to be construed as matrix subject, this is not the
natural interpretation. Nevertheless, this possibility gives a glimpse of the large
numbers of sequences of actions available for an individual string, with variability
even for one possible outcome.
Before we turn to a discussion of locality constraints involved in construing noun
phrases within scrambled embedded contexts, we briefly show how more complex
levels of embedding can be handled in this analysis. In examples with more than
one level of embedding, as in (6.43), the second form of action provided in the
specification of -to will be required. One analysis of the initial string in this example
would thus require the construction of the tree in (6.44) from that in (6.38), licensed
by the second set of actions in (6.39).
(6.43) Hiroto-ga ringo-o tabeta to itta to omotteiru
HirotoN OM appleACC ate comp said comp thinks
‘Hiroto ate an apple, he said, he thinks.’
of the binding constraints are filters on choosing values in constructing interpretations for the
natural language expressions, e.g. the name Hiroto. They do not apply to the terms in the
formula language; and so do not apply to any logical term decorating a node in the output
structure.
6.2. GENERALISED ADJUNCTION 211
hU iT n(0), ?T y(t), ♦
Notice what has happened here. Given that two NPs both marked with -ga as in
(6.46) cannot be resolved in the same local domain,34 the only possible sequence
of transitions which allows the parse of Akiko-ga is one in which Generalised
Adjunction applies following the parsing of the first subject expression, Hiroto-
ga.35 But on such a transition, the propositional domain now under construction
is the radically unfixed one. The expression Masa-ni following Akiko-ga can only
lead to interpretations in which either the two NPs are interpreted as co-arguments
of some lower predicate (as indicated in the display provided), or where Masa-ni
is interpreted as contributing to some structure at a further level of embedding
(following yet a further step of Generalised Adjunction). What is excluded is
its projection as co-argument with the node decorated by the actions of Hiroto-ga,
for there is no possibility of pointer movement back across completely unrestricted
tree relations to arbitrary nodes already introduced. Parsing the dative can yield a
tree (something) like that in (6.48), which is developed through a parse of the first
verb muita to yield a tree in which Masa is the beneficiary of Akiko’s peeling an
apple.36
34 Apart from the pairs of -ga marked expressions which dictate a more abstract subject-
predication relation. See footnote 39.
35 There cannot be a step of *Adjunction as the type-t-requiring node already has one node
So far the predictions match those of other frameworks. However, with the
occurrence of the matrix subject after the embedded clausal sequence, this analysis
leads to predictions which depart from movement analyses, in particular. On the
present analysis, the occurrence of the dative-marked NP following -to as in (6.49)-
(6.50), must be interpreted relative to the matrix subject, and not as contributing
a term in the subordinate structure:
(6.49) Akiko-ga ringo-o muita to Hiroto-ga Masa-ni itta
AkikoN OM appleACC peeled comp HirotoN OM MasaDAT said
‘Hiroto said to Masa that Akiko peeled an apple.’
6= ‘Hiroto said that Akiko peeled an apple for Masa.’
(6.50) Akiko-ga ringo-o muita to Masa-ni Hiroto-ga itta
AkikoN OM appleACC peeled comp MasaDAT HirotoN OM said
‘Hiroto said to Masa that Akiko peeled an apple.’
6= ‘Hiroto said that Akiko peeled an apple for Masa.’
As we have already seen, once the pointer has moved down to some subordinate
structure, there is no return to a higher point in the tree across a relation con-
structed using Generalised Adjunction, until that intermediate tree is com-
pleted. Yet once that intermediate tree is completed, the pointer moves on to a
dominating node, with the complementiser having imposed the completion of that
subordinate structure. The pointer is at that later point placed at whatever node
locally dominates the node decorated by Hiroto-ga. Hence the only interpretation
for either (6.49) or (6.50) is one in which the term decorated by Masa-ni modi-
fies the predicate applied to F o(ι, y, Hiroto′ (y)) (either as third argument, or as
adjunct, depending on the analysis attributed to the dative).37
This result is not predicted by movement accounts. To the contrary, on those
accounts, (6.50) ought on the face of it to allow an interpretation in which the
dative-marked noun phrase, Masa-ni, is understood as an argument in the subor-
dinate structure. This is because there is a possible sequence of steps that first
moves the dative NP from the complement clause to left-adjoin to the containing
37 In all of (6.46)-(6.50), the fixing of the subject relation in processing -ga means that the
construction of the full template of structure projected by the verb will involve constructing the
subject relation non-distinctly, as in the simpler cases, assigning the subject node a metavariable
as Formula decoration emptily, given the presence of an already determined Formula value.
214 CHAPTER 6. THE CHALLENGE OF JAPANESE
introducing either LINKed structures or unfixed nodes in the latter stages of the interpretation
process, analogous to (ii):
6.3. LONG-DISTANCE SCRAMBLING 215
distinguishing feature of not merely imposing the requirement for relative position
in a tree, but also of making that node relation and moving the pointer back up to
what is now the immediately dominating type-t-requiring node as in (6.55).39
IF T y(e)
THEN IF h↑∗ i(T n(a)∧?T y(t))
THEN put(h↑0 iT n(a));
(6.55) -ga
make(h↑0 i); go(h↑0 i)
ELSE Abort
ELSE Abort
What we have done in defining -ga in this way is to lexically encode the very ac-
tions which otherwise would emerge as a mere consequence of the interaction of the
output filter which the case specification might impose (?h↑0 iT y(t)) and the locality
constraint imposed by Local*Adjunction. This encoding of what is otherwise a
consequence of interaction between case-marking and a computational process will
force the fixing of an immediate dominance relation to the node from which it was
constructed in all environments, whether introduced by Local*Adjunction or
by *Adjunction. So *Adjunction will be applicable as input to the parsing of
a -ga marked expression, but it will never give rise to long-distance dependency
effects.
In pursuing the applicability of *Adjunction for the other case-specifications,
what we now find is that, despite its availability in principle in Japanese, the imple-
mentation of *Adjunction in well-formed derivations is far from unproblematic.
This is because, in a language such as Japanese, fixed subordination relations are
only established by the complementisers after some propositional structure is in-
duced. The step that initiates such embedded structures gives no indication at all of
the level of embedding, as we saw above. The problem with examples such as (6.54)
is that in pursuing the parse beyond the identification of the matrix subject, *Ad-
junction cannot be used to introduce the subordinate structure, because there is
already an unfixed node whose update has not yet been provided, and only one such
unfixed node can be introduced at a time. But this means that the transition from
main clause structure to subordinate structure between the parsing of Hiroto-ga
and the parsing of Akiko-ga has to be constructed by application of Generalised
Adjunction, as we saw earlier in section 6.2.2. This in its turn creates a further
39 There are uses of -ga which appear not to mark subject position. They fall into two classes,
those which involve a concept of subject at a suitable level of abstraction, indicating predication
on the first -ga marked term to be constructed from the remaining string, (i), and particular
stative verbs which appear to have to be itemised as taking an idiosyncratic use of ga-marking,
(ii):
(i) usagi-ga mimi-ga nagai
rabbitNOM earNOM long
‘Rabbits’ ears are long.’
(ii) John-ga nihongo-ga wakaru
JohnNOM JapaneseNOM understands
‘John understands Japanese.’
It is arguable that the first type can be expressed while retaining the characterisation of -ga
suggested here, by presuming on variable adicity of nominal-internal predicates (mimi (‘ear’) in
(i) functioning as a two-place predicate ‘ear -of’), following the direction of Marten 2002b. Takano
2003 also has arguments for a bi-clausal analysis even in the case of verbs such as wakaru, as in
(ii), which turn on difference in scope effects for -ga and -o marking, the former giving rise to wide
scope effects, the latter not. Since scope is defined over propositional domains, we would take
such arguments to apply in this framework to motivate assigning structure with more than one
type t-requiring node. If such an account can be substantiated, the amount of lexical stipulation
specific to -ga then reduces to those stative verbs that cannot be analysed in terms of projecting
more abstract structure involving nesting of one propositional structure within another, a very
restricted set.
6.3. LONG-DISTANCE SCRAMBLING 217
hurdle, as the resulting structure is too weak to license Merge of the unfixed node
originally introduced by *-Adjunction:
(6.56) Parsing (6.54):
T n(0), ?T y(t)
F o(W), ♦ F o(Tabe′ )
A step of Merge cannot be applied to unify the node decorated with F o(ǫ, x, Ringo′ (x))
and the object node for tabe, because the application of Merge depends on a pro-
cess of evaluating the unfixed node successively down some tree under construction
across a succession of daughter relations, as discussed in chapter 2. Generalised
Adjunction does not, however, provide the appropriate structural environment to
allow this, since what it defines is a transition which is a disjunction across LINK
or daughter relations.
This may seem to enforce a characterisation of all such strings as incapable of
yielding a logical form as a result, hence ungrammatical. Yet there is a simple
and monotonic repair process. Given that the formal system allows interspersing
of pragmatic enrichment processes with the mechanisms which encode the building
of partial structures, all that is required to achieve a parsable string is to assume
that pragmatic enrichment, as a generally available cognitive process, can apply not
only to formula enrichment as in anaphora resolution, but also to structural enrich-
ment. This is hardly contentious, since enrichment of stimuli is a general cognitive
phenomenon, not one specific to a certain mode of representation. On the contrary,
given the extension of the concepts of underspecification plus update from anaphora
to these more general structural processes, such an enrichment step is exactly what
we should expect. So, suppose we assume that a hearer, when faced with an overly
weak tree relation such as constructed by Generalised Adjunction, has the op-
tion of enriching it from hU iT n(a) to a fixed relation h↑0 ih↑1 iT n(0). The required
step of Merge is now licensed to identify ringo-o as providing the required object
argument for tabe, as shown in (6.57). The significance of such a step is that it is
not morphologically triggered: it is a step of abduction, and what is required here is
a meta-level process of reasoning. Being a pragmatic and system-external process,
any such choice should be expected to be associated with general cognitive con-
straints; and it is notable that such strings, often initially rejected by speakers, are
judged to be acceptable when the context relative to which the string is interpreted
has already set up the type of structure which constitutes the enrichment step that
has to be made, i.e. when the required structural relation is available from the
immediate context. This is the structural analogue of identifying the appropriate
substituend for interpreting a pronoun from the immediate context.
218 CHAPTER 6. THE CHALLENGE OF JAPANESE
hU iT n(0),
h↑0 ih↑1 iT n(0), ?T y(t)
F o(W), ♦ F o(Tabe′ )
indicate that this is overwhelmingly the general pattern in Turkic, Polynesian, Japanese and
Korean.
6.3. LONG-DISTANCE SCRAMBLING 219
is construed as the matrix subject constitute what have been labelled ‘surprising constituents’
(Takano 2002). They have to be construed as local to each other, as an incomplete propositional
structure. In these non-finite constructions, analysing them as forming a partial propositional
structure, hence a constituent, is not essential since the case-marking allows an analysis in which
the causative sase, by a word-formation process in the lexicon, increases the adicity of the pred-
icate to which it is suffixed, with -ni being defined to range over both nested subjects and the
more regular indirect object construal, as in many languages. However such pairs of expressions
functioning as co-arguments of some embedded predicate may also occur in finite constructions,
and in these they require analysis as a partial propositional structure containing only argument
nodes. Given the DS perspective, these can be straightforwardly characterised in virtue of the
licensed feeding relation between *Adjunction and Local*Adjunction. See Kempson and Kiaer
(2004) and Kiaer (in preparation) for analysis of such data in these terms.
220 CHAPTER 6. THE CHALLENGE OF JAPANESE
their significance.
6.3. LONG-DISTANCE SCRAMBLING 221
locality of all other expressions to the structure immediately containing the sub-
ject node decorated by the actions of Saito-ga is exactly as we expect. In marked
contrast, on an analysis of (6.61) as involving left-peripheral separation of the -ga
marked phrase from the remainder (by *Adjunction), with -wa taken to initiate
the construction of the surface subject, this is exactly what would not be expected.
So these facts strongly buttress the proposed account of -ga as locally fixing the
relation of the node it decorates.
Secondly, the construal of left-peripherally placed -ni marked expressions as
subordinate equally buttresses the proposed analysis of -ga. For a pair of -ni-
marked expressions, it is indeed pragmatic considerations that are operative, as in
(6.62). Speakers report that (6.65)-(6.66) are both ambiguous, though with the
first -ni marked expression much preferred as the understood embedded subject in
both cases:
(6.65) Akiko-ni Taro-ni John-ga kisu-saseta
AkikoDAT TaroDAT JohnN OM kiss-caused
‘John caused/let Akiko kiss Taro.’ (preferred)
‘John caused/let Taro kiss Akiko.’
(6.66) John-ga Akiko-ni Taro-ni kisu-saseta
JohnN OM AkikoDAT TaroDAT kiss-caused
‘John caused/let Akiko kiss Taro.’ (preferred)
‘John caused/let Taro kiss Akiko.’
This availability of inverted subject interpretations extends to Korean, where
subject-marking freely allows inverted scope interpretation as long as the context
makes this plausible. So, it appears to be only with contiguous sequences of -ga-
marking that general cognitive considerations have no impact, as notably displayed
with (6.53). But this suggests that it is indeed an encoded property of -ga that
prevents such displacement: because only in this case does the morphological spec-
ification induce structure and move the pointer on to the immediately dominat-
ing node just created with its requirement ?T y(t), hence precluding return of the
pointer to the higher node for further development. (Completion, recall, applies
only to type-completed nodes.) If the constraint were pragmatic only, exactly the
same constraints should operate to determine construal of expressions, whether -ga
marked, -ni marked, or as in Korean subject-case-marking too: all should be alike.
Thus we take these data to confirm that the correct analysis does not involve
the construction of one isolated node by application of *Adjunction for the -ga
marked expression to decorate.
in the scrambling literature, as without a full account of plurals, let alone reciprocals, any such
discussion would be premature in a system committed to formally characterising both syntactic
and semantic properties. (But see chapter 7 for an initial characterisation of plurality.)
222 CHAPTER 6. THE CHALLENGE OF JAPANESE
because any node whose tree node relation is not fixed will be associated with a
subsequent update process. The fact that there has to be such a subsequent process
means that there will be a distinct point in the construction process at which any
aspects of underspecification left without update at the earlier step of constructing
the unfixed node can be resolved at this second stage:
(6.52) zibunzisin-o Taro-ga semeta
selfACC TaroN OM blamed
‘Himself, Taro blamed.’
Because all rules are optional, nothing forces the substitution of a term to update
the meta-variable projected by the anaphor in (6.52) at the point at which the
unfixed node is decorated. The pointer can be moved on from this node without
any such move, given the provision of a type specification for the meta-variable
projected by the anaphor, as it is only fixed type-assignment that is critical to
successful processing of the case-marker -o. Completion thus licenses the tree in
(6.67) from a parse of zibunzisin-o, despite the incomplete formula specification on
the unfixed node.
T n(a), ?T y(t), ♦
(6.67)
At a later stage, once this incompletely decorated node has merged with the internal
argument node projected by the verb semeta, the substitution of this meta-
variable will however become essential, as otherwise with an open requirement
remaining, its immediately dominating predicate node will fail to be decorated with
formula by Elimination, and so there will be no well-formed outcome overall.
It might seem from (6.52) that the identification of appropriate construal for
zibunzisin is identified off the fixed structure, once the initially unfixed node is
updated. Such an analysis would be in agreement with accounts in which the
associated predicate-argument structure has to be established before the construal
of the antecedent can be established (effectively a radical reconstruction analysis,
as in Saito 1985 and others following him). But things are not quite this simple
(as Saito 2003 discusses). As is well-known, zibun is a subject-controlled anaphoric
device, restricting identification of its antecedents to terms that decorate a subject
node. zibunzisin is the local analogue of this, requiring as its antecedent a suitably
local subject node, that is the closest subject in some sense to be made precise.
However, it is not sufficient to define locality either off the predicate relative to
which the anaphor has to be construed, or off the subject expression closest to
it in the linear sequence, for it turns out that in a more extended sequence of
embeddings, the anaphor can be identified from any subject along the route, as it
were, between antecedent and anaphor, so that the further the distance between
antecedent and anaphor, the more interpretations become available:
(6.68) Taro-gai Hanako-gaj Jiro-gak zibunzisin-o∗i,∗j,k hihansita to
TaroN OM HanakoN OM JiroN OM selfACC criticised comp
itta to omotteiru (koto)
said comp thinks fact
‘Taroi thinks that Hanakoj said that Jirok criticised self∗i,∗j,k .’
6.3. LONG-DISTANCE SCRAMBLING 223
F o(Uanaph ), T y(e)
⇑
F o(α)
There is more to this characterisation of locality than merely defining its domain.
Its significance lies in the fact that it is defined to apply to any closest subject-
decorated node at some arbitrary point in the construction process, so it is not
specific to a fixed position in a tree. First, the substitution defined will trivially
apply in the case of (6.68) since the node which zibunzisin decorates is a node
which is locally dominated by a type-t-requiring node with a fixed term as subject.
This can function as its antecedent as there is no intervening type-t-requiring node.
Indeed, because it has this antecedent, no other potential antecedent will do.
However, the substitution process can also apply in a derivation in which, fol-
lowing the parsing of some ga marked expression to serve as antecedent, a presumed
step of Generalised Adjunction introduces an intermediate type-t-requiring
node across a very weak structural relation and then a new unfixed node is intro-
duced for the anaphor to decorate, as in (6.74), which gives the case of (6.69).
(6.74) Parsing Taro-ga Hanako-ga zibunzisin-o Jiro-ga hihansita:
T n(0), ?T y(t)
F o(Uanaph ),
F o(ι, z, Jiro′ (z)) ?T y(e → t)
T y(e)
F o(W), ♦ F o(Hihansi′ )
unfixed node with the internal argument position decorated by F o(W) in (6.74).
This gives the interpretation for (6.69) of ‘Taro thinks that Hanako said that Jiro
criticised Hanako’.
But this is not the only possible interpretation of the string. Nothing forces
zibunzisin to select Hanako as antecedent, as the process of substitution is a
computational action, and hence optional. So, should the option of substituting
F o(ι, y, Hanako′ (y)) not be taken up at the point at which zibunzisin is parsed and
decorates the unfixed node, its metavariable remains uninstantiated until the point
at which it can be merged as the object of hihansita. At this point, the closest avail-
able subject antecedent is Jiro. Since substitution must now take place to allow
the propositional tree to be completed, the anaphoric metavariable is identified as
F o(ι, z, Jiro′ (z)), giving the interpretation for (6.69) of ‘Taro thinks that Hanako
said that Jiro criticised himself’. As we would expect, these two interpretations are
reported to be equally natural, since no step of structural abduction was necessary
in the derivation of either interpretation.
Finally we get to (6.70), and here, with its three possible interpretations, an
intervening step of abduction becomes essential. As in (6.69), a step of Gener-
alised Adjunction can be presumed to apply following the parsing of the matrix
subject, and the immediately succeeding anaphor can be construed as sharing the
F o value of that matrix subject, i.e. as F o(ι, x, Taro′ (x)), as shown in (6.75).
(6.75) Parsing Taro-ga zibunzisin-o Hanako-ga Jiro-ga hihansita:
T n(0), ?T y(t)
F o(Uanaph ), T y(e))
⇑ F o(ι, y, Hanako′ (y)) ?T y(t)
′
F o(ι, x, Taro (x))
F o(W), ♦ F o(Hihansi′ )
be defined just in order to allow the facts to be expressible within the general movement account.
The feature may seem reasonably well motivated, but it is a stipulation none the less.
46 These examples and translation are from Saito (2003).
6.4. TOPIC-MARKING AND LINK 227
We have now provided an account of scrambling in Japanese that exploits the way
that different forms of underspecification, of position within a tree and of content,
interact within the process of constructing well-formed representations of semantic
content. The one additional assumption we have had to make in this account of
scrambling is to allow a step of abduction in the characterisation of long-distance
dependency. But within a broadly inferential framework in which establishing an
interpretation in context is a process of building up interpretation in a goal-directed
way, such additional steps are conceptually natural, and not unexpected.
as given by the verb whose argument they provide; and there is no sensitivity to
island restrictions:
(6.81) Masa-wa Akiko-ga muita to itta ringo-o tabeta
MasaT OP IC AkikoN OM peeled comp said appleACC ate
‘As for Masa, he ate the apple which Akiko said she peeled (for him).’
(6.82) Tanaka kyooju-wa kaita ronbun-o koosei shiteita
Tanaka ProfT OP wrote paperACC correction did
gakusei-ga inakunatte shimatta
studentN OM had disappeared
‘Speaking of Professor Tanaka, the student who was correcting the paper
that he wrote had disappeared.’
Notice how in these cases, the anaphoric dependence of the argument of the most
nested verb and the topic-marked expression is entirely natural: because of its linear
positioning, it is closest to the left-peripheral topic marker.
The only difference from the analogous construction in head-initial languages
is that the topic-marking suffix itself imposes the requirement of the copy to be
constructed, so the action is lexically driven as given in (6.83).
IF F o(α), T y(e)
THEN IF hLiT n(a)
(6.83) -wa THEN go(hLi); put(?hU ihDiF o(α))
ELSE Abort
ELSE Abort
-Wa is thus defined as requiring a completed node to which some additional struc-
ture is built as to be LINKed, from which it imposes a requirement on some node of
the tree under construction that it be developed so as to contain a copy of the term
just constructed. In other words, it is much like the complex preposition as for in
English, except that it is defined as a suffix, and so does not itself induce the con-
struction of the LINKed node. Moreover, unlike English, Japanese is fully pro-drop
so a pronoun is not required to establish the copy in the tree of the constructed
term. (6.84), which is the translation-equivalent of (6.82), involves essential use of
the pronoun :
(6.84) As for Professor Tanaka, the student who had corrected his paper had
disappeared.
Once having introduced such a LINKed structure, with its imposition of a require-
ment for the copy, a pronoun may be used, but it is not necessary:
(6.85) Masa wa Akiko-ga soitu-ni muita to itta ringo-o
MasaT OP AkikoN OM that guyDAT peeled comp said appleACC
Mamoru-ga tabeta
MamoruN OM ate
‘As for Masa, Mamoru ate the apple which Akiko said she peeled for
Masa.’
Because no case marking is involved (indicating some particular hierarchical po-
sition in the configuration), the tree decorated by the -wa-marked expression and
the tree projected from the remainder of the string do not need to merge, and the
two structures remain in the output structure sharing a common term. This is all
exactly as we expect from an analysis of a LINK relation holding between the term
6.5. RELATIVE CLAUSE CONSTRUAL 229
projected from the left-peripheral expression and the remainder string. We then
expect no island constraints, as the relation between the term and the remainder
of the string is established solely through anaphora resolution.
Since, furthermore, the left-peripheral expression is in a quasi-independent struc-
ture, with no dominating node requiring ?T y(t) at which scope statements might
be collected, we do not expect any accompanying scope actions, and so we do not
expect quantified expressions to occur in that position. This is indeed a wellknown
restriction on topic structures. We also expect the displacement data discussed in
section 6.3.1, where a -wa-marked expression intervenes between a -ga-marked ex-
pression and its co-arguments, with just the extension of the requirement imposed
by the processing of -wa that the required copy which the LINK transition imposes
can occur anywhere at all in the structure under construction, hence the restriction
takes the form ?hU ihDiF o(α).47
There is much more to be said here. First, there is a very strong preference to
use -wa as indicating which is the matrix subject argument, counter-balancing the
effect otherwise of building up interpretation from the most subordinate structure
first. Secondly, -wa, as a topic marker, is used both for contrastive purposes, and
to set a background topic. These are standardly analysed as two discrete uses of
wa: “contrastive” wa and “topic” wa. However, as we already saw briefly in section
4.2.3, the projection of an initial linked structure can be used either as a basis for
setting up an anaphoric link with some larger context, or for introducing a context
specifically for the subsequent tree construction process, hence in contrast to the
larger context. There is thus reason, to the contrary, to think the contrastive use of
-wa is sufficiently well-characterised by the LINK analysis. So the general pattern
of left-peripheral topic effects repeats itself in verb-final languages, buttressing the
analysis of chapter 4 in terms of building up such structures as paired anaphorically
connected structures.
Otherwise, the transition might be seen as occurring only at the rootnode and imposing a require-
ment on a node requiring ?T y(t) of the form ?hDiF o(α), constraining all downward development.
230 CHAPTER 6. THE CHALLENGE OF JAPANESE
48 In modern Japanese head-final relatives, the indication of the relative clause boundary is
provided solely by the positioning of the noun immediately following the verb. It is invariably
indicated by lack of intonational break between verb and head: see Kurosawa (2003). Indication of
such a LINKed structure relation may be provided earlier. See Yoshida et al. (2004) for evidence
from quantifier float data that a quantifying determiner agreeing with the head may precede the
whole relative-clause sequence, by its very mismatch between the classifier it bears and the NPs
that follow indicating the upcoming LINK transition.
6.5. RELATIVE CLAUSE CONSTRUAL 231
Then, having the pointer back at the top node of this intermediate type-t-requiring
node, the lexical specification of muita in (6.91) leads to the construction of a tree
whose nodes of type e are annotated with meta-variables that require identification
as shown in (6.92), where the subject node decorated collapses with the first of
these, just as in the characterisation of subordinate clauses.
232 CHAPTER 6. THE CHALLENGE OF JAPANESE
(6.91) mui
IF {?T y(t)}
THEN make(h↓0 i); go(h↓0 i); put(F o(U), T y(e), ?∃x.F o(x)); go(h↑0 i);
make(h↓1 i); go(h↓1 i); put(?T y(e → t));
make(h↓1 i); go(h↓1 i);
put(F o(Mui′ ), T y(e → (e → t)), [↓]⊥); go(↑1 );
make(h↓0 i); go(h↓0 i); put(F o(V), ?∃x.F o(x), T y(e))
ELSE Abort
T n(0), ?T y(t)
F o(V),
?∃x.F o(x) F o(Mui′ )
T y(e), ♦
What now of the object node, with its metavariable decoration? One of the
verb’s arguments has to be left in some sense open in order to be identified as an
argument of the predicate of the head noun once that is introduced subsequently.
Yet we need to presume that Completion and Elimination will all successfully
apply to yield a propositional structure in order that the tense suffix can be parsed.
In order for these rules to take place, all requirements must have been satisfied.
This is the heart of how a compositionality constraint applies to the compilation
of interpretation on the resulting tree structure, and was critical to the account of
expletives in chapter 5.
It might look as though we have a number of choices, but in fact they reduce
to effectively the same one. If the metavariable is a decoration provided for an
unfixed node we might allow that metavariables are not always identified on line,
as long as the process of substitution can apply at the subsequent point at which
the node in question is assigned a fixed value. But if the node is projected by the
verb’s template, as it is in these structures, then, since it is the verbs that provide
fixed templates of structure, this argument node will be at least fixed relative to
the predicate that introduced it. So, in order to compile an interpretation for this
structure, there is really only one course of action: to enrich the metavariable by
a step of abduction. In this case, the step of abduction is to construct a fresh
variable that instantiates the metavariable and satisfied its formula requirement.
Without such an assumption, no completed decoration can be provided for the
LINKed structure, so the abduction process is effectively obligatory, just as in the
enrichment of the subordination relation in establishing long-distance dependency
construals. Such a move has the effect of making possible the projection of a
propositional formula, from which a LINK transition can indeed be licensed, see
(6.93).
6.5. RELATIVE CLAUSE CONSTRUAL 233
F o(V), F o(v),
F o(Mui′ )
T y(e)
The effect is displayed in the tree in (6.95). In the parsing of (6.89), having con-
structed a formula F o(M ui(v)(ι, y, Hiroto′ (y)) and parsed the tense suffix, what is
copied over to the new structure is the variable v embedded within another type
e structure. Notice in (6.95) how this composite tree contains the subpart of a
LINKed structure as projected for English construal (see chapter 3), but with a
‘containing’ structure that is itself entirely unfixed with respect to the root. No-
tice, too, how each of the steps to get to this juncture is in effect driven by the
tense and the imposition of a LINK transition by the adjacency of verb plus noun,
for at each step the next update relies on the previous update having taken place.
We can set this out through a chain of reasoning as follows.
234 CHAPTER 6. THE CHALLENGE OF JAPANESE
hU iT n(0), T y(t)
h↓0 ihLihU iT n(0)
F o(Si : Mui′ (v)(ι, y, Hiroto′ (y)))
hLihU iT n(0),
F o(ι, y, Hiroto′ (y)) Mui′ (v)
F o(v), T y(e)
F o(v) F o(Mui′ )
The metavariable decorating the object argument node cannot be left without any
formula replacing it, as otherwise, with one requirement remaining open, the propo-
sitional structure could not be compiled. Hence a fresh variable is constructed by
assumption (being a variable, it has no attendant scope statement). The neces-
sity of applying Completion and Elimination to provide Formula values for the
non-terminal nodes in the structure is driven by the tense specification with its
trigger condition that a type t tree is completed. And the LINK transition, much
like the specification of -to, closes off the decoration of that LINKed structure by
requiring the top formula of that structure to be compiled, carrying over a copy of
the variable left in the propositional formula and with its place in some term under
construction determined, despite not yet being fixed in the overall structure.49
The rule in (6.94) does not itself determine that what is copied over into the
new structure is a variable, since it turns out that variables are the only formula
values which can decorate this node. This is because, as a result of the transition
defined by the rule, the pointer is now at an open type-e-requiring node which
necessarily immediately triggers the parse of some noun (rather than allowing, for
example, the parsing of a case suffix). Recall from the lexical actions associated
with ringo in (6.13) on page 196 that nouns project full quantifier structure in
Japanese. This means that a parse of some noun after the construction of the tree
in (6.95) will give rise to a quantifier structure with a fresh variable decorating the
internal T y(e) node, as shown in (6.96) (with any nodes with the same treenode
identifier constructed twice collapsing immediately, as in other derivations).
(6.96) Parsing Hiroto-ga muita ringo:
?T y(e), hU iT n(0), ?∃x.T n(x),
T n(0), ?T y(t)
h↓0 ih↓0 ihLihU iT n(0)
hLihU iT n(0),
F o(Ringo′ ),
F o(ι, y, Hiroto′ (y)) Mui′ (v) F o(v), T y(e),
T y(e → cn)
♦
F o(v) F o(Mui′ )
49 There are issues to be explored as to how the construal of head-final relative clauses interacts
with the evaluation of quantifying terms, which we entirely leave on one side.
6.5. RELATIVE CLAUSE CONSTRUAL 235
Because this internal term structure is induced, nothing other than a variable will
be possible as a decoration on the node containing the copy of the term from the
LINKed structure. Moreover, this variable can only be identical to the variable
already abduced to allow completion of the newly LINKed tree. Even this does
not need particular stipulation in either the rule itself, or in the particular up-
date actions induced by the noun. The checking action associated with putting a
fresh variable, which is part of the actions induced by the noun, checks only those
variables already used in scope statements;50 and the copied variable is merely an
assumption without any associated scope statement. So nothing prevents selecting
the copied variable, and indeed choice of this variable is the only possible selection
to obtain a consistent set of decorations at this node. It therefore follows that only
a variable can be copied over from the completed propositional tree onto the head,
and only that variable will then be bound by the epsilon operator introduced in the
parsing of the noun.51
As can be seen from (6.96), the parsing of (6.89) leads to a structure which is
exactly that of the relative clause construal set out in chapter 3 except for the tree
node identifiers, which bear record of the fact that the structure is being induced
upwards. The topnode of this structure is additionally characterised as being below
whatever node the propositional formula providing the restrictor (from the relative
clause) is below: i.e. below the starting point of the whole subroutine of processing
the relative. This is formulated in the LINK adjunction rule as the declaration
that given the construction of a LINK relation from a node within a structure of
topnode T n(a), the newly introduced containing node of type e across that LINK
relation is also below T n(a).
Once this structure is projected in the construal of (6.89), the evaluation of this
subtree together with its LINKed tree can take place, exactly as in English. Recall
that the sequence of words so far parsed in setting out the derivation of (6.89)
is Hiroto-ga muita ringo, a sequence of ‘relative clause plus head’. The formulae
projected by the common noun and by the LINKed propositional structure are
combined together exactly as in head-initial languages through an application of
LINK Evaluation 1 (see (3.13) on page 79), to yield a type cn term:52
The whole term can now be compiled to give a substructure consisting of a term
with a propositional structure LINKed to its internal variable, that is exactly as in
English. The only difference at this point in the parse is the fact that the structure
is radically unfixed within the unfolding tree, as shown in (6.97).
50 For the details of the checking procedure for determining what constitutes a fresh variable,
relative clause structure contains a (weakly specified) epsilon term to be copied onto the head in
the primary structure for development, rather than a variable. However this analysis remains to
be set out in detail, so we do not adopt it here.
52 See Kurosawa (2003) for detailed justification of this analysis and the interaction of structural
hU iT n(0), T y(e),
F o(ǫ, v, Ringo′ (v) ∧ Si : Mui′ (v)(ι, x, Hiroto′ (x)))
hLihU iT n(0),
F o(Ringo′ )
F o(v)
hU iT n(0), T y(t),
F o(Si : Mui′ (v)(ι, y, Hiroto′ (y)))
F o(v) F o(Mui′ )
This presents us with an apparently new problem. The parse routine has in-
duced a structure from the bottom up without having determined the role of that
structure within the overall structure currently under construction. Without this
being determined, a scope relation for the constructed term cannot be defined in the
locally containing structure. This underspecification must therefore be resolved. It
is again a suffix, the case suffix, that induces the requisite structure, not merely
closing off the structure just completed, but constructing a relation in the tree
from that now completed node. Indeed as we have already seen, this is how case
suffixes function in short scrambling environments, providing the necessary enrich-
ment. But in the current situation, where there is no introduced superstructure,
other than the original starting point, we have two possibilities.
One is to take the term constructed as decorating a node which is assigned a
relation relative to that point of departure, in (6.89) the rootnode, by a direct step
of enrichment as in complement clause construal. By this route, given a -ga marked
expression, we would have:
and similarly for the other case-markers, though in these other cases this update
will need to be an abductive step. The other alternative, which will have to be
invoked in cases where relative clauses are modifying terms in subordinate struc-
tures, is to allow the construction of an arbitrary intermediate node to allow for
the development of further levels of embedding. To allow for this latter possibility,
what we now need in order to preserve the same analysis is to presume on a general
characterisation of a rule that is the inverse of Local*Adjunction. The rule is
given in (6.98) and the treegrowth effect in (6.99).
6.5. RELATIVE CLAUSE CONSTRUAL 237
hLihU iT n(0),
F o(Ringo′ )
F o(v)
hU iT n(0), T y(t),
F o(Si : Mui′ (v)(ι, y, Hiroto′ (y)))
F o(v) F o(Mui′ )
With this fixing of an immediate domination relation by the rootnode, the inter-
pretation of (6.89) can now be completed in a straightforward way. The adjective
oisikatta (‘tasty’) projects a subject node decorated with a metavariable that col-
lapses with the subject node already constructed. The tense suffix drives the com-
pletion of the propositional structure, and all requirements are finally satisfied. Any
accompanying scope statement, which will have to have been constructed during
the construction process, will be evaluated to yield a completed logical form. The
result is a predicate-argument structure with just the one argument, the result of
building the LINKed structure compiled into the restrictor of the term contributing
that argument. The final formula is:
hSj < vi F o(Sj : Oisika′ (ǫ, v, Ringo′ (v) ∧ Si : (Mui′ (v)(ι, x, Hiroto′ (x)))))
which is evaluated as:
F o(Ringo′ (a) ∧ Si : (Mui′ (a)(ι, x, Hiroto′ (x))) ∧ Sj : Oisika′ (a)
IF T y(e)
THEN IF hU i(T n(a)
THEN IF hU i(T n(b)∧?T y(t)∧
hU iT n(a) ∧ a 6= b)
(6.103) -o (revised)
THEN Abort
ELSE put(?h↑0 i(e → t))
ELSE Abort
ELSE Abort
The only difference from the subject-marked nouns, again as before, is to assume
that the step which enriches the weak tree relation between the node decorated
by the case suffix and some higher node is an abduction step rather than being
encoded in the specification of the case marker. Just as with possible derivations
involving -ga marking, a step of Inverse Local*Adjunction is an option, so
the case marking update will either fix the relation to the point of departure (if
Inverse Local*Adjunction does not apply) or to some sub-structure node (if
it does).
In the projection of (6.89) and (6.102), we have still kept things relatively simple
by projecting the LINKed structure for the relative clause sequence initially in
the string, and not assuming more levels of embedding than is dictated by the
string under consideration. But these strategies may apply at arbitrary depths
of embedding, and in all such cases Inverse Local*Adjunction will become
essential:53
(6.104) shujutsu-o okonatta isha-ga byooki-o shinpaisita
operationACC did doctorN OM diseaseACC worried
kanja-o hagemashita
patientACC encouraged
‘The doctor who did the operation encouraged the patient who worried
about his disease.’
The LINKed structure may involve a transition to a head which is subordinate to
some finally added verb as in (6.105), or across a further link relation:
(6.105) Masa-wa Yuki-ga muita ringo-ga oisikatta to itta
MasaT OP YukiN OM peeled appleN OM delicious comp said
‘Masa said that the apple Yuki peeled was delicious.’
‘The apple Yuki peeled, Masa said was delicious.’
(6.106) Tanaka kyooju-wa kaita ronbun-o koosei shiteita
Tanaka ProfT OP wrote paperACC correction did
gakusei-ga inakunatte shimatta
studentN OM had disappeared
‘Speaking of Professor Tanaka, the student who corrected the paper that
he wrote had disappeared.’
We do not give the full sequence of transitions for any such derivations, because
the examples already set out have established the individual patterns.54 There
is, transparently, a whole field of research to be done on the interaction of linear
ordering, scope choices and depths of embedding. For our purposes, it is sufficient
to have given a sense of the overall dynamics.
53 Thisexample is from Nakayama (1995).
54 And keeping one’s head down in the explicit details of a construction process is hard work on
the reader!
240 CHAPTER 6. THE CHALLENGE OF JAPANESE
57 Japanese also displays so-called headless relatives, which we do not take up here. See Kurosawa
The interpretation of the clause within (6.110) is shown in the evaluation of (6.111a)
as (6.111b).
(6.111) a. hSi < xi F o(Mui′ (ǫ3 , x, Ringo′ (x))(ι, y, Hiroto′ (y)))
b. F o(Si : Ringo′ (a) ∧ Mui′ (a)(ι, y, Hiroto′ (y)))
where
a = (ǫ3 , x, (Ringo′ (x) ∧ Mui′ (x)(ι, y, Hiroto′ (y))))
The critical difference from head-final relatives and their construal is that there
is no free variable in the formula that decorates the propositional tree to be carried
across a LINK relation to provide the appropriate term to be shared. However,
what can be carried across such a relation is the completed epsilon term shown in
(6.111b), i.e.
60 The suffix -no expresses a range of functions; and the challenge of providing a unitary account
remains open. See Otsuka (1999) for the outline of an analysis which goes at least some way
towards achieving this.
6.5. RELATIVE CLAUSE CONSTRUAL 243
hU iT n(0), T y(t),
F o(Si : Mui′ (a)(ι, y, Hiroto′ (y))), ♦
F o(U),
?T y(e → t)
?∃x.F o(x)
T y(e), F o(a),
F o(V), F o(Tabe′ )
?h↑0 iT y(e → t), ♦
allowing any choice of dependency, an associated scope statement gets added to the
sequence of scope statements for its locally dominating T y(t)-requiring node in a
regular way.62 As in the parsing of (6.37), the subject node remains to be identified;
but this can be identified as F o(ι, y, Hiroto′ (y)) as a value taken from the immediate
context at this stage in the interpretation process. This anaphoric process parallels
the identification of antecedent in the construal of some unexpressed matrix subject
following the construction of a complement structure: it is the context from which
the formula value is selected, irrespective of the tree relation between the two.
The result will be the logical form in (6.115a) which yields (6.115b) once the
scope evaluation algorithm applies, hence the resulting interpretation of (6.109) as
‘Hiroto ate the three apples that Hiroto peeled’. The E-type effect arises because
the object term in the primary structure is construed as identical to a term derived
from the full process of constructing an interpretation for the relative clause and
its containing expressions (ignoring the internal structure of ι, x, Hiroto′ (x))).
(6.115) a. hSj < xi F o(Tabe′ (ǫ3 , x, (Ringo′ (x) ∧
Mui′ (x)(ι, y, Hiroto′ (y)′ ))(ι, y, Hiroto′ (y))
b. F o(Sj :
Ringo′ (a) ∧ Mui′ (a)(ι, y, Hiroto′ (y)) ∧ Tabe′ (a)(ι, y, Hiroto′ (y)))
a=
(ǫ3 , x, (Ringo′ (x)∧Mui′ (x)(ι, y, Hiroto′ (y))∧Tabe′ (x)(ι, y, Hiroto′ (y)))
In these examples, the pattern of Japanese parsing repeats itself yet again. Without
early commitment to precise structural relations between the imposed axiom node
and the remainder, composite propositional structure is built up, propositional unit
by propositional unit, each subpart, as it is compiled, being used as the context for
what follows.
We can now see why it is that head-internal relatives are so problematic for
other frameworks, yet straightforward for the DS perspective. In other frameworks,
syntax is defined over natural language strings (or some abstraction from them). It
is not defined over representations of meaning for that string, as this is the burden
of semantics. Syntax in those frameworks merely provides the vehicle over which
principles of semantic interpretation are articulated. In Dynamic Syntax to the
contrary, syntax is nothing more than the progressive compilation of representations
of semantic content. In the head-internal relatives, where everything is available
to project a final interpretation of the sub-string comprising the relative before
constructing the structure to be associated with the string following the relative, it is
precisely the completed interpretation which provides the input to that construction
process. That a full lexical noun phrase serving as antecedent might occur within
the relative sequence is expected: indeed it is essential to this form of interpretation.
The E-type construal is expected: it is the consequence of copying over an evaluated
term whose restrictor contains a full record of the scope dependency in the evaluated
formula and the resulting interpretation of the antecedent expression. Even the
coincidence of head-internal relatives with head-final languages is expected, for it is
only if some clause in its entirety precedes the head that such an interpretation (with
all scope statements evaluated) will be available. In short, the entire explanation
emerges as a consequence of general principles of the dynamics of tree growth for
relative clauses and quantified expressions and their interaction.63
62 With no metavariable from some lexical projection and no other quantifying term in this
structure, the only possibility is for the copied epsilon term to take narrow scope with respect to
the index of evaluation(see chapter 3).
63 Kuroda (1999) identifies five different “look-alike” constructions to head-internal relatives. It
remains to be established whether all these construction types can be reduced to the single form
6.6. SUMMARY 245
6.6 Summary
We began this chapter by setting out the perplexing properties of Japanese as il-
lustrative of verb-final languages, and posing the challenge of finding a grammar
formalism which expresses the properties of these languages in the same terms as
the more familiar head-initial languages and without undue complexity in the anal-
ysis. We seem to be close to realising our goal. The dynamics of setting out partial
tree structures incrementally and building up propositional structure as further
information becomes available has applied in exactly the same terms, and with es-
sentially the same mechanisms as in those languages. The superficial flatness of the
languages has been expressed through building underspecified structural relations
whenever the lexical actions do not themselves induce the requisite structure. The
asymmetry between morpheme simplicity and complexity of structure projected
has been straightforwardly definable. And, because the only level of structure that
is represented is that of the emergent representation of content, there is no com-
plex mapping from some structure inhabited by the words onto this semantically
transparent structure. It is notably the abandoning of the assumption that nat-
ural language strings induce a level of structure defined over those strings which
has enabled a characterisation of verb-final languages which is commensurate with
languages with more fixed word order.
Of course, we have only scratched the surface of Japanese syntax and semantics
in setting out this sketch. But what is notable in the account, as indeed all other
phenomena we have looked at, is how analyses have invariably taken the form of
a direct, even simple-minded, reflection of the way in which interpretation is built
up bit by bit. Moreover, we now have our various typologies rounded out. We
have relative clause construal involving the building of paired LINKed structures
with the propositional structure either constructed after the head or before it, with
asymmetry in the different kinds of anaphoric relations between the items in these
LINKed structures according as the head or the LINKed structure precede.64 And
we have left versus right periphery effects of the much the same sort across all
languages. As we have seen, there is cross-linguistic variation according as left-
periphery effects are more or less locally constrained. And, although we have not
taken up right-periphery effects in any detail in Japanese, we also anticipate, for
example, that the Right Roof Constraint would hold universally, being a constraint
on compositionality of content for the resultant structure.
There is, as a coda, the challenge of explaining where Generalised Adjunc-
tion and Local*Adjunction might be needed in English, or if not, why not. Mo-
tivation for application of Generalised Adjunction is not difficult to find: we
arguably need it for modelling attributive adjectives and their construal, exactly as
in Japanese but defined as a lexically induced sequence of actions. Many languages
allow adjectives both before and after the noun they modify, with singularly little
difference, except that the pre-head adjectives invariably lack any tense marking:
the tattered book vs. the book that is tattered. Moreover, Japanese is highly excep-
tional in having fully tense-marked head-final relatives (the standard cross-linguistic
pattern is for such head-final relatives to be non-finite), the parallelism suggesting
that this is indeed an appropriate parallel to draw. For Local*Adjunction, we
suggest that it is only a case of knowing where to look. With structurally fixed
subject relations in language such as English, its application would seem to have
to be internal to the verb phrase, and indeed many subject-verb-object ordering
of analysis provided here.
64 We would now confidently anticipate, for example, that resumptive pronouns would not occur
in head-final languages.
246 CHAPTER 6. THE CHALLENGE OF JAPANESE
languages display much freer constituent order after the verb, redolent of short
scrambling. Another candidate application, as we have already seen in passing,
is to allow application of Local*Adjunction in introducing a predicate node,
thereby allowing VP adjuncts to be introduced as predicate modifiers directly. Yet
another candidate is the type-e-internal application, to express the identity of con-
strual of John’s picture and the picture of John. There are also local permutation
processes, of which passive might be a candidate. We do not follow up on these
here, but Generalised Adjunction and Local*Adjunction certainly give us
tools with which to address these further phenomena.
Chapter 7
In the previous chapters we have seen how the dynamic perspective on natural
language adopted by DS provides new and, we think, more insightful analyses
of cross-linguistically recurrent patterns of complex syntactic structures such as
relative clauses, left and right periphery effects, and word-order variation. In this
chapter, we look at yet another language group, and at a new topic, namely at the
relation between agreement and conjunction in Swahili, a Bantu language spoken
in East Africa.
Bantu languages are known for their noun class systems, and for their morpho-
logically complex verb forms. The morphologically marked relationship between
NPs and verbs in Bantu is often analysed as agreement, but we will see in the
course of this chapter that, from a dynamic perspective, Bantu verbal agreement
markers are more appropriately analysed as pronominal elements, which share a
number of characteristics with the pronominal clitics found in Romance languages,
and, indeed, with expletive pronouns in English (see Chapter 5). After introducing
the relevant background, we concentrate on our analysis of the relation between
agreement and conjoined NPs, as this is a topic which has received attention more
widely in the literature, and because some aspects of this complex topic provide a
good example of the importance of linear order for syntactic analysis.
We begin by reviewing the agreement system of Swahili shows the relation of
different parts of the sentence:1
(7.1) a. wa-toto wa-wili wa-zuri wa- me- anguk-a
2-children 2-two 2-beautiful 2-SUBJ PERF fall-FIN
‘Two beautiful children have fallen.’
b. m-iti mi-wili mi-zuri i- me- anguk-a
4-tree 4-two 4-beautiful 4-SUBJ PERF fall-FIN
‘Two beautiful trees have fallen.’
The examples in (7.1a) and (7.1b) show how the relation between the subject noun
and the numeral, adjective and verb to which it is related is marked with agreement
prefixes. In (7.1a), the subject noun watoto, ‘children’, belongs to Class 2 (the
1 The prefixed numbers indicate the noun-class, and singular/plural distinctions are presented
as subscripted suffixes. In this chapter we use SUBJ, OBJ as annotations, rather than N OM, ACC
as Swahili lacks a morphological case system, and the information as to whether the prefix marks
subject or object comes from the position in the verbal template.
247
248 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
numbering of different noun classes is arbitrary, and we are following here the
system used by Bantu linguists), indicated by the nominal prefix wa-. The same
prefix is used for numeral and adjectival stems, so we have wa-wili and wa-zuri.
The relation between subject nouns and verbs is marked with the subject concord,
which in the example in (7.1a) is identical to the nominal prefix. In (7.1b), we see
that the nominal prefix for class 4 is mi-, and that the subject concord for class 4
is i-.
Noun class systems like that in Swahili can be regarded as half-way between
gender systems such as found in German or French, and classifier systems, as for
example in Thai. As in gender systems, each Swahili noun belongs to a particular
class which is morphologically marked and determines agreement patterns. On the
other hand, Swahili noun classes have, in contrast to most gender systems, a fairly
discernable semantic base. For example, classes 1 and 2 contain only humans,
classes 3 and 4 contain a lot of words for natural features like trees, mountains
and rivers, classes 7 and 8 contain many words for concrete physical things like
chairs, and class 9 and 10 contain words for animals, but also many loan-words. In
general, the classes show a pairing of singular and plural. However, in many cases
the relation is more subtle, so that for example the word for ‘lion’, simba, is in class
9, and ‘many lions’ are in class 10, but there is also ma-simba, in class 6, meaning
‘a pride of lions’. In fact, quite generally, nouns can be shifted from one class to
another to achieve a change in meaning. For example, shifting a noun into class 7
often gives a diminutive interpretation, so that ki-toto, from the stem -toto which
we have seen already in wa-toto, ‘children’, means ‘small child’. So, Bantu noun
class systems such as the one in Swahili have a morphological dimension, in that the
noun class system determines the morphological shape of nouns and other words,
but also a semantic dimension in that class membership and class shifting often
reflect semantic distinction. In addition, there is a syntactic dimension because
noun classes are important for the agreement system.
using a conjunction of two clauses, rather than two noun phrases. However, exam-
ples of all strategies we describe here are readily found in discourse and in ‘natural’
Swahili texts (Marten 2000 has a number of examples taken from a Swahili novel).
and a class 6 noun, in that order, can take agreement marking on the verb of class 8.
250 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
ment. Given its morphological structure, this is arguably a pronominal element with internal
quantification, but we ignore this here.
5 This holds for animate NPs. The situation for non-animate NPs is less clear. Bokamba (1985)
has examples showing agreement with the first conjunct of a conjoined NP preceding the verb, but
these examples were not confirmed by our informants, which may be due to dialectal variation; all
examples used here are from the Kiunguja dialect of Swahili, spoken on Zanzibar island. Other
examples of this kind reported in the literature (including the cases discussed in Marten (2000))
can be analysed as cases of default agreement further discussed in section 7.4.1.
7.1. STRATEGIES FOR RESOLVING AGREEMENT 251
form is also used, for example, to indicate the agent of a passive verb (cf. Mous and Mreta
(forthcoming) for discussion of a similar situation in Chasu). Without going into details, we take
this to indicate a more fundamental semantic underspecification than an ‘and/with’ ambiguity.
7.2. AGREEMENT MARKING 253
the tense marking follows. If it follows the agreement marker, then the marker
indicates the subject; if the tense marker precedes the agreement marker, then the
marker indicates the object.8 This then leaves the problem that there is never
more than one object marker, either indirect or direct object marking, and this
underspecification remains unresolved by the agreement system, and is determined
only by the verb and the general context.
Looked at broadly, we have two choices as to how to analyse this phenomenon.
Either we analyse Swahili, as classically, as having a highly complex verbal mor-
phology with idiosyncratically defined mappings onto the required semantic rep-
resentation, demanding subclassifications of an entirely language-particular sort.
Alternatively, we take the linear ordering in the processing strictly, and analyse
these forms as sequences of reduced pronominal elements that themselves induce
the requisite semantic representation before the verb is processed. There is good
reason to think that attending to the time-linear dynamics of parsing is the right
route to go; and, considered in this light, Swahili shows interesting parallels with
Spanish, and the other Romance languages. First there is the obvious similarity,
that in both languages, as indeed many others, the choice of subject expression is
obligatorily reflected by marking on the verb, whether or not there is an explicit
subject:
(7.20) llamaron a Paca
callP AST,3ps.pl a Paca
‘They called Paca.’
But remember too how in Spanish, strong pronouns obligatorily required the pres-
ence of a clitic:
(7.21) a. la llamaron a ella
her call3pl. a her
‘They called her.’
b. *llamaron a ella
call3pl a her
‘They called her.’
In Swahili, we have a similar phenomenon. A subset of NPs obligatorily requires
the presence of a marker on the verb, morphologically transparently a reduced
anaphoric element, in its function just like the Spanish clitic; but in this language
it is the subset of animate-marked NPs that imposes this requirement.
(7.22) a. ni- li- mw- on-a Juma
1-SUBJ1st.sg PAST 1-OBJ3rd.sg see-FIN Juma
‘I saw him, Juma.’
b. *ni- li- on-a Juma
1-SUBJ1st.sg PAST see-FIN Juma
Intd.: ‘I saw Juma.’
This suggests that, despite differences between the phenomena in the two lan-
guage families, in terms of their role in building up interpretation, they should be
analysed along the same lines. For Spanish, we defined strong pronouns as deco-
rating only unfixed or linked nodes, encoded as not decorating fixed nodes in the
8 For the slightly different cases of infinitive, habitual and subjunctive marking, see footnote
12.
7.2. AGREEMENT MARKING 255
indefinites in chapter 3 (see section 3.3.5). On the other hand, if the behaviour of
the affix under conditions of duplication does not impose any specificity implication
on the doubled expression, then this suggests an analysis in terms of the metavari-
able projected from the affix having lost the bottom restriction definitive of full
lexical status. This would allow derivations in which the node decorated by the
affix could unify with that decorated by some co-construed noun phrase. On this
analysis, there would be only one articulated tree, hence no reason to preclude any
subclass of quantifying expressions. As things turn out, whether a specificity re-
striction holds or not appears to play a relatively small role in Swahili, arguably no
more than a pragmatic constraint, as in Greek (see chapter 4 section 4.2.2.3). How-
ever, it has been argued to play a larger role in Chichewa (Bresnan and Mchombo
1987), and the framework certainly makes available a range of lexical effects which
enables such variation to be naturally expressed.
To follow through on this general form of analysis, we have to assume that the
affixes together with the verb induce a full template of structure, the verb inducing
whatever structure is not provided by the clitics themselves, as in Japanese.9 The
one puzzle that appears not to be reducible to the distinction between the building
of linked structures vs the building of an unfixed node is the lack of information con-
veyed by the morphological marking as to whether the relation encoded is subject,
object or indirect object. This property is unlike the Romance clitics. However,
it is directly redolent of Local*Adjunction and can be expressed very straight-
forwardly as a lexical action corresponding to this general computational process.
So, as it turns out, all the principal patterns can be straightforwardly expressed by
analysing the distribution in terms of the two strategies of either building a pair of
linked structures or building an unfixed node within a single structure.
To flesh out these analyses now with lexical specifications of some of the agree-
ment markers, we take first the most weakly specified form, an agreement marker
(not a class 1 marker) which has an identical form whether subject or object or
indirect object. The analysis involves using the Local*Adjunction operator,
the disjunction across these various relations. We take the class 10 marker zi- by
way of illustration which we treat as projecting a metavariable with a presupposi-
tional constraint restricting substituends to those that have the semantic properties
associated with class 10 nominals.10
IF ?T y(t)
THEN IF ?h↓1 i∃x.T y(x)
THEN Abort
(7.25) zi
ELSE make(h↓1∗ ↓0 i); go(h↓1∗ ↓0 i);
put(F o(U10 ), T y(e), ?∃x.F o(x), ?∃x.T n(x))
ELSE Abort
The effect of parsing the prefix zi- is shown in the tree display in (7.27) considered
as part of a parse of the verb complex in (7.26). (We ignore the noun phrases for
the time being.)
(7.26) (sabuni hizi) zi-li-zi-harib-u (ngoma)
10.soaps these 10SUBJ-PAST-10OBJ-destroy-FV 10.drums
‘These soaps destroyed the drums.’
9 As in Japanese, nouns induce a full complex of term structure projecting an argument; and
because this structure is projected directly from the lexicon, the specification of additional restric-
tions induced by some semantic constraint is straightforward. Nouns denoting animate objects,
for example, can be defined as invariably decorating an unfixed or linked node. Inanimate nouns
can be defined as doing so only if they precede the verb, or in the presence of the object marker,
in which case they decorate a linked structure.
10 For more discussion of the content of the class-marking, see section 7.4.
7.2. AGREEMENT MARKING 257
Notice how the construction of the locally unfixed node directly reflects the under-
specification of its semantic role in the resulting structure: this sequence of actions
can apply whenever the pointer is at a type-t-requiring node, as long as the pred-
icate node is not already decorated: in other words, it is obligatorily a prefix to
the verb. But, because all such underspecified relations must be unique within any
partial tree, there is a consequence: no further application of such action will be
possible until the underspecified relation is updated. In the present analysis, we
propose that it is the tense marker which provides the fixed subject node. This
can be seen as a reflex of the verbal origin of tense markers in Swahili: this is most
transparent in comparatively recently grammaticalised tense markers such as the
future tense marker ta- from -taka, ’want’, or perfect me- from -mala, ’finish’; and
also in the auxiliary-like status of Bantu tense-aspect markers, of which the Swahili
tense marking is an atrophied reflex.11 Accordingly, we define the tense marker
as marking a type-t-requiring node with the appropriate tense specification, and
inducing a subject relation in the presence of one such locally unfixed node, see
(7.28). The action for this affix aborts if any fixed structure is already introduced,
guaranteeing also its pre-fixal status.
(7.28) li-
IF T n(a), ?T y(t)
THEN IF h↓i⊤
THEN Abort
ELSE IF h↓1∗ ↓0 i⊤
THEN go(h↓1∗ ↓0 i); put(h↑0 iT n(a), ?T y(e));
go(h↑0 i); put(T ns(P AST ))
ELSE Abort
ELSE Abort
The effect of parsing li- is shown in (7.29) assuming an appropriate substituend for
the metavariable and Thinning of all satisfied requirements (?T y(e), ?∃x.T n(x),
?∃x.F o(x)).
(7.29) Parsing zi-li-:
T n(0), ?T y(t), T ns(P AST ), ♦
Once the tense marker has served to fix the subject relation, the lexical actions of
zi- can apply again to yield another unfixed node (all the condition precludes is
that a fixed predicate relation already be introduced), as in (7.30).12
11 As we shall see in ch.8, there is also reason to analyse auxiliary verbs in English as projecting
a propositional template.
12 This analysis can be extended to the (verb-initial) habitual marker, hu-, and the infinitival
marker ku-. These can be defined as inducing the construction and decoration of a fixed subject
node (which they invariably replace), thereby licensing the presence of the immediately following
258 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
Whatever the cluster of argument nodes so far constructed, the verb then induces a
full template of structure, as in other languages, the nodes which have already been
constructed by the affixes harmlessly collapsing with them as not distinct. As in
the Romance languages, the subject node projected by the verb is decorated with
a metavariable, but the object node decorated solely with a requirement. In other
words, we analyse Swahili as a subject pro-drop but not object pro-drop language.
(The fact that the subject affix is obligatory as a prefix is a consequence of the
lexical conditioning for the update action of the subject and tense markers jointly,
and not an intrinsic difference between subject and object marking.) The lexical
actions associated with the verb stem -haribu are given in (7.31) and are very similar
for those given for verbs in Italian in chapter 5 (see (5.34) on page 171).13
IF ?T y(t)
THEN make(h↓0 i); go(h↓0 i);
put(T y(e), F o(U), ?∃x.F o(x)); go(h↑0 i);
make(h↓1 i); go(h↓1 i); put(?T y(e → t));
(7.31) -haribu make(h↓1 i); go(h↓1 i);
put(T y(e → (e → t)), F o(Haribu′ ), [↓]⊥);
go(h↑1 i); make(h↓0 i); go(h↓0 i);
put(?T y(e))
ELSE Abort
In continuing the parse of the verbal complex in (7.26), the subject node collapses
with that introduced by the verb and the locally unfixed node then must Merge
with the internal argument node. This is shown in (7.32) where again a suit-
able substituend has been selected to instantiate the metavariable on the unfixed
node.
(7.32) Parsing zi-li-zi-haribu:
T n(0), ?T y(t), T ns(P AST )
?T y(e), ♦ F o(Haribu′ )
object marker as introducing a locally unfixed node. This leaves subjunctives as the only problem-
atic case in which a subject marker may be followed by an object marker, with the subjunctive
marking as a verbal suffix. But with subjunctives only occurring in well-defined contexts it is
arguable that a fixed subject node may be induced by the containing context even in these cases.
Without having considered such data in this book, we leave this problem on one side.
13 We put to one side a characterisation of the status of the final vowel of inflected verbs which
Although for most noun classes, there is no difference in form between subject
and object agreement markers, class 1 markers do differ according to subject or
object function, as do the markers for second person. So class 1 agreement has a-
marking for subject, and m- marking for object marking, but even in this case the
m- marking does not differentiate between direct and indirect object marking. Since
this is a lexical differentiation, it is straightforward to define the second person sub-
ject marker u-, for example, as having exactly the case marking properties familiar
from other languages, an output filter of decorating an immediate dominating type
t node:
IF ?T y(t)
THEN IF ↓⊤
(7.33) u- THEN Abort
ELSE make(h↓1∗ ↓0 i); go(h↓1∗ ↓0 i);
put(F o(UAddressee ), ?h↑0 iT y(t)) ELSE Abort
As with all other case agreement markers, the subject marker induces a locally
unfixed node and the tense-marking updates it. But this time, the agreement
marking itself is sufficient to rule out all other developments, with its output filter of
immediate domination by a node of type t. The 2nd person singular object marker,
ku-, on the other hand, following the tense marker retains its underspecification vis
a vis whether the node it decorates will in the end result serve as an object or as
an indirect object (7.34).
IF ?T y(t)
THEN IF ↓0 ⊤
THEN make(h↓1 i); go(h↓1 i);
(7.34) ku- make(h↓1∗ ↓0 i); go(h↓1∗ ↓0 i);
put(F o(UAddressee ), T y(e), ?∃x.F o(x))
ELSE Abort
ELSE Abort
As this variation between individual agreement markers shows, the framework
allows a range of variation within a language. There is a question of whether these
Swahili agreement markers should include specification of a bottom restriction, but
to retain the neutrality of the marker with respect to subject and object marking,
we do not impose any such restriction. Nonetheless, the agreement marker, if it
precedes the duplicated NP invariably serves as a pronominal device identified in
context so that the effect is to pick out some term in context with which the du-
plicated NP also has to be construed as identifying, suggesting possible motivation
for assuming that a bottom restriction is invariably in place. In the absence of
case-marking on the NPs, unlike Greek, there is no definitive evidence of whether
the specificity effect obtained by using such agreement markers is or is not a con-
sequence of an encoded restriction; but it is notable that this is again a parallelism
with the Romance languages, in the availability for these clitic doubling phenom-
ena of an analysis for the doubled NP as either decorating an independent LINKed
structure and an unfixed node. On this basis also, we expect there to be a con-
siderable degree of variation across related languages. In particular, the reported
greater discrimination between subject and object marking in Chichewa (Bresnan
and Mchombo 1987) is straightforward to express, since we can analyse the sub-
ject marker as having no bottom restriction hence not giving rise to a specificity
effect, the object marker, contrarily, having an associated bottom restriction and
invariably so doing.
There is a further bonus to this style of analysis. From the perspective of
frameworks which make a sharp distinction between grammar-internal agreement
260 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
the verb will be at the object node awaiting development. We assume the use of
*Adjunction (in its variant forms) and the construction of LINKed structures as
in all other languages, with NPs solely decorating these nodes, the complex of fixed
argument nodes and their corresponding predicate being induced by the clitics and
the verbs. As in the description of other languages, the use of introducing unfixed
nodes and updating them, or introducing a node with a partial specification and
updating that subsequently is completely unproblematic. There is no need for extra
functional projections to reflect the quasi-independent status of these affixes: the
projection of partial tree decorations and their update is sufficient.
most tense forms for all NPs, there is in Swahili, a difference in object agreement
between animate and non-animate denoting NPs. As mentioned above, with NPs
denoting animate objects, object agreement is mandatory (at least in the majority
of contexts), while for non-animates, object agreement is related to specificity:
(7.36) a. ni- li- mw- on-a Juma
1-SUBJ1st.sg PAST 1-OBJ3rd.sg see-FIN Juma
‘I saw him Juma.’
b. *ni- li- on-a Juma
1-SUBJ1st.sg PAST see-FIN Juma
‘I saw Juma.’
c. ni- li- on-a ki-tabu
1-SUBJ1st.sg PAST see-FIN 7-book
‘I saw a/the book.’
d. ni- li- ki- on-a ki-tabu
1-SUBJ1st.sg PAST 7-OBJ see-FIN 7-book
‘I saw it, the book.’
¿From (7.36b), it appears that an animate-denoting NP like Juma, which is a name
for a man, cannot be used to decorate directly the object node provided by the verb:
the pronominal prefix is obligatory. As already sketched, this should be analysed
in parallel with Spanish strong pronouns, decorating unfixed or LINKed nodes.
With these preliminary assumptions in place, we concentrate first on the class
of nouns denoting animate objects, and see how the partial agreement patterns
illustrated above result from the dynamics of our system.
At this point, the pointer could return to the root node; but, given the presence of
the connective na, a LINK relation is constructed between two nodes of type e. The
connective is defined as the reflex in this system of what in categorial systems would
involve typing as T y(X → (X → X)): the connective induces the introduction of
the linked tree that is to provide the modification of the first constructed term.
This is defined in the lexical entry for na in (7.40).
IF T y(X), ∃x.F o(x)
(7.40) na THEN make(hLi); put(?T y(X))
ELSE Abort
Note that this lexical entry for conjunction is in relevant respects identical to the
one used for English and in Chapter 5. The additional condition, that there exist
a full formula value at the node from which the LINK relation is launched, reflects
the fact that the LINK relation from the conjunction is built only from formula-
complete nodes, and not from nodes with metavariables as formulas. This seems
fairly straightforward, and we will make use of this condition for the analysis of
extraposed conjuncts later on. Parsing na leads to the partial tree in (7.41).
14 The information SING(x) might, assuming Link (1983), be taken as shorthand for Atom(x),
or, equivalently SIN G(x) ≡ (|x| = 1). It would clearly be preferable if such an additional
specification were seen as the result of application of a step of inference expressed as some general
inference schema, such as a meaning postulate. However, we have not entered at all into issues of
content for predicative terms decorating the trees in this book, so here we take the information
to be projected directly onto the tree by the lexical actions of the nouns. Nothing turns on this.
264 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
The information provided by the lexical entry given by Nayla, the next word
in the string, then leads to the decoration of this newly introduced LINKed node,
as normal. The way we have used the LINK relation here differs from the use of
LINK in chapters 3 and 4, and, like that in chapter 5, does not impose any sharing
of information across the LINK relation. This is in principle possible: the LINK
relation is defined as a relation between nodes in two separate partial trees, and as
such is independent from the requirement of shared information. Nevertheless, in
co-ordination, as it turns out, the shared term results from a process of evaluation;
so the substance of LINKed structures in the final completed tree as involving
sharing of terms is preserved. The evaluation step needed is that which creates a
term out of two distinct individual-denoting terms, forming a new term denoting
the group.
We, therefore, introduce a new rule for LINK evaluation in addition to those
introduced in chapter 3 for relative clause construal.15 This version is a process
which takes two terms constructed across a LINK relation, and from them defines
a term out of the join of the two, giving rise to a plural interpretation (as in the
English Tom and an administrator control the timetable).
(7.42) LINK Evaluation (Conjunction):
Notice how in this rule, the predicates P and Q which serve as the restrictors of the
two terms (and which could equivalently be written as λy.P (y) and λz.Q(z)) are
each bound by the quantifier of the two separate epsilon terms in the input, but in
the output these predicates are not distributed over the same individual variable.
What this means is that the application of the rule constructs a term which denotes
15 Although it seems that we are building up an array of LINK evaluation rules that are inde-
pendent of each other, this is not, in fact, the case. All the evaluation rules specify a means of
copying the content provided by a LINK structure onto the matrix tree. The rules all involve
some form of co-ordination of formula values (typically, as here, as conjunction) but they differ
as to which node in the matrix tree at which this co-ordination is to occur: for restrictive relative
construal it is the T y(cn) node that dominates the node decorated by a variable; for non-restrictive
construal, it is the topnode in the propositional tree. The rule in (7.42) copies the information
onto the term from which the LINK relation is projected. In all these cases, it is arguably the
types that are involved that induces the various forms of evaluation and the possibility is open
that LINK evaluation is always semantically co-ordination of formulae, possibly reducible to a
single characterisation, while pragmatic inference over the tree under construction determines the
actual output. See also footnote 31 on page 312.
7.3. AGREEMENT AND CO-ORDINATION: ANIMATES 265
the union of the two sets which the predicates define, a complex term denoting a
group.
(7.42) thus allows the construal of the two epsilon/iota terms as giving rise to
a complex term where the properties of the two terms as input do not distribute
over the variable.16 In the analysis of Haroub na Nayla in (7.37), evaluation of the
two LINKed nodes of type e in provides the formula value:
This does not reduce any further because of the joining of the two properties of
being Haroub and being Nayla, so the term picks out ‘the plural entity consisting
of (the join of) Haroub and Nayla’. Notice that through this process of evaluation
the sharing of formula values across the LINK relation is established, as illustrated
in (7.43) after Completion.
(7.43) Parsing Haroub na Nayla:
T n(0), ?T y(t), ♦
complete characterisation of plural quantification this rule needs to articulate in a fully general
way the relationship between quantification over individuals and quantification over groups.
17 In fact, even the verb root here can be analysed as being complex, as the syllable -ku- is a
dummy morpheme, historically related to an infinitival marker, which is used with, among others,
monosyllabic verbal roots such as -j-, ’come’ (Marten 2002a).
266 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
We now arrive at a parse state in which there are two terms decorating a single
tree node. This is not precluded, recall, as long as one of these formulae entails
the other (this property holds on the tree after every application of substitution
updates a metavariable). Here, the requisite entailment transparently holds in the
semantics because, with Nayla and Haroub being distinct individuals, the complex
term that conjoins their properties must be taken to pick out a group entity, which
is precisely what the underspecified plural epsilon term does:
Merge is thus licensed in the semantics and we might construct a composite term
in which the plurality is expressed in the representation language:18
F o(ι, z, ((λx.Haroub′ (x) ∧ SIN G(x)) ∧ λy.Nayla′ (y) ∧ SIN G(y))(z) ∧ P LU R(z))
Notice that here the predicates distribute over the same variable z, so that the
property of constituting the set of individuals made up of Haroub and Nayla and
the property of being a plural entity are indeed both predicated of the same (plural)
18 Formal rules for achieving this through rules of Appositive LINK Evaluation and Term
extension of the epsilon calculus, which we cannot possibly take on here. However, plurality is a
quite general problem, not specific to Swahili; and what we have here will suffice for expressing
the variation available in agreement phenomena.
268 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
with plural subjects as long as the term constructed from that subject has not yet
been built at the time when the verb is introduced. The relevant data for this
second case are repeated below:
(7.47) a. wa- li- kuj-a Haroub na Nayla
2-SUBJ3rd.pl PAST come-FIN Haroub and Nayla
‘Haroub and Nayla came.’
b. a- li- kuj-a Haroub na Nayla
1-SUBJ3rd.sg PAST come-FIN Haroub and Nayla
‘Haroub and Nayla came.’
The pattern here, mixed agreement apart, is remarkably like that of the Romance
languages, particularly Spanish. Subject-inversion structures as in these examples
are more marked than subject-verb order, and often convey a presentational focus
interpretation. In addition, as pointed out above, intuitively, while (7.47a) is an
assertion about both Haroub and Nayla, (7.47b) seems to be more an assertion
about Haroub. On the other hand, in terms of denotational content, strictly, both
sentences assert the same proposition, namely that Haroub and Nayla came. We
are going to show now how the two agreement possibilities, and the slightly different
readings, result from the process of structure building.
We take first the cases where there is matching agreement. These are the inverse
of the preposed matching agreement cases. In these examples, following the Ro-
mance pattern, the verb is parsed first, hence the construction of the propositional
template with its predicate kuj as shown in (7.48).
(7.48) Parsing wa-li-kuja:
T n(0), ?T y(t)
T n(00), T y(e),
T y(e → t)
F o(U), ?∃x.F o(x),
F o(Kuj′ )
F o(ǫ, x, P LU R(x)), ♦
Given the agreement properties of Swahili, the formula value can of course be sub-
stituted from context, in which case no further input would be necessary. However,
in (7.47a) there is further input, the post-posed subject, and prototypically these
examples are used in contexts where the subject cannot be identified in context, for
example at the beginning of texts, as is common for presentational structures like
these. There is no restriction on the type of NP in these structures, indicating that
pronominal subject-agreement markers in Swahili have lost their bottom-restriction,
and post-posed subjects allow introduction of their lexical actions into the parse
tree by means of Late *Adjunction (see Chapter 5), as in (7.49). Notice that it
is the first conjunct of the conjoined subject whose lexical specification decorates
the node introduced by Late *Adjunction, before the conjunction na is scanned.
7.3. AGREEMENT AND CO-ORDINATION: ANIMATES 269
T n(00), T y(e),
T y(e → t)
F o(U), ?∃x.F o(x),
F o(Kuj′ )
F o(ǫ, x, P LU R(x)), ♦
This is a trivial observation; but, at this juncture, there are two options, and these
are not trivial at all. The first is to continue scanning lexical input and to develop
the unfixed node by introducing a LINKed structure and using that to build up
a compound formula. The second option is to Merge the unfixed node with the
subject node, and only after that, to introduce some linked structure to build up
the appropriate compound term with whatever further words are provided. It is
the availability of these two options which explains the different agreement options.
The first option reflects plural agreement: the second option reflects singular agree-
ment. Furthermore, in the second option, there is a stage in the derivation where
the representation of Haroub is the only subject of the assertion, and it is this
(intermediate) stage which accounts for the intuition that (7.47b) is an assertion
about Haroub.
To see these points more clearly, let us continue with the derivation of (7.47a),
with plural agreement. Here, the first option is relevant, that is, the unfixed node is
not yet merged, and the lexical information from na and Nayla induces the building
of a linked structure, whose top node the actions of Nayla decorate, exactly as
we saw in the example with preposed subject. The step that then follows is the
evaluation of the linked structure, to form a compound group-denoting term at
the unfixed node, which combines two singular terms as a formula decoration for
the unfixed node, again as we saw before. At this point with the LINK structure
complete and evaluated, the unfixed node can (indeed must) validly Merge with
the subject node as shown in (7.50).
(7.50) Parsing wa-li-kuja Haroub na Nayla:
T n(0), ?T y(t)
T n(00), T y(e),
T y(e → t)
F o(U), ?∃x.F o(x),
F o(Kuj′ )
F o(ǫ, x, P LU R(x)), ♦
T n(00), T y(e),
T y(e → t)
F o(U), ?∃x.F o(x),
F o(Kuj′ )
F o(ǫ, x, SIN G(x)), ♦
The two quantified terms decorating the subject node as a result of this merge step
collapse trivially to:
Since Haroub is a singular entity, the update of the metavariable F o(U) by the term
denoting Haroub is unproblematic, with the result that at this stage, the subject
node is decorated with a term denoting a singular entity as subject, signally match-
ing the singular specification of the verb’s pronominal-subject-agreement marking.
It is now only after this step that the conjunction, and the second conjunct are
parsed, introducing and decorating a linked structure to give the tree in (7.52).
(7.52) Parsing a-li-kuja Haroub:
T n(0), ?T y(t)
T n(00), T y(e),
T y(e → t)
F o(ǫ, x, SIN G(x)),
F o(Kuj′ )
F o(ι, x, Haroub′ (x) ∧ SIN G(x))
¿From this step on, the sequence of actions is all familiar. By LINK evalu-
ation (conjunction), this can be developed into a tree whose subject node is
decorated by:
And so with this compound formula as subject of the verb kuja we again derive the
same resulting formula:20
F o(Kuj′ (ι, x((λy.Haroub′ (y) ∧ SIN G(y)) ∧ λz.Nayla′ (z) ∧ SIN G(z))(x)))
The final trees of all examples discussed so far are effectively identical, reflect-
ing correctly the fact that the denotational content expressed by these examples
is the same, namely that Haroub and Nayla came. The bonus is that the specific
pragmatic interpretations associated with the different sequences result correctly,
not from differences in the final tree, but from the differences in the sequence of
building up the structure. The availability of a presentational focus construal with
postposed subjects results from the presumed unavailability of a contextual inter-
pretation of the metavariable associated and the late construction of the formula
value only once the structural template is provided by the verb. The intuition about
the aboutness of verb-subject structures with singular agreement, namely that the
assertion is about the individual itemised by the first conjunct, and not about the
group of individuals picked out by the whole conjunction, is directly reflected in
the availability of an intermediate tree in which only the information from the first
conjunction is projected at the subject node.
A final addition to the analysis so far in this context is to assume that the LINK
structure introduced by na can be constructed from a position in the string not
adjacent to the head:21
(7.53) Haroub a- li- kuj-a na Nayla
Haroub 1-SUBJ3rd.sg PAST come-FIN and Nayla
‘Haroub came with Nayla.’
In order to explain examples like these, we assume that the pointer may move back
to the subject node after the verb has been scanned and that a LINK relation from
the subject may be projected so as to accommodate the lexical actions defined by
Nayla. This would lead to an analysis where (7.53) results in the same final tree
as all examples discussed so far, but, again, crucially differing in the way these
final trees are derived. For the derivation of (7.53), it is necessary that the unfixed
node decorated with the information from Haroub merges with the fixed subject
node projected from the verb before the LINK relation is built. This is because,
firstly, according to our lexical definition of na, the LINK relation can only be built
from a node with a fixed formula value, so that the LINK construction process
cannot carry over the metavariable decoration provided by the verb. Secondly,
the pointer can only move to the fixed subject node, but not to the unfixed node,
so the LINK construction process cannot be built from the unfixed node. The
result is that, in order to build a LINK relation, the subject node has to have
been decorated with the formula value from Haroub. In consequence, end-placed
conjunctions with na can only occur with singular agreement, the plural agreement
marking is precluded:22
(7.54) *Haroub wa- li- kuj-a na Nayla
Haroub 2-Subj3rd.pl PAST come-FIN and Nayla
Intd.: ‘Haroub came with Nayla.’
20 This is a formula differing only from earlier formulations in lacking an explicit predicate of
I like.
22 The return of the pointer to a completed subject node assumed here is used in chapter 8 to
The examples discussed so far show how the interaction between agreement
and word-order receives a natural explanation from a dynamic perspective. What
we have developed are analyses for sets of well-formed strings, all with the same
semantic interpretation, whose syntactic explanation, in particular the distribution
of agreement morphology, as well as their discourse-pragmatic status have been
shown to be crucially dependent on the transitions from initial to final tree.
markers must, like their subject counterparts, lack any bottom restriction.
7.3. AGREEMENT AND CO-ORDINATION: ANIMATES 273
T y(e),
h↑∗ iT n(0), ?∃x.T n(x), F o(U),
?T y(e → t)
F o(Asha′ ), T y(e) F o(ǫ, x, SIN G(x),
?∃x.F o(x)
hL−1 ih↑∗ iT n(010), T y(e), (ι, z, Nayla′ (z) ∧ SIN G(z)), T y(e)
With both conjuncts parsed, the information from the LINKed structure can, at the
next step, be evaluated as part of a compound term compiled on the unfixed node
in order to provide the appropriate plural entity that can be consistently merged
with the decorations on the object node. Finally, the unfixed node decorated by
Asha is merged with the subject node and the tree can be completed coherently,
leaving no outstanding requirements.
In contrast, (7.57) shows a snapshot of the parse of the string in (7.55b) with
singular object marker. Here, as before with subject post-posing, the unfixed node
decorated by Haroub merges with the object node prior to parsing of the conjunc-
tion.
(7.57) Parsing Asha a-li-mw-ona Haroub na Nayla:
T n(0), ?T y(t)
T y(e), F o(U),
h↑∗ iT n(0), ?∃x.T n(x),
F o(ǫ, x, SIN G(x), ?T y(e → t)
F o(Asha′ ), T y(e)
?∃x.F o(x)
Different object agreement, like different subject agreement, thus results in the pro-
jection of identical final trees, with differences in the construction process, namely
whether Merge occurs before or after the LINK structure induced from na is built.
A further parallelism between subject and object agreement can be seen with
fronted objects. Objects in Swahili can freely be fronted, as the following examples
show. However, for fronted conjoined objects, verbal agreement has to be plural:
274 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
It is clear from these examples that the relevant parameter determining agreement
possibilities is word-order, and that object and subject agreement behave exactly
alike in this respect: If the conjoined NP precedes the verb, only plural agreement
is possible, while if the conjoined NP follows the verb either plural or singular
agreement is possible.
The relevant DS analysis follows from the analysis developed so far, again on
the assumption that the fronted objects are projected unto unfixed nodes at the
outset of the parse (though here, as in other cases, a LINK analysis would, equally,
be appropriate).
(7.60) Parsing Haroub na Nayla a-li-wa-ona :
T n(0), ?T y(t)
hL−1 ih↑∗ iT n(010), T y(e), (ι, z, Nayla′ (z) ∧ SIN G(z)), T y(e)
The tree in (7.60) shows the unfixed object, which has been constructed first in the
parse sequence, together with a LINKed structure from which a compound term
has been compiled as formula value for the unfixed node. Such a structure trans-
parently allows the unfixed node to Merge with the plural-marked object node.
Should, however, the verb be marked with a singular object pronominal agreement
marker, no such Merge could take place, the compound term being already com-
piled by that stage of the interpretation process. This result is a straightforward
extension of the analysis to the object-marking data, and does not involve any
further stipulation, confirming the approach.
relation is in some sense often less structural than with the animate NP cases we
have discussed so far, and is better characterised as an anaphoric relation satis-
fied across a LINK relation. This can be seen at least partly as a function of the
difference between animate and non-animate agreement markers. Animate agree-
ment markers encode singularity and plurality as a restriction on the substitution
of the metavariable they project, and this is provided by other terms presented
in the same structure, hence indicating a construction process involving a step of
*Adjunction to introduce an unfixed node with Merge with the node decorated
by the agreement marker. Non-animate agreement markers, on the other hand,
encode a restriction with no structural reflex, which is merely that the substituend
provided by the antecedent can be construed as fitting into the same set as that
indicated by the class of the agreement marker. We will thus use examples with
non-animate NPs in the following section to illustrate agreement possibilities across
a LINK relation.
conjoined NP without the need to absorb the created term and its associated struc-
ture into the propositional structure. Since the category of number is not relevant
in these cases, we assume, in line with Schadeberg (2001), that pronominal pre-
fixes with a noun class indicating a non-animacy restriction project a metavariable
with a presupposition, not relating to number, but to the set membership of this
class. Thus, in (7.61), the question in constructing a representation for the string is
whether the concept associated with the formula value constructed from sabuni and
maji, i.e. an assortment of soap and water, provides an appropriate substituend for
the metavariable projected as the formula value of vi- which we analyse as F o(U8 ).
The answer to this is yes, as the grammaticality of (7.61) shows. The choice of
agreement morpheme for conjunctions of non-animates in Swahili is then to some
extent pragmatically determined, reflecting contingent properties of the objects de-
noted. We would expect speakers to vary in their acceptance of one or more of
those pronominal forms (i.e. those found in the examples above, with vi-, zi-, or
u-, and possibly others) which plausibly in a given context can serve as anaphoric
element for a given antecedent. A schematic tree display showing how (7.61) is
parsed is shown in (7.63), using rules for constructing topic structures in chapter 4
(see section 4.2.1).
(7.63) Parsing (7.61):
hLiT n(0), T y(e), T n(0), ?T y(t),
F o(ǫ, x, (λy.Sabuni′ (z) ∧ λz.Maji′ (y))(x)) ?hDiF o(ǫ, x, (λy.Sabuni′ (y) ∧ λz.Maji′ (z))(x)))
T y(e), F o(U8 ), ♦
⇑ T y(e → t),
F o(ǫ, x, (λy.Sabuni′ (z) F o(Saidi′ (ι, y, Juma′ (y)))
∧λz.Maji′ (y))(x)))
analysed as agreement with the last conjunct of a pre-verbal conjoined NP are in fact instances of
anaphoric agreement across a LINK relation to a compound term, of which the first conjunct is
used to decorate a structure LINKed to the node decorated by the second conjunct (see Ashton
(1947: 311), Schadeberg (1992: 22), Marten (2000)):
wema huu na hisani hii ha-i-nenekan-i
(i)
11-goodness 11-this and 9-kindness 9-this NEG-9SUBJ-sayable-NEG
‘This goodness and this kindness cannot be expressed (in words) ...’
It is worth noting that examples like this are only ever reported with non-animate NPs.
7.4. AGREEMENT ACROSS LINKED STRUCTURES 277
In this example, the object clitic shows class 9 agreement, and the object NP is
a conjunction of two class 9 nouns. The fact that both conjuncts are of the same
class here is accidental, and we assume that the object clitic is construed as picking
out the term provided by the first conjunct. The analysis of this type of example
parallels our analysis for the animate cases above, except that we assume that
the post-verbal object serves to decorate a node at the top of a linked structure.
The term constructed indeed functions as a background topic here, presumed to
have been independently identified in the context in some form (we assume that
the subject is identified from context as Asha).26 Such LINKed structures can
be constructed in two ways. The first is via the construction of a LINK relation
from the top of the propositional structure, analogous to She talks too fast, that
woman, as we saw in chapter 5. Upon this construal, the metavariable provided by
the agreement marker must be already identified as some term identified from the
context to allow compilation of the full propositional structure. Here, the context
provides the group denoting term (see footnote 26):
26 Indeed the picture and the frame are mentioned in the previous section of the novel, from
which this example is taken, so the analysis accurately matches the rhetorical effect intended by
the writer.
278 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
F o(U9 )
⇑ F o(Ti′ )
F o(ǫ, x, (Fremu′ ∧ Picha′ )(x))
T n(0), ?T y(t)
The examples show a conjunction of a class 7 and a class 3 noun, and the differ-
ent verbal agreement markers possible with it. The first two examples show default
agreement with class 8 pfi-, similar to Swahili class 8 agreement with vi-. However,
the second two examples show instances of ’pragmatic’ agreement not found in
Swahili. In (7.72), the verb has a class 7 agreement marker, and in (7.73) a class 3
agreement marker corresponding to the first and the second conjunct respectively.
The point here is that the conjunct which triggers verbal agreement provides in-
formation which is pragmatically prominent, and emphasised (represented in the
English translation by the use of small capitals). Pragmatic agreement is sensi-
tive to word-order as in the Swahili cases. The following examples show pragmatic
agreement with one conjunct of a fronted object, but the unavailability of agreement
with the second conjunct if the object follows the verb:
(7.74) ichi-ti na ghumu-biki wanzehe wa- chi- ghul-a
7-chair and 3-tree elders 2-SUBJ 7-OBJ buy-FIN
‘A chair and a tree the elders bought.’
[7 + 3 ≫ 7]
(7.75) ichi-ti na ghumu-biki wanzehe wa- u- ghul-a
7-chair and 3-tree elders 2-SUBJ 3-OBJ buy-FIN
‘A chair and a tree the elders bought.’
[7 + 3 ≫ 3]
(7.76) *wanzehe wa- gha- ghul-a li-banzi na ma-bwe
elders 2-SUBJ 6-OBJ buy-FIN 5-wood and 6-stone
Intd.: ‘The elders bought a wooden board and stones.’
[∗6 ≪ 5 + 6]
While we do not develop a full analysis of the Luguru facts here, they do not seem
particularly intransigent. Given that all processes are optional, the unfixed node
decorated by the subject may be left unmerged and the LINK evaluation process
in setting out the subject also left unevaluated, both forms of underspecification
being delayed until after the verb is parsed. This would allow the match between
agreement markings between the first provided term and the pronominal agreement
marking. Alternatively, given the pragmatic nature of antecedent selection, the
choice of antecedent might be made from the smallest possible context, that being
deemed to pick out the set of properties most recently added to the context, that
of the second term. On either strategy, the way in which interpretation would be
being built is analogous to the two-stage construal of expletives, with an initial
partial specification and a later update. Such a delay is not available to the parsing
of the object since pointer return to the object node, given that the sequence of
object NPs is in any case after the verb, would be precluded. Of course, any such
account would need rounding out in detail, but it would seem very much within
the general spirit of the explanation, and furthermore an appropriate reflex of a
phenomenon undergoing change.28
28 Analogously, in Spanish, Rio-Platense Spanish object clitics are shifting into having the free-
dom of the dative expletives, and in so doing, losing the bottom restriction, which, in standard
Spanish, distinguishes the two forms of clitic.
7.5. SUMMARY 281
7.5 Summary
Our main topic in this chapter was the analysis of agreement with conjoined NPs
in Swahili, and how this relation can be analysed from a dynamic perspective. This
also provided a welcome opportunity to look more closely at two of the defining
features of Bantu languages, namely the systems of verbal agreement and nominal
classification in the noun class system. We have seen that agreement in Swahili
shares a number of characteristics with the pronominal clitic system of Romance
languages, and we have proposed an analysis of Swahili subject and object agree-
ment markers along the lines sketched earlier for Spanish. The difference between
the systems of these two language groups is, from this perspective, mainly one of
relative freedom of positioning. The Romance clitic system is slightly more free,
with cross-linguistic variation in clitic ordering, while in the Bantu system the
pronominal markers are in a fixed position with respect to the verb stem and tense
morphemes. However, both systems share essential pronominal features which allow
for both cataphoric and anaphoric uses of the pronominal elements. Our analysis of
the noun class system was based on the observation, made by several researchers,
that there is a considerable semantic core to nominal classification in Bantu, and
that speakers can exploit this semantic information in the construal of pronom-
inal, underspecified elements. This analysis explains the agreement facts solely
with recourse to the dynamics in which semantic structure unfolds, and so doing
explaining both the denotational equivalent, and yet the pragmatic differences of
structures with different agreement properties. It goes without saying that more
empirical and theoretical work in this area is necessary, but we take the dynamics
of the parsing-oriented system to have illuminated a complex morphological area.
282 CHAPTER 7. SWAHILI AGREEMENT AND CONJUNCTION
Chapter 8
Copula Constructions in
English
In the last two chapters, we have been exploiting the ability of Dynamic Syntax
to allow steps of inference to interact with the syntactic process of establishing
representations of content to provide explanations of problematic linguistic data.
The use of various kinds of underspecification permit these explanations through
processes of update, either derived from computational actions (*Adjunction and
Merge) or through pragmatic reasoning (abduction and substitution). In this
chapter, we push these ideas a bit further, and develop the analysis of expletives put
forward in chapter 5 and developed for Swahili in chapter 7 into a (partial) account
of the copula in English that utilises semantic underspecification and pragmatic
enrichment. In particular, we present an account of the predicative construction,
and show how an analysis of the ‘double expletive’ construction involving forms
of the string there be can be given. We end the chapter with a brief account of
equative and specificational copular clauses which exploits the properties of the
LINK mechanism. In all these constructions, we argue, the interpretation of the
string crucially depends on the properties of the expressions that appear with the
copula, the context in which the string is uttered and the parsing process itself. This
will require us to look a bit more closely at the syntax/pragmatics interface and the
discussion in this chapter is thus inevitably more speculative than previously. We
believe, however, that it provides enough substance to give the reader an idea of
the explanatory potential of Dynamic Syntax for constructions whose interpretation
varies with local and non-local context.
283
284 CHAPTER 8. COPULA CONSTRUCTIONS IN ENGLISH
constructions (8.1g):1
(8.1) a. Mary is the dancer.
b. John is silly.
c. There is a riot on Princes Street.
d. It’s me.
e. It is Mary who is the dancer.
f. What we want is a good review.
g. Neuroses just are (they don’t need a cause)
The variability in the interpretation of be in (8.1) is further compounded by the
subtle differences in meaning exhibited by very similar sentences. For example,
copular clauses involving a definite noun phrase give rise to slightly different in-
terpretations according to whether the definite NP precedes or follows the copula.
Equative clauses, as in (8.2a), involve a post-copular definite which appears to be
fully referential, while specificational clauses, as in (8.2b) involve an initial definite
which appears to provide a description of an unknown entity, rather than to pick
out some specific object.2
(8.2) a. John is the culprit.
b. The culprit is John.
Whether a string consisting of two noun phrases and a form of the copula is in-
terpreted as predicative or equative thus depends largely on the definiteness of the
post-copular term: an equative reading is only possible if this is definite.3 Further-
more, if both noun phrases are definite, then either an equative or a specificational
reading may result, depending on whether the post-copular term may (or must)
be interpreted as fully referential in context and whether the initial term need not
be. A sentence such as that in (8.3) where both noun phrases contain the definite
article may be interpreted as equative or specificational according to the context of
utterance.
(8.3) The culprit is the teacher.
Such variation in interpretation according to the definiteness of a noun phrase
is found also in constructions of the copula with the expletive pronoun there. So,
for example, when the post-copular noun phrase (the associate) has a weak (or
intersective, Keenan and Stavi (1986), Keenan (2001)) determiner, this gives rise
to the ‘standard’ existential construction illustrated in (8.4a). With post-copular
definites, however, we have presentational or locative readings as in (8.4b)-(8.4c),
while numerals may give rise to existential, presentational or locative interpretations
as in (8.4d).
8.1.1 There be
There have, of course, been a number of interesting and elegant attempts to deal
with this problem semantically (see in particular Heycock and Kroch 1999). How-
ever, the problem of context dependence reasserts itself, more strongly, with respect
to constructions involving there be. This construction gives rise to a range of dif-
ferent interpretations, depending on the properties of the post-copular noun phrase
(often called the ‘associate’) and the rest of the clause (often referred to as the
‘coda’ or the ‘extension’ in Huddleston and Pullum 2002)). In the examples in
(8.6) below, we have existential, presentational and locative readings associated
with minimally different syntactic contexts.
(8.6) a. There’s a chemist shop on Princes Street.
b. There is the chemist shop on Princes Street that you wanted to go to.
c. There’s that chemist shop again.
The existential/presentational distinction seems to correlate with the definite-
ness of the post-copular noun phrase. Clauses with definite associates are thus
4 Partee, in fact, allows a variable type with the arguments of the expression appearing in either
Again this construction does not seem to involve different interpretations for there
be, as illustrated in (8.12) where a definite or an indefinite may be conjoined with
a universal to give possible mixed readings.
(8.12) There’s the Chancellor/a lord of the realm and every student of mine
coming to my inaugural.
If it is true that the phrase there be itself does not have different interpretations
directly, then the interpretation of the various constructions involving this string
must result from inference over whatever single meaning it has and the meanings
of its associates and codas. Analyses of the existential construction typically con-
centrate on properties of the associate and mostly on the existential reading. As
already noted, in one of the most influential semantic accounts, Keenan (1987)
identifies associates as needing to be intersective DPs in order to give rise to an
existential reading. Musan (1995), on the other hand, analyses the construction
in terms of a temporal variable indicating stage level predication, while McNally
(1998) interprets the construction in terms of the properties of non-particulars. In
a more pragmatic account, Zucchi (1995) argues that the existential reading occurs
just in case the associate presupposes neither the existence nor the non-existence
of some entity. Ward and Birner (1995), concentrating on definite associates, again
adopt a pragmatic approach to the felicity of such constructions, attributing ac-
ceptability to the possibility of construing the post-copular definite as providing
‘hearer-new’ information.
We do not go into a discussion of these various accounts, but it is notable that
in none of them is there an analysis of the string there be. The following statement
by Louise McNally sums up the apparent attitude of most researchers in this area
(although very few even acknowledge this lacuna in their discussion):7
for its interpretation. With variability in acceptability also being dependent on the
context of utterance, we have strong reasons for preferring an analysis of be that is
inferential rather than semantic, with interpretation determined through pragmatic
negotiation between the associates of the copula and non-local context. This is the
approach that we take in this chapter: the interpretation of be is context-dependent
and not ambiguous.
8.1.2 Intransitive Be
One of the problems of trying to treat the copula as non-homonymous, however,
is the fact that it appears to be able to take complements of various numbers and
types. So, for example, the constructions illustrated in (8.1) imply a number of
different types that could be ascribed to the copula amongst which are e → (e → t)
(equative), (e → t) → (e → t) (predicative), and e → t (existential focus). This
flexibility of complement type is not matched by other auxiliary verbs, including
have, compare (8.13) with (8.14) and (8.15).
a friend of yours.
the teacher.
in the garden.
playing football.
(8.13)
A: Kim is disliked by Hannibal.
happy.
misunderstood.
*play cricket.
*a friend of yours.
*the teacher.
in the garden (ellipsis).
*playing football.
(8.14)
A: Kim can *disliked by Hannibal.
*happy.
*misunderstood.
play cricket.
a friend of yours.
the teacher.
in the garden (ellipsis).
*playing football.
(8.15)
A: Kim has *disliked by Hannibal.
*happy.
misunderstood.
*play cricket.
This variability in apparent complement type presents quite a serious problem in
trying to establish the syntactic properties of the copula, leading in frameworks like
GPSG (and HPSG) to the postulation of syntactic homonymy for be (Gazdar et
al. 1982, Warner 1993). If be is semantically non-homonymous, however, syntactic
homonymy should also be excluded. Of course, we could take the view that the type
of the copula is simply underspecified and assign it a type T y(U → t) where the
8.2. REPRESENTING THE CONTENT OF THE COPULA 289
metavariable ranges not only over different types,8 but also over strings of types.9
However, such a move does not really solve the problem, as it gets us back to
the quandary of how different complement types lead to different interpretations.
Instead, we take a more restricted view of the adicity of be than is normally taken:
that it projects one and only one argument. In other words, the copula is a verb
that has a subject but no complements.
This position is supported by the data in (8.16) below, which reveal a further
difference between the copula and the modals. The former allows construal of ex-
istence in a null context, while the latter, such as may and can, do not license
interpretations where the general modality, such as possibility and ability, are as-
cribed to the subject. Without a complement VP, modals can only be interpreted
elliptically, whereas, as we have already seen, be can give rise to a non-elliptical
interpretation of existence in intransitive contexts.
(8.16) a. Neuroses just are. (= Neuroses exist)
b. Neuroses just may. (6= Neuroses are possible)
c. The students just can. (6= The students are able)
These differences from the auxiliaries, the variability in apparent complement
type and non-elliptical interpretation in intransitive contexts, all lead to the same
conclusion: that be uniformly projects a one-place predicate of type e → t.10 11
and property subjects. We do not explore such constructions here, but they do not undermine the
essence of the current analysis. The important point here is that be does not project an internal
argument, whatever the properties of its subject argument may be.
290 CHAPTER 8. COPULA CONSTRUCTIONS IN ENGLISH
likely to be too strong for a full analysis of the auxiliary system in English, as it would prevent
multiple occurrences of the verb be itself, thus rendering ungrammatical strings like John has been
singing. However, we leave the necessary refinements for future research.
8.2. REPRESENTING THE CONTENT OF THE COPULA 291
IF ?T y(t)
THEN IF h↓1 iT y(e → t)
THEN Abort
ELSE put(T ns(P RES)); make(h↓1 i); go(h↓1 i);
(8.19) is put(T y(e → t), F o(BE), ?∃x.F o(x));
go(h↑1 i); make(h↓0 i); go(h↓0 i);
put(?T y(e))
ELSE Abort
ELSE Abort
T y(e → t),
?T y(e), ♦
F o(BE), ?∃x.F o(x)
From the tree in (8.20), the parse of a following subject noun phrase can proceed
straightforwardly. But what of non-initial is? Again, this is straightforward to
account for, in what should now be a familiar fashion. Should Introduction
and Prediction create subject and predicate nodes, an initial subject NP can be
parsed and after Completion the pointer will be at the open propositional node,
as shown in the first tree in (8.21). This allows the parse of the copula to give
the second tree in (8.21). The fact that the pointer is again at the subject node is
not a problem, as Completion can move the pointer back again to the topnode
(although not necessarily immediately as we shall see below).
(8.21) Parsing Kim is :
?T y(t), ♦ 7→ ?T y(t)
required, that shown in (8.24), where the predicate satisfies the presuppositional
constraint.13
(8.23) Parsing Mary was:
?T y(t)
in DS.
8.2. REPRESENTING THE CONTENT OF THE COPULA 293
Making this substitution gives rise to the output formula in (8.28a) which, by the
established equivalence in the epsilon calculus shown in (8.28b), gives rise to the
existential statement in (8.28c).
(8.28) a. F o(Neurosis′ (ǫ, x, Neurosis′ (x) ∧ P LU R(x)))
b. P (ǫ, x, P (x)) ↔ ∃x.P (x)
c. ∃x.Neurosis′ (x) ∧ P LU R(x)
What is interesting here is that the existential interpretation is provided directly
without any reference to Scope Evaluation. Indeed, the output in (8.28a) is
exactly what such evaluation would derive, given the scope statement Scope(Si <
x). We take this to give rise to further pragmatic effects, buttressing the existential
interpretation.
While more needs to be said about the existential focus construction, especially
with respect to the possibility of quantified subjects and the interaction with tense,
it should be clear from this discussion that the treatment of be as projecting seman-
tically underspecified content that may be pragmatically enriched provides a basis
of a unified account of both ellipsis in copula clauses and existential focus readings,
an unexpected result.
within the tree and the requirement that BE be replaced by some contentful con-
cept. Note that Merge is licensed only if the unfixed predicate is of the right sort,
i.e. not something that denotes an eventuality. This correctly excludes *John is
run, etc.16 This process is illustrated in (8.30), from the parse of the initial word
John, through the parsing of the copula, the unfolding of the unfixed node and the
parse of the predicate to give the result in the final tree.
(8.30) Parsing John is happy.:
a. Parsing John:
T n(0), ?T y(t)
T y(e),
T n(01), ?T y(e → t)
F o(ι, x, John′ (x)), ♦
b. Parsing John is:
T n(0), ?T y(t), T ns(P RES)
Prepositional predicates may be treated in the same way, under the (natural)
assumption that such expressions may be of predicate type. So, a sentence like that
in (8.31a) gets the formula value in (8.31b).
(8.31) a. Robert was on a train.
b. λx.On′ (x, (ǫ, y, Train′ (y)))(ι, y, Robert′ (y)).
For common noun predicates, in some languages such as Classical (and Modern)
Greek, the nominal predicate may be treated directly as a predicate just like an
adjective or a prepositional phrase and be analysed accordingly, the expression in
(8.32a) giving rise to the formula value in (8.32b) through Merge of the nominal
predicate with the metavariable projected by the copula.17
(8.32) a. ho sōkratēs ēn philosophos.
the.nom.sg Socrates.nom.sg be.3.sg.impf philosopher.nom.sg
‘Socrates was a philosopher.’
b. Philosoph′ (ι, y, Socrates′ (y)).
8.2.2 Do
Before further exploring other copular constructions, we pause here to look at the
other very underspecified auxiliary verb in English, do. The behaviour of this verb
is very similar to that of the copula, except that it associates with predicates that
denote true eventualities, i.e. those expressed by verbs in English. So we have the
pattern shown in (8.33).18
(8.33) a. Does Mary like beans?
b. Mary does.
c. Mary does like beans.
d. Mary likes beans, Mary does.
e. % Mary does happy.
We, therefore, treat do as projecting an underspecified predicate just like the copula,
only this time the metavariable is constrained only to be substituted by eventuality-
denoting predicates. As with be, we write this metavariable in mnemonic form:
F o(DO) =def F o(UEventuality )
Since do shows the NICE properties just like be, its associated lexical actions con-
struct a propositional template, as shown in the set of actions shown in (8.34).19
IF ?T y(t)
THEN put(T ns(P AST )); make(h↓1 i); go(h↓1 i);
put(T y(e → t), F o(DO), ?∃x.F o(x));
(8.34) did
go(h↑1 i); make(h↓0 i); go(h↓0 i);
put(?T y(e))
ELSE Abort
17 We discuss how nominal predicates in English may be analysed in Section 8.5 below.
18 Note that (8.33e) can only be interpreted as something like ‘Mary is engaged in actions that
can be construed as deriving from her being happy’, not that she is happy.
19 Unlike the copula, do might be best treated as ambiguous between a main verb and an
The analysis of an emphatic sentence like Mary did eat some beans proceeds in
exactly the same way as an analysis of the predicative copular construction shown
above, with an unfixed node providing the value of the eventuality metavariable.20
There is, of course, more to be said about do and we say a bit more in chapter 9 with
respect to ellipsis. But we hope to have shown how auxiliaries can be incorporated
into DS without undue problems.
20 We assume here that parsing finite forms of verbs abort if a local tense specification has
already been constructed, thus excluding *Mary did ate the beans, at least in standard British
and American English.
21 We assume here for convenience that once parsed, the genitive constitutes an argument of the
We thus get an interpretation in which John often meets Bill at the clubhouse
(when John is at the clubhouse).22 Notice here that we are now taking annotations
to metavariables more seriously that we have done so far.
What of the expletive uses of there? The most common assumption is that
it has become a full expletive without any remaining trace of its locative usage.
However, we find the assumption that there is fully bleached of all meaning to be
incompatible with an analysis that tries to account for the meanings of different
there be constructions. In particular, such a hypothesis seems to provide no basis
for an account of presentational uses. Consider again example (8.4c), repeated
below.
(8.4) c. There is the student that you wanted to see in the corridor.
This sentence might be used locatively to tell the hearer that some specified student
is here (in the corridor) or ‘presentationally’ to bring the situation as a whole to the
hearer’s attention, perhaps reminding her that her afternoon appointments are not
completed yet. Interestingly enough, the simple copula clause without there (the
student you wanted to see is in the corridor ) can be used to express the locative
reading but not the presentational one. The former reading follows from the explicit
locative, but the latter has to do with something more abstract, as we show below.
We therefore maintain that some remnant of the locative presupposition does
remain with the expletive, but that the projected metavariable satisfies not the place
of the locative relation, but the thing: ULOC(U,PLACE) .23 In other words, part of
22 We do not provide a full analysis of this example, as the discussion would take us too far from
the current topic, nor do we address the question of the variability in type associated with PPs.
23 Diachronically, this could arise from a locative inversion construction being interpreted as
providing a subject predicate relation directly and the initial locative being interpreted as entity-
298 CHAPTER 8. COPULA CONSTRUCTIONS IN ENGLISH
denoting like other type e subjects. This approach also provides a means of incorporating the
Perspective Structure in existential constructions of Borschev and Partee (1998), Partee and
Borschev (2002). But we leave this to one side.
24 Note that the pointer remains on the decorated subject node after parsing, allowing substi-
tution by Merge or through the pragmatic process of substitution. So we allow A cat, there is
in the garden, inter alia.
8.3. TOWARDS AN ACCOUNT OF ‘THERE BE’ 299
This allows the parse of the post-copular indefinite noun phrase, a riot, which
merges with the subject node to provide the content of the metavariable as illus-
trated in (8.41).
(8.41) Parsing There’s a riot :
?T y(t)
At this point, the subject node is complete and the pointer moves to the predi-
cate node which provides a means of analysing the coda, on Princes Street, as a
straightforward prepositional predicate phrase. Just as with normal predicative
constructions, this is achieved through an application of Late*Adjunction and
Merge, as shown in (8.42).25
(8.42) Parsing There’s a riot on Princes Street :
T n(0), ?T y(t)
However, the process by which such a Substitution of the metavariable can take
place is somewhat mysterious under the assumptions we have been making through-
out this book, since the process can only take place when the pointer is at the node
decorated by a metavariable. This constraint is necessary to ensure that Substi-
tution is not randomly applied so that, for example, construing it with an apple
in (8.43) is excluded.
(8.43) ??Every student who brought it to school gave Mary an apple.
We have not elaborated on the representation of the substitutional presuppo-
sitions associated with definites and lexical expressions such as proforms so far in
this book. However, the most natural way to treat these within DS is to take the
presupposition as a LINKed structure. This would mean the parsing there’s should
more properly give rise to the tree display in (8.44) rather than that in (8.39).
(8.44) Parsing There’s with LINK:
?T y(t)
The value of the subject metavariable, U, in the LINKed structure need not be
associated with a formula requirement, since it is a copy of the formula value on
the head node of the LINK structure whose value may therefore be identified at a
later point in the parse process. However, that on the internal argument node must
be satisfied before Elimination can apply, otherwise no substituend will ever be
forthcoming. What could possibly substitute at this point? Of course, the context
could provide an appropriate substituend (a place), but there is a more accessible
potential substituend given within the domain of the matrix proposition. This is
provided by the Axiom and is the index of evaluation, Si which we introduced in
chapter 3. In keeping with the general Relevance-theoretic assumptions of Dynamic
Syntax, we adopt the perspective on pragmatic processes such as substitution
whereby there is a tradeoff between processing cost and information gained, as
we have seen elsewhere in this book. Since Optimal Relevance is defined to be
determined by a trade-off between cognitive effort and informativeness (the more
effort required to access an interpretation the more informative it should be), a
8.3. TOWARDS AN ACCOUNT OF ‘THERE BE’ 301
hearer will be constrained to take as substituend the most accessible formula that is
likely to yield significant inferential effects. The pragmatic process of substitution
occurs within the construction of a propositional representation, however, and so
will tend to prefer substituends which are provided by the immediate discourse
because the domain over which other inferences are to be carried out may not yet
be complete. So we assume that, in order to compile the LINK structure and return
to parsing the rest of the string, the index of evaluation is indeed chosen as the
appropriate substituend. This is a valid move, given that the index of evaluation,
picking out some world-time-place, following Montague (1974), Lewis (1970) etc.,
can be construed as a place for the purposes of the locative predication.
The interpretation of the structure in (8.42), with Si duly substituted for V is
then straightforward. In chapter 3, a rule of evaluation for non-restrictive relative
clauses is provided in which the content of a LINKed tree projecting the content of
such a clause (such as who I like) is ‘cashed out’ at the topnode as a proposition
conjoined to the proposition expressed by the matrix clause. This rule of LINK
Evaluation is given in (3.13) and repeated below:
(3.13) LINK Evaluation 1 (Non-restrictive construal):
The rule is entirely neutral with respect to whatever determines the LINKed
structure (i.e. whether a relative clause or not) and we can thus derive for (8.42) the
conjoined expression in (8.45a) which is evaluated as in (8.45b) and is equivalent
to (8.45c).
(8.45) a. hSi < xi On′ ((ǫ, x, Riot′ (x)), PrincesSt′ ) ∧ LOC(ǫ, x, Riot′ (x), Si )
b. Si : On′ (a, PrincesSt′ ) ∧ LOC(a, Si )
where a = (ǫ, x, Riot′ (x) ∧ On′ (x, PrincesSt′ ) ∧ LOC(x, Si ))
c. ∃x.Riot′ (x) ∧ On′ (x, PrincesSt′ ) ∧ LOC(x, Si )
The interpretation we derive for the existential construction is effectively equiv-
alent to a small clause analysis: the content of the proposition being provided by
the associate and the locative coda. However, there are two differences. In the
first place, the informational effect of asserting the existential is different from an
assertion of A riot is (happening) on Princes Street. This is because in the process
of interpreting There is a riot on Princes Street the hearer is initially presented
with the information that some term needs to be identified that is associated with
some locative presupposition. The content of this term is then presented by the
associate which introduces a new variable, as discussed in Chapter 3, indicating
new information. The coda then provides an appropriate predicate, hence complet-
ing the proposition, but in two stages instead of one, thus giving rise to different
informational effect as discussed in chapters 4, 5 and elsewhere.
The second difference, of course, is that the presuppositional information asso-
ciated with there may be projected as part of the content of the string. This explicit
location of the content of the associate at the index of evaluation buttresses the
existential interpretation derived from the process of scope evaluation. In effect, by
using the there be construction to convey the information that a riot is happening
on Princes Street, a speaker emphasises the fact the riot exists at the place and
time chosen as the index of evaluation of the proposition.
302 CHAPTER 8. COPULA CONSTRUCTIONS IN ENGLISH
F o(U) F o(λx.LOC(x, Si ))
After Merge, the pointer moves onto the predicate node, but here there is no
substituend for BE provided by the string so one needs to be chosen from context.
At this point there are two possible instantiations of the metavariable, one, given
by the locative predicate in the LINKed structure, within the current tree and one
associated with parsing the common noun fruit, provided by the preceding question.
Substituting either predicate would be fine, but in keeping with assumptions set out
above, we take the most accessible predicate, F o(λx.LOC(x, Si )), to be the most
relevant. Since LINK evaluation does nothing more than restate the formula
already derived for the matrix proposition, this yields the final formula value in
(8.48) after scope evaluation.
(8.48) Si : LOC(a, Si )
where a = (ǫ, x, Apples′ (x) ∧ LOC(x, Si )
≡ ∃x.Apples′ (x) ∧ LOC(x, Si ))
In interpreting the answer in (8.46), the locative presupposition is crucial in deter-
mining the content. It explicitly provides the information that apples exist at the
appropriate time and place and hence conveys relevant information.
new variables, but select terms from context, we might expect some differences in
pragmatic interpretation of there be clauses with definite associates, but possibly
not in the way such examples are analysed. In this section, we look briefly at
definite associates and
sketch how the properties they have in DS gives rise to greater reliance on
pragmatics for interpretation than indefinite ones.
As we saw in passing in chapter 2 but have not so far made use of, the metavari-
ables projected by pronouns are sortally restricted to determine the range of terms of
which Substitution can apply, F o(UMale ) for a pronoun such as him, F o(USpeaker )
for first person pronoun, etc. However the use of anaphoric determiners such as
the, this, that serve to ensure that the descriptive condition that controls such se-
lection of terms as substituend can be arbitrarily complex. In this, they play a
dual role: they project a requirement that their value be independently found, but
nevertheless also project a predicate which is itself not in the immediate context,
hence triggering an extension of the context to provide a term that matches that
predicate. Consider for example, the dialogue (8.49), in the context of A’s function
of interviewing students who fail, cheat, or are otherwise in disgrace:
(8.49) A: Who’s seeing me today?
B: That student who cheated.
A: Oh yes, the student who downloaded stuff from the web.
Notice as a first informal observation that wh questions in a conversation between
administrator and assistant at the beginning of the day are characteristically a
request to be reminded as to what is the immediate context for the coming activities
of the day (one’s head otherwise being full of much more interesting matters). Here,
the first sentence sets out such a question as the context relative to which the definite
in the second has to be interpreted, this being the structure whose topnode has as
formula value:26
F o(Seeing′ (A)(WH))
The reply provides as substituend for the metavariable provided by the wh expres-
sion a partially specified term (U, Student′ (U) ∧ Cheat′ (U)), computed by building
a LINKed structure and compiling its content into the restrictor of the presented
nominal, the verb cheated apparently proffered as providing a predicate sufficient
for A to identify a term from A’s context. In the event, it does indeed lead to
the expansion of the minimal context provided by the wh question; but it does
so by using that information together with further general information. What A
has stored in some accessible context, given this trigger, is that some student had
downloaded information from the web, this being, let us assume, a premise of the
form,
Indeed his reply confirms this, equally presented via the presentation of a relative
clause. By the addition of one assumption, (8.50), and a step of deduction this
26 Here for simplicity, we present the wh expression as a simple metavariable.
304 CHAPTER 8. COPULA CONSTRUCTIONS IN ENGLISH
can provide the necessary term to serve as a substituend for (U, Student′ (U) ∧
Cheat′ (U)):27
(8.50) (Student′ (a) ∧ Download′ (a)) → Cheat′ (a)
Given that the term (ǫ, x, Student′ (x) ∧ Download′ (x)) can be substituted in the
input to this inference-step, the derived conclusion is:
where
Substitution of this term in the open structure provided by the wh question will
indeed yield an answer from which regular inferential effects may be derived, given
A’s administrative function.
Consider now a minor variant of (8.49):
(8.51) A: Who’s seeing me today?
B: There’s that student who cheated.
A: Oh yes, the student who downloaded stuff from the web.
The parsing of there’s proceeds as above and Late*Adjunction provides a means
of analysing the definite associate, as before with an intermediate step of inference.
Reasoning over what it means to have downloaded stuff from the web yields the
existential term ǫ, x, Student′ (x)∧Download′ (x)∧Cheat′ (x) exactly as in the simpler
case, replacing the metavariable projected from the demonstrative:28
(8.52) Parsing There’s that student who cheated :
Scope(Si ), ?T y(t)
required. Either it involves a separate lexicon, a lexicon for the language of thought, a storage of
postulates associated with the concepts expressed in LOT, or it is listed as a subpart of the lexical
specification of the word used, here cheat. Since we have not discussed lexically based reasoning
in this book at all, we leave this as an entirely open issue.
28 (8.52) omits the LINKed structure from which the restrictor on the metavariable for the
demonstrative is computed.
8.3. TOWARDS AN ACCOUNT OF ‘THERE BE’ 305
Merge applies at this point, leaving the subject node with a decoration:
F o(ǫ, x, Student′ (x) ∧ Download′ (x) ∧ Cheat′ (x))
LOC((ǫ,x,Student′ (x)∧Download′ (x)∧Cheat(x)),Si )
The content of the predicate metavariable still needs to be identified, and by the
assumptions presented above this will need to be supplied by some local predicate
as far as possible. There are a number of potential predicate substituends in (8.52),
one from the restrictor for the metavariable projected by the demonstrative article,
one from the restrictor associated with the definite article, one from the locative
presupposition, and one from the preceding linguistic context.
(8.53) a. λx.Student′ (x) ∧ Download′ (x)
b. λy.Student′ (y) ∧ Cheat′ (y)
c. λz.LOC(x, Si )
d. λx.Seeing′ (A)(x)
(8.53b) has been used to identify the substituend for the definite associate and so its
informativeness is weak. Under the natural assumption that information directly
represented within the current tree is more accessible than information derived
from the context, (8.53a) is less accessible than (8.53b) in not being part of the
parse of the existential sentence, and so would need to potentially lead to a more
informative proposition. However, by assumption this term identifies the entity
for the hearer so again the information associated with its restrictor is unlikely to
have any informational effect at all, leaving the locative predicate in (8.53c) as the
most accessible predicate to choose. However, given the fact that the addressee has
just asked about her day’s appointments, it is the predicate in (8.53c) which will
directly yield the requisite inferential effects in the context, so will be particularly
salient to the hearer. Making the appropriate substitution and compiling up the
tree yields the formula value in (8.54):
(8.54) hSi i Seeing′ (A)(ǫ, x, Student′ (x) ∧ Download′ (x) ∧ Cheat′ (x)) ∧
LOC((ǫ, x, Student′ (x) ∧ Download′ (x) ∧ Cheat′ (x)), Si )
Notice the importance of context here. The need to construe something informa-
tive to substitute for the predicate metavariable associated with the copula means
that certain examples involving there be will be difficult to interpret except in rich
contexts. For example, it is usually held that there be constructions involving a
definite associate and a locative coda are ungrammatical.
(8.55) ??There’s the student in the garden.
It is true that (8.55) is difficult to interpret in a null context. The explanation for
this provided by the current pragmatic approach is that the predicate projected by
the associate (Student ) is not informative, having been used to identify some ac-
cessible individual. Additionally, a locative interpretation for BE is not obviously
informative because the coda provides an explicit location for the referent of the
associate. Hence, some other predicate must be construed from a wider context to
deduce a relevant substituend for the predicate metavariable. In a null or restricted
context, therefore, it is difficult if not impossible to identify an appropriate inter-
pretation for the string in (8.55). But in a context in which (for example) there has
been a prior utterance by the hearer of the question Who’s missing?, the utterance
of (8.55) provides an instantiation of the predicate λx.M issing ′ (x) derived from
the question and being salient in the discourse. The actual content of (8.55) in
such a context would be something like that in (8.56), where Mary is the student
identified in the context.
306 CHAPTER 8. COPULA CONSTRUCTIONS IN ENGLISH
(8.56) Missing′ (ι, x, Mary′ (x)) ∧ In′ ((ι, x, Mary′ (x)), (ǫ, x, Garden′ (x))).
Further evidence in favour of a pragmatic approach to the interpretation of such
clauses comes from the fact that extra information contained in modifier construc-
tions can sufficiently enrich the context so that the hearer can extract sufficient
information to identify the relevant object (something often noted but rarely ex-
plored). Hence, the predicate modifier again in (8.55a) provides a context in which
the relevant rabbit is persistently in the garden, while the modifier and relative
clause in (8.57b) indicates that where the student is now is relevant to the hearer.
(8.57) a. There’s the rabbit in the garden again.
b. There’s the student you wanted to see in the corridor.
In the latter case, the interpretation is locational and this is easily derivable.
In a situation in which previous linguistic context is lacking or does not provide
an obviously relevant substituend for the predicate metavariable, the locative pre-
supposition is likely to be chosen as the most accessible predicate which is also, by
virtue of the lack of context, likely to be the most informative. So we derive for
(8.57a) the formula in (8.58a), on the assumption that Mary is the relevant student,
which supports the inference that she is here (now).
(8.58) a. Si : LOC((ι, x, Mary′ (x)), Si )
b. |= Si : At′ ((ι, x, Mary′ (x)), Si )
c. |= Si : Here′ (ι, x, Mary′ (x))
Notice that the analysis presented here says nothing directly about definite as-
sociates having to be ‘hearer new’ (Ward and Birner 1995). As with indefinite
associates, such an interpretation results from the process by which the interpreta-
tion of the string is ultimately derived. By uttering there be, the speaker induces
the hearer to construct a skeletal propositional tree as a promissory note for fol-
lowing information. The associate (and any coda) provide the requisite updating
of this structure and, by virtue of the fact that a nominal update of a propositional
structure is interpreted as some sort of focus (see Kempson et al. 2004), the asso-
ciate gets construed as new information, even though the definite form requires the
hearer to process its content as old (given) information. Given a dynamic system,
the discourse properties of these sentences do not have to be directly encoded, but
derive from the parsing process itself.
We have here presented a brief sketch of the there be construction in English that,
by treating both words in the string as expletives, provides a means of accounting
for variations in interpretation in terms of contextual effect and the properties of
definite and indefinite expressions. Much more could, of course, be said about the
construction and the pragmatic effects it induces, but it is, we hope, clear that
it is precisely the underspecified properties of the string there be that induces the
various interpretations in conjunction with the properties of its associates.
T y(e → t),
T n(00), T y(e),
F o(BE)
F o(ι, x, John′ (x)), ♦
?∃x.F o(x)
At this point the pointer is at the subject node and one might assume that it
would move straight on to the predicate node to satisfy its outstanding formula
requirement. However, nothing stops further specification of the node, provided
that no established conditions are contradicted. As the subject term has struc-
ture with terminal nodes decorated by the bottom restriction, Late*Adjunction
cannot apply. But some LINK relation could be projected from this position. As
mentioned in Chapter 7, appositive constructions such as illustrated in (8.62) can
be analysed through a construction rule that induces a LINKed structure from a
completed node of type e to a node requiring an expression of the same type. This
rule is given in (8.63) and its effect on tree growth is shown in (8.64).
(8.62) Professor Kempson, Ruth, a colleague from London is giving a talk next
week.
(8.63) LINK Apposition (Rule):
{. . . {T n(a), . . . , T y(e), . . . , ♦} . . . }
{. . . {T n(a), . . . , T y(e), . . . }, . . . }{hL−1 iT n(a), ?T y(e), ♦}
Applying LINK apposition to the tree in (8.61) yields the tree shown in (8.65) after
parsing the culprit.
(8.65) Parsing John is the culprit via LINK Apposition:
T n(0), ?T y(t)
derived from some operation on the denotation of a noun phrase. Instead, the
identity of the two terms in an equative clause is derived through an independently
required inference rule operating on the output of grammatical operations (LINK
Apposition and Evaluation) that are themselves independently required.
The analysis has not yet finished, however, as the value for the predicate still
needs to be identified. In substituting for the predicate metavariable in (8.65),
the context given in (8.4.1) provides the three candidate predicates in (8.69), the
most informative of which should be chosen as the substituend, according to the
relevance theoretic stance taken in this book.
(8.69) Possible Predicate Substituends:
a. From the culprit : λy.Culprit′ (y).
b. From substituend (and main predicate of A’s utterance):
λy.Drink′ (m)(y)
c. From the (last of the) milk : λx.Milk′ (x)
Of the predicates in (8.69), (8.69c), picking out the property of being milk, is least
likely to be chosen because of the (likely) processing cost needed to derive useful
inferential effect from the proposition that John is milk. Of the remaining two
predicates, that of being a culprit has been used to identify the appropriateness
of substituting ǫ, x, Drink′ (m)(x) for UCulprit′ (U) and so is less informative than
(8.69b), the property of drinking the last of the milk, leaving the latter as the ap-
parently most informative potential substituend in the context. However, this is of
the wrong sort to substitute for BE, being something that denotes an eventuality.
So, even though the predicate λy.Culprit′ (y) is not fully informative, it is neverthe-
less the only candidate for an appropriate substituend in the context, and so must
be chosen.30 The result of this Substitution yields (8.70a) which is evaluated
as (8.70b), a statement asserting that someone did steal the last of the milk (a
confirmation of A’s initial assertion in (8.4.1)) and that that someone is John, as
required.
(8.70) a. F o(Culprit′ (ι, x, John′ (x) ∧ Drink′ (m)(x)))
b. Si : Culprit′ (a)
where a = (ι, x, John′ (x) ∧ Drink′ (m)(x) ∧ Culprit′ (x))
There exists someone who is a culprit and drank the last of the milk
and that person is John.
Notice that the result is almost equivalent to the predicative assertion that John
is a culprit. However, because of the use of the definite, an inference is induced
to identify possible things that could truthfully be called a culprit. The step of
substituting an appropriate term for the metavariable projected by the definite
leads to an interpretation where John is said not just to be a culprit but in fact the
one who drank the last of the milk. By exploiting underspecification of various sorts
and the inferential process of substitution, we have thus provided an account of the
interpretation of equative clauses without the use of the logical identity operator,
leaving open the intriguing possibility that the logical relation of identity does not
form part of the semantic representation system of natural languages (although it
does form part of the interpretation language of that system), but simply from the
notion of content underspecification as applied to the copula.
30 Note that this result is arrived at without actually undertaking any further inference at this
point: milk not being a property semantically predicable of a human being and culprit (under
assumptions already made) subsumes drinking the last of the milk.
8.4. THE EQUATIVE AND SPECIFICATIONAL CONSTRUCTIONS 311
The pointer now returns to the incomplete subject node. As with the analysis of
equatives discussed above, Late*Adjunction cannot apply to develop this node,
in this case because definites are not expletive and so project a bottom restriction
on the node decorated by the metavariable. However, as before, Appositive LINK
formation may provide an open type e requirement permitting the parse of John,
yielding the tree in (8.73).
(8.73) Parsing The culprit is John:
?T y(t)
Unfortunately, at this point, because the host node for the appositive LINK
structure is decorated by a metavariable rather than an epsilon term, Appositive
LINK Evaluation in (8.67) on page 309 will not work here. However, we might
consider splitting the evaluation rule into a two stage process, one that first writes
the formula on the LINKed structure onto its host node (8.74) and then tries to
resolve the apparent contradictory result. One such resolution rule is shown in
312 CHAPTER 8. COPULA CONSTRUCTIONS IN ENGLISH
(8.75) which combines two epsilon terms into one, as we have already seen in the
analysis of equatives.31
(8.74) LINK Evaluation (revised)
How does this help in the analysis of The culprit is John? The answer is straight-
forward. An application of the revised LINK evaluation rule gives rise to the DU
shown in (8.76a). Since a metavariable subsumes any formula value at all, this is
equivalent to that shown in (8.76b) which contains no contradictory information,
with the content of the metavariable thus being supplied by the LINK structure.
In this way, we model the intuition that the initial definite provides a description
of some entity while the postcopular expression identifies it.
(8.76) a. T n(00), T y(e), F o(UCulprit′ (U) ),F o(ι, x, John′ (x)), [↓]⊥, ♦
b. |= T n(00), T y(e), F o(ι, x, John′ (x)Culprit′ (ι,x,John′ (x)) ), [↓]⊥, ♦
As before, we still need a value for BE. In the current context, the most
accessible predicate is that of being a culprit: λx.Culprit′ (x). This has not been
used to identify any substituend and there is no other accessible predicate which
it subsumes. It, therefore, must be chosen as substituend as shown in (8.77a) with
the formula in (8.77b) resulting as that for the propositional tree. From which some
further inference must be made between John’s culpability and A’s inability to find
socks.
(8.77) a. BE ⇐ λx.Culprit′ (x)
b. F o(Culprit′ (ι, x, John′ (x))) (= John is a culprit).
Specificational clauses thus may, as in this case, end up being truth-conditionally
equivalent to predicative clauses, but notice that the process by which the inter-
pretation is obtained is distinct. In parsing John is a culprit, a term is identified,
ι, x, John′ (x), and a property, λx.Culprit′ (x) , is predicated of this without need of
inference. In parsing The culprit is John, on the other hand, the possibility that
there is someone who is a culprit is presented to the hearer and this someone is
identified as John through a process of LINK Evaluation. The informational effects
are thus distinct, with the latter providing focus on John, and at the same time
(in the current context) providing information that may enable the hearer to find
their socks by indicating that somebody may be responsible for wrongdoing with
respect to those socks. Process is central to Dynamic Syntax and forms part of
the procedural ‘meaning’ of an utterance without the need to define different rep-
resentations or layers of information to specifically encode differences in meaning
between different constructions.32
31 Itmay be, as envisaged in footnote 15 of chapter 7 that this is the only evaluation rule needed
(with types suitably left unrestricted). The information copied onto the host node then being
passed up the tree resolved as locally as possible according to the type of the copied formula and
that of the host node. We leave this refinement for formulation at some other time.
32 See Cann (forthcoming b) for further discussion.
8.5. NOMINAL PREDICATES 313
T y(e → t),
T y(e), F o(ι, x, Mary′ (x))
F o(BE), ?∃x.F o(x)
There are two local (and potentially relevant) predicates available for substitut-
ing the predicate metavariable: λx.Mary′ (x) and λx.Teacher′ (x). The substitution
of either predicate in fact yields what we require, i.e. an existential statement. This
is shown in (8.81) where both substitutional possibilities are given first with the
existential statement with which they are truth conditionally equivalent shown in
(8.81c).34
(8.81) a. Mary′ (ι, x, Mary′ (x) ∧ Teacher′ (x))
b. Teacher′ (ι, x, Mary′ (x) ∧ Teacher′ (x))
c. ∃x.Mary′ (x) ∧ Teacher′ (x)
This account automatically excludes quantified expressions other than indefi-
nites from post-copular position. As discussed in chapter 3, scope resolution for
fully quantified expressions must be determined within the local propositional struc-
ture. However, appositional LINK formation is not propositional and there is no
means to resolve the scope requirement and so such expressions are ungrammatical,
just as with the right peripheral background topics discussed in Chapter 5. The
examples in (8.82) are thus straightforwardly excluded.35
(8.82) a. *Tom and Mary are every teacher.
b. *Those people are most students in my class.
c. *Many rabbits are many rabbits.
However, what of quantified subjects? These are fully grammatical with post-
copular indefinites and yet, while the LINK mechanism enables the post-copular
noun phrase to be analysed, there is no resolution rule providing an integrated
interpretation for the expression. Thus, while the tree in (8.83) is well-formed,
applying LINK evaluation would yield a DU on the subject node involving both a
tau and an epsilon term whose content cannot be reconciled.
T n(00), T y(e),
T y(e → t), F o(BE), ?∃x.F o(x)
F o(τ, x, Teacher′ (x))
This is not, however, a problem. All rules in DS are optional, so there is no re-
quirement to apply LINK Evaluation to the tree in (8.83). However, the parsed
post-copular term may provide the substituend for the predicate metavariable. In-
deed, it has to by any reasonable theory of pragmatics: if something is verbally
expressed, then the speaker must intend the hearer to use the utterance to de-
rive an interpretation. Hence, (8.84a) is the only coherent interpretation of Every
teacher is a fool, particularly as the alternative substitution in (8.84b) gives no
information whatsoever (being equivalent to every teacher is a teacher.
(8.84) a. hSi < xi Fool′ (τ, x, Teacher′ (x))
Scope Evaluation: S : Teacher′ (a) → Fool′ (a)
where a = (τ, x, Teacher′ (x) → Fool′ (x))
b. ???hSi < xi Teacher′ (τ, x, Teacher′ (x))
Scope Evaluation: Teacher′ (a) → Teacher′ (a)
where a = (τ, x, Teacher′ (x) → Teacher′ (x))
8.6 Summary
In this chapter, we have presented an analysis of the English copular verb, be, that
treats it uniformly as a one-place predicate with underspecified content. Within
the framework of Dynamic Syntax, this underspecification is represented as the
projection of a metavariable whose actual value must be derived pragmatically
from the context in which the copular clause is uttered. This context involves both
external and local linguistic content and the latter determines to a large degree
whether the copular clause is interpreted as a predicative or equative construction,
no distinction being made between predicative and equative be. It is also shown
how the existential there be construction fits into the general pattern and how
the pragmatic process of substitution can, within an overall Relevance Theoretic
framework, explain how different interpretations are given to different there clauses
and why certain combinations of expressions are difficult to interpret. The processes
specified for the analyses of the different constructions are needed elsewhere in the
grammar and so do not constitute an ad hoc set of assumptions to account for
constructions involving the copula. While there is much more to say about all
these constructions, the apparent success of this style of analysis in covering a
wide range of data with very different interpretive effects supports the dynamic
view of utterance interpretation and the need to move away from static models of
autonomous syntax.
316 CHAPTER 8. COPULA CONSTRUCTIONS IN ENGLISH
Chapter 9
General Perspectives
317
318 CHAPTER 9. GENERAL PERSPECTIVES
ysed relative clauses and co-ordination through the concept of LINKed structures.
But there remains the whole debate about the relation between argument and ad-
junct. Marten (2002b) provides arguments that these constitute yet another case
where there is an alternative analysis in terms of underspecification. According to
this view, VP adjuncts are optional arguments, and it is the verb which is lexically
not assigned a specific type: this is fixed only in the light of the itemisation of the
arguments. With Local *Adjunction now motivated within the panoply of op-
erations available, an alternative account is that Local *Adjunction licenses the
building of a locally unfixed predicate node, so that additional predicate-functor
nodes can be added to yield back a predicate node.3 Yet another alternative is that
all true VP adjuncts are built up as LINKed structures of identical type, as a form
of predicate apposition, analogous to John, the idiot (see chapters 7 and 8).
Detailed investigation of any of these phenomena might simply show that we
were wrong: the sceptic is entitled to think we have only taken data which happen
to fit our case. But, with the wide range of linguistic phenomena that we have
provided explanations for in this book, we remain optimistic that answers to these
problems lie, as above, in concepts of underspecification and their resolution in
context.
In this chapter, we mostly put aside analyses of specific linguistic phenomena
(apart from ellipsis) and address some wider issues. Throughout this book we
have stated that the grammar formalism is context-dependent. Indeed, without
appropriate substituends being contextually provided, many examples containing
pronouns simply cannot give rise to well-formed logical forms. So we need a theory
of context and, specifically, a theory of well-formedness with respect to context.
Given also that our theory is unapologetically parsing-oriented, it is not unrea-
sonable to ask questions about production (although, of course, such questions are
rarely asked of the more standard formalisms) and about the relation between pars-
ing, production and dialogue. Then, further on the horizon, questions arise with
respect to language acquisition and diachronic change, even about language evolu-
tion. This chapter touches on these questions in order to provide some initial, but
certainly not definitive, answers. We begin, however, by addressing the issue of the
relation between syntax and semantics.
stick to this restriction, formal articulation of the semantics is not an urgent task,
without which the claims made cannot even be judged. The expressive power of the
epsilon calculus is identical to that of predicate logic, constituting the explication
of arbitrary names in natural deduction systems of predicate logic. Its semantics
is, by definition, non-distinct from that of predicate logic, and this semantics is
formally very well understood. Even the very bald sketch of tense construal in
chapter 3 and references to groups and plurality in chapter 7 have well-known and
well understood semantics. (see Prior (1967), Link (1983), Lasersohn (1995), etc.).
What we have done, however, is not unrelated to semantic issues, for the demon-
strations of the complexity of the syntax-semantics correspondence in the litera-
ture often take the form of showing that a given string has more than one truth-
denotational content, associated with which are distinct structural properties, hence
justifying the invocation of ambiguity. Examples that we have considered include
restrictive and non-restrictive construal of relative clauses, E-type vs bound-variable
construal of pronouns, different interpretations of the verb be, for all of which we too
have suggested analyses. As with these phenomena, whenever ambiguity occurs on
a broad cross-linguistic basis, we have however taken this as requiring explanation
in terms of some relatively weak intrinsic characterisation of the individual items,
with mappings from that uniform input specification onto divergent outputs via
the interactions between different processes of update. The result has been that we
simultaneously capture the heterogeneity and diversity of natural language inter-
pretation, while nevertheless capturing the underlying systematicity of the natural
language expressions in question. This methodology has been applied throughout
the book. The wealth of apparent structural and semantic diversity which natural
languages may display is analysed in terms of the way relatively simple forms of
input interact with general principles that determine how information is built up
relative to context.
Of course, the ambiguity or under-specification of individual items is not flagged
by the expressions themselves, and has to be argued case by case. Nevertheless,
this emphasis on reducing apparent ambiguity to relatively weak specifications is
not simply an application of Occam’s Razor in terms of a preference for fewer, and
more general explanations (although we do believe this is ultimately the criterion
for choosing between competing analyses). We take the wealth of such data to vin-
dicate our claim about natural language that underspecification and the dynamics
of update constitutes the heart of natural-language structure and is a direct reflec-
tion of the dynamics of language processing, this in its turn being a reflection of
the general cognitive human capacity for information retrieval.
At this point, it is worth looking back to how we set out on the enterprise of
this book. What we set out, by way of introduction, were the two apparently inde-
pendent problems: the context-dependence of natural language, and the apparent
imperfection of natural language in displaying long-distance dependency effects.
What we have argued for is the novel view that the concepts of underspecification
which are familiar in semantic explanations apply also to the central syntactic core
of language as well. And, in extending this concept of underspecification into the
arena of syntax, the dichotomy between what should be explained in syntactic terms
and what in semantic terms collapses. The phenomenon of context-dependence is,
on this view, largely resolved (following the lead of Kamp (1981)) within the con-
struction algorithm that induces representations of semantic content.
It is this construction algorithm for building propositional forms relative to
context which through the rest of this book has been shown to be a vehicle for
expressing syntactic generalisations about natural languages. So the thirty-year-
old question of the relation between syntax and semantics, as raised by theoretical
320 CHAPTER 9. GENERAL PERSPECTIVES
systems.
9.2. CONTEXT AND PARSING 321
This concept of context, ranging over both information that may be independent
of the immediate parsing task and information constructed during the parse task,
is sufficient to cover the major types of pronoun resolution. For example, it covers
both intra-sentential and inter-sentential anaphora, because both the partial tree
under construction and the structure obtained by parsing the previous utterance
are taken as part of the context relative to which the string under construal is
interpreted:6
(9.1) a. John said he was upset.
b. John left. He was upset.
It applies equally to LINKed structures in which the tree may not be of type t, but
of type e:7
(9.2) a. As for John, he was upset.
b. If John left, he was upset.
It applies in the projection of LINKed structures, irrespective of whether the propo-
sitional structure is processed first, or subsequently. In either case, it is that which
is processed first which constitutes part of the context for the building of the LINK
transition and the construction of the copy. This was the basis of our account of
Japanese head-internal relatives, accounting for the way in which both the matrix
arguments are identified in (9.3), despite the structural configuration:
(9.3) John-ga sanko-no ringo-o muita-no-o tabeta
JohnN OM threeGEN appleACC peeledno−ACC ate
‘John ate the three apples which he peeled.’
It was also the basis of the account of right-dislocation effects such as:
(9.4) She talks too fast, that woman from Islington.
It also covers intra-sentential and inter-sentential anaphora for quantified expres-
sions, with the terms under construction freely able to act as terms to be substituted
by the anaphoric process, as long as a scope statement has been entered into the
structure, so that the construction of the term itself is complete (see chapter 3).
(9.5) a. A man left. He was upset.
b. Two men left. They were upset.
c. Every parrot sang a song, which it didn’t understand.
With the characterisation of context as a sequence of trees, of which the last
may be some tree under construction, it might appear that there is a left-right
asymmetry in context construction, which would not apply to cases where the
context may not literally always be prior to the update being carried out, as in
cases of Right-Node Raising:
6 In principle, the characterisation is neutral enough to allow for system-external inferences,
given that any expression of the language of inference can be expressed as a tree. It is well-known
that there is a feeding relation from general steps of inference onto tree structures, since anaphora
resolution may involved bridging cross-reference effects:
(i) I fainted outside the hospital. They operated rightaway.
(ii) The wedding pair looked miserable. He had just come from a hospital appointment, and she
had been fretting that he would be late.
However, at the current stage of development, there are no mechanisms for expressing these, as
the feeding relation necessary between decorations on tree-nodes, general forms of inference, and
the process of tree growth, remains to be defined.
7 See Gregoromichelaki (in preparation), where an account of conditionals is developed in which
the antecedent of a conditional is taken to project a linked structure providing the restriction on
a world variable.
9.2. CONTEXT AND PARSING 323
(9.6) John ignored, but Bill consoled, the woman from Birmingham.
In these, what is imposed as a requirement on the development of the second struc-
ture is a copy of the metavariable used to decorate a node in the first, incomplete,
structure. The right-peripheral expression decorating an unfixed node provides a
value for that copy of the metavariable in the second structure, enabling it to be
completed. Hence, the structure that was started second happens to be completed
before the first structure is completed. Despite this, the concept of context re-
mains time-linear, remaining that of a sequence of trees minimally including the
tree resulting from the task just completed.
There is however one type of anaphora resolution which is not so far covered by
our concept of context: the “sloppy” uses of pronouns (Karttunen 1976). This case
requires an extension of the notion of context to include an online lexicon, which,
anticipating the discussion of dialogue, we might call the “dialogue lexicon”.
(9.7) John keeps his receipts very systematically, but Tom just throws them
into a drawer.
To analyse these sloppy uses of pronouns, the pronoun is interpreted as having
as antecedent some term which carries the information that it itself contains an
metavariable. Thus, in the sloppy reading of (9.7) what substitutes for the metavari-
able is not the term previously constructed from parsing his receipts, which we might
represent as:
but a new term that is constructed using the actions by which the earlier term was
itself constructed. These actions include a point at which substitution takes place
of the metavariable projected by the possessive pronoun, his. Given the optionality
inherent in the system, a different choice of substituend may now be taken, yielding
a new term, with (ι, z, Bill′ (z)) in place of (ι, y, John′ (y)), as required.
Taking actions, both computational and lexical, as part of the linguistic context,
plays a much bigger role in ellipsis, where all three different concepts which the
process of tree growth makes available are brought into play – terms, structure,
and actions.
10 See chapter 2, footnote 38 for a characterisation of the domain in which substitution is re-
F o(U), ♦
⇑
F o(John′ )
contained within that term can be identified as the tau term under construction.
This is shown very schematically in (9.13):
(9.13) F o(Ignore′ (WH)(τ, x, Person′ (x)))
F o(WH) F o(Ignore′ )
?T y(e)
?T y(cn) F o(λP.(ǫ, P ))
F o(y) ?T y(cn → e)
F o(U), ♦
⇑ F o(Husband′ )
′
F o(τ, x, Person (x))
The genitive by definition projects a dependent epsilon term - i.e. one in which
the introduced term takes narrow scope with respect to the term contained in the
determiner specification. In the case of their husband, this means that an epsilon
term is projected of the form ǫ, y, Husband′ (U)(y) (see Kempson et al. (2001:144-
148) for a variant of this analysis). Parsing their husband and a logical form with
attendant scope statements can then be completed, to yield:
S < x < y F o(Ignore′ (ǫ, y, Husband′ (τ, x, Person′ (x))(y))(τ, x, Person′ (x)))
Finally, what might, on the face of it, seem to be data that are problematic for
the current analysis turn out to be commensurate with the account, as long as we
assume, as above, that actions just used in some context can be re-used as part
of the information provided by the context. The relevant data are the so-called
sloppy ellipsis construals, where, apparently, some linguistic form is re-used with a
modified interpretation:
(9.14) Q: Who upset his mother?
Ans: John.
In these cases, the fragmentary reply appears to require some retraction from the full
specification of the context to establish a predicate λx.Upset′ (ǫ, y, Mother′ (x)(y))
11 This dependency could be spelled out in full following the pattern set out in chapter 3, but
here we leave these names in their shorthand form where the dependence is graphically displayed.
9.2. CONTEXT AND PARSING 327
and reinterpret that predicate with respect to the term presented by the fragment.
Since Dalrymple et al. (1991), it has been assumed that such a process of abstrac-
tion applies on what is provided in the context to create the relevant predicate.
This is then applied to the new term. In DS, this would have to be defined as a
special operation on contexts. While this is formally a reasonably well-understood
possibility, it is unattractive in this framework as it would be incompatible with the
upwardly monotonic tree-growth process of construal: some aspect of the content
of the formula in context has apparently to be removed in order to yield what is
required for the next update.
An alternative is simply to presume that the actions used in establishing that
construal of the first utterance are themselves available in the context, and so can
be re-used just as terms or structure can be. The actions that have to be applied in
the construal of John in (9.14) as ‘John upset John’s mother’ are identical to what
the context makes available. The sloppy ellipsis construal arises because amongst
the actions so reiterated from the context is the projection of a metavariable to be
identified as before from the subject argument, which in the new context provided
by the just parsed subject expression John will give rise to the new predicate ‘upset
John’s mother’. The display in (9.15) shows the analysis of the question in (9.14)
while (9.16) recapitulates the actions undertaken to build the predicate node from
a parse of upset his mother (ignoring tense).
(9.15) Parsing Who upset his mother?:
F o(Upset′ (ǫ, x, Mother′ (WH)(x))(WH)), ♦
F o(U)
⇑ F o(Mother′ )
F o(WH)
IF ?T y(e → t) upset
THEN make(h↓1 i); go(h↓1 i); put(F o(Upset′ ));
go(h↑1 i); make(h↓0 i); go(h↓0 i); put(?T y(e));
make(h↓1 i); go(h↓1 i); his
put(λP.ǫ, P ); go(h↑1 i);
(9.16)
make(h↓0 i); go(h↓0 i); freshput(x); go((h↑0 i); ⇐=
make(↓1 ↓0 ); go(↓1 ↓0 ); put(F o(U, T y(e)));
SU BST IT U T E(α, α ∈ TContext ); go(h↑′ i); mother
make(h↓1 i); go(h↓1 i);
put(F o(Mother′ ), T y(e → (e → cn)))
The fragmentary reply John then decorates the subject node of a propositional tree
and the actions in (9.16) build the rest of the tree. Crucially, at the point where
328 CHAPTER 9. GENERAL PERSPECTIVES
F o(Bill′ ) ?T y(e → t)
T y(e → t),
F o(x), ?∃x.T n(x) F o(John′ ) F o(DO),
?∃x.F o(x), ♦
What the parsing of everyone John did in (9.2.2) has provided is, in sequence: the
partial term under construction got by parsing everyone, with its fresh variable x;
the induction of a LINKed structure with a copy of that variable x at an unfixed
node; a subject node decorated by F o(John′ ), and then a predicate node decorated
by a place-holding metavariable, DO. The tree cannot at this point cannot be com-
pleted, as there is no node with which the unfixed variable can Merge. However,
Late *Adjunction may apply to the predicate node and this provides the right
context for the sequence of actions in (9.19) to be triggered, these actions being
derived already from the earlier parse of the verb interviewed.
IF ?T y(e → t)
THEN make(h↓1 i); go(h↓1 i);
(9.19) put(F o(Interview′ ), T y(e → (e → t)))
go(h↑1 i); make(h↓0 i)
put(?T y(e))
9.2. CONTEXT AND PARSING 329
These actions provide all that is needed to complete the LINKed structure and build
up a composite restrictor for the tau term already introduced, since the unfixed node
decorated the variable can now Merge with the internal object node. The final
result is, as required:
S < x F o(Interview′ (τ, x, (Person′ (x) ∧ Interview′ (x)(John′ )))(Bill′ ))
to be evaluated as:
(Person′ (a) ∧ Interview′ (a)(John′ )) −→ Interview′ (a)(Bill′ )
The whole process of construal is monotonic: nothing needs to be removed by any
process of abstraction.
The analysis has the bonus of extending to other forms of ellipsis which are prob-
lematic for abstraction accounts, since they appear to involve abstraction of some
expression other than the subject, indeed abstraction over an expression within a
relative clause, standardly a strong island:
(9.20) The man who saw John told him his rights, but the one who saw Bill
didn’t.
Again, on the assumption that actions may be re-used, these are quite unproblem-
atic, as the actions that are re-used in setting up the interpretation of the second
define an update relative to the extended context which the newly introduced sec-
ond subject expression makes available.
While a lot more needs to be said about ellipsis, we nevertheless take these
preliminary considerations as indicative that the concept of context for the purposes
of the construction process has not only to contain sequences of representations of
content, but also sequences of actions, as implemented in the parsing process. This
however does not disturb the concept of context as needing to be defined over LOT
constructs, for these actions are all defined as updates in a tree-growth process.
Context, as now revised, is a record of what information has been constructed to
date, and how it has been presented. Notice that ellipsis is very restricted in its
distribution and is not fully anaphoric in the same sense in which pronouns are. In
general, ellipsis is licensed only with respect to an immediately preceding sentential
utterance. Thus in (9.21a), the ellipsis in the second string is licit and a sloppy
reading can be construed, giving the reading ‘John didn’t love John’s mother’.
However, in (9.21b), the ellipsis is awkward and, if it is construable at all, can only
mean that John did not love Bill’s mother.
(9.21) a. Bill loved his mother. John didn’t.
b. Bill loved his mother. He was very distressed when she died. ??John
didn’t (and held a big party).
We take this to indicate that it is only the set of actions associated with constructing
the immediately preceding propositional tree that forms part of the context. In
effect, we may hypothesize that this reflects short-term memory and the well-known
way that recall of actual preceding utterances quickly degenerates. The definition
of the contexts we suppose to be relevant with respect to parsing a string φ is given
in (9.22) as an ordered list of complete trees and a partial tree constructed over the
current string (which may just be a tree determined by the Axiom). The actions
of the current and immediately preceding trees also form part of the context.
(9.22) C =def < Ti , ..., Tn-1 , < Tn , An >, < Tφ , Aφ >>
where 0 ≤ i; every Tj , i ≤ j ≤ n, is a complete tree without requirements;
Tφ is a partial tree constructed as part of the parse of φ; and Aα is the set
of lexical, computational and pragmatic actions used in constructing Tα .
330 CHAPTER 9. GENERAL PERSPECTIVES
At any such point, where the context turns out not to be shared by the hearer, the
negotiation between speaker and hearer may become momentarily explicit. In the
more usual case, however, there is presumed identity of context, and it is on their
own individually established context upon which speaker and hearer individually
rely.
Further co-ordination between speaker and hearer roles is displayed by what
Pickering and Garrod label ‘alignment’ between speakers. If one interlocutor has
used a certain structure, word or choice of interpretation, then the respondent is
much more likely to use the same structure as in (9.27), rather than a switch of
structure as in (9.28):
(9.27) A: Who has Mary given the book?
B: She has given Tom the book.
(9.28) A: Who has Mary given the book?
B: ??She has given the book to Tom.
(9.29) A: Who was at the bank yesterday?
B: Sue was at the bank until lunchtime, and Harry stayed at the bank all
afternoon.
So there is tight co-ordination between speaker and hearer activities at syntactic,
semantic and lexical levels.
These informal observations about dialogue and the observed interplay between
shifting speaker and hearer roles may seem obvious, even trivial, but, as Pickering
and Garrod (2004) point out, dialogue presents a major hurdle for most theoretical
frameworks. The separation of competence from performance, with the articulation
of the former as logically prior, has meant that the data for linguistic modelling
have been sentences taken in isolation. The presumption has been that study of
performance, whether processing or production, can only be couched in terms of
how the principles encoded in such a cognitively neutral body of knowledge is put
to use in the various language-related tasks the cognitive system may engage in.
This separation of the articulation of the system itself from considerations of these
activities means that the formal properties of the system provide no clue as to what
such parsing and production systems might look like, other than dictating that they
must make reference at various points to the system: there is certainly no reason
to anticipate that these various applications might have anything to do with each
other. Reflecting this, models of parsing and production have been quite generally
studied in psycholinguistics as quite separate research enterprises with very little
to do with each other.12
ing and generation over a number of years (following Shieber (1988): see e.g. Neumann (1994)),
but nonetheless, with a direction-neutral grammar formalism, the two systems are defined as dif-
ferent applications of the shared grammar-formalism. One such difference lies in the fact that,
while strictly incremental parsing systems can be defined, generation systems remain head-driven
despite increasing psycholinguistic evidence to the contrary (see Ferreira (1996), Branigan et al
(2000), Pickering and Branigan (1999), Pickering and Garrod (2004), see Shieber et al (1990),
Stone (1998), Shieber and Johnson (1993), Stone and Doran (1997), Webber et al (2003).
332 CHAPTER 9. GENERAL PERSPECTIVES
what the task of the producer has to be is to start from some assumed represen-
tation of content and select some string of words from which a parser can recreate
that same representation of content?
Certainly, what is needed is a mapping from a decorated tree (or more abstractly
some logical representation)13 onto a linearised string from which the tree in ques-
tion could be constructed by a parsing system. However, the surprise is that we
can model this task using the very same rules, following possible parse routines,
with the goal tree merely serving as the filter against which all possible actions by
the generation system have to be matched, despite the fact that neither selection
of words nor their order are uniquely correlated with a given content. All that is
required in addition to the parse routine is that the choice of words to be selected
by the generation device must meet the requirement that they progress the building
of the goal tree. Reflecting these informal statements, we assume that in genera-
tion, the very same computational actions initiate the development of some tree,
but this time, each update step has to meet the restriction of not merely being an
enrichment as in parsing, but of being a precise form of enrichment – a sequence
of progressive enrichments towards yielding a particular tree, the selected goal tree
representing the interpretation to be conveyed.14
Though this is in principle straightforward, given that the parsing steps them-
selves are defined to be progressive enrichments, the search through the lexicon
for appropriate words becomes a major task. Seen overall, production involves a
sequence of triples: a partial string, a partial tree, and a complete tree. The first
step comprises an empty string, the Axiom step (or some other partial tree), and
the goal tree; the last step comprises some complete string, and two occurrences
of the goal tree.15 All steps in between these two involve updating the previous
partial tree, commensurate with the goal tree filter. For any update that passes
the filter, the selected word can be uttered, or, in writing systems, added to the
output partial string. Even in a simple case such as uttering John snores, given a
binary branching semantic structure, there is in principle two successive searches
of the lexicon to find words that provide the appropriate update to the opening
partial tree. And this is only the beginning, as for any one tree, there may be
several linearisation strategies. Even in such a simple case as this, there are two,
since the noun John could have been taken to decorate an unfixed node which then
merges with the subject node once that is subsequently constructed. So, with two
such parsing strategies, there would equally be at least two production strategies,
against each of which appropriate searches of the lexicon have to be set up, even for
this simple goal tree. This poses a substantial computational problem, particularly
for a theory committed to the account being incremental.
of content the goal tree. A goal tree is a tree with all requirements on nodes satisfied, and with
T y and F o decorations at each node.
14 The suggestion for this design of a generation system relative to DS assumptions was suggested
to Kempson, Otsuka and Purver by Matthew Stone (personal communication), and subsequently
developed by Otsuka and Purver.
15 In principle, a production task might start from any partial tree, but, to give the general ar-
chitecture initially we start from the simplest case, a string which constitutes a complete sentence.
16 One suggested solution to this problem in Otsuka and Purver (2003), Purver and Otsuka
(2003) involves restricting solutions to words representing terms that decorate terminal nodes in
three, selected from the lexicon as a first pass. This account has the disadvantage of not being
9.3. DIALOGUE AS A WINDOW ON PRODUCTION 333
variable as F o value will trivially satisfy the subsumption constraint, apparently without any reflex
of the parsing step whereby it might be interpreted. If this were the correct analysis, pronouns
would not be good evidence of the context-dependency of natural-language production: to the
contrary, pronouns would constitute a trivial production step, always licensed no matter what is
being talked about. However the presence of the requirement ?∃x.F o(x) in the lexical specification
of the pronoun forces the provision of a value in the building up of a logical form in all well-formed
strings containing a pronoun, hence also forcing the contextual provision of a value in production
on the assumption that production steps incrementally match the corresponding parse steps.
18 This account of pronoun generation has as a consequence that the utterance of a pronoun
in the context of a preceding sentence containing two possible antecedents will be ambiguous, a
problem which extends also to ellipsis. We take this to be correct. The process of substitution
which provides interpretations of pronouns is taken to select a value out of some arbitrary minimal
context, and where two antecedents are provided in any such context, two logical forms are derived
and the resulting ambiguity has to be resolved by evaluation of inferential potential (minimally
passing a consistency check).
334 CHAPTER 9. GENERAL PERSPECTIVES
19 This specification ignores the requirement for the preposition to in the indirect object con-
(9.31) give
IF ?T y(e → t)
THEN EITHER (Double Object)
make(h↓0 i); go(h↓0 i); put(?T y(e));
go(h↑0 i); make(h↓1 i); go(h↓1 i);
put(?T y(e → (e → t))); make(h↓1 i); go(h↓1 i);
put(T y(e → (e → (e → t))), F o(Give′ ), [↓]⊥);
go(h↑1 i); make(h↓0 i); go(h↓0 i); put(?T y(e))
OR (Indirect Object)
make(h↓1 i); go(h↓1 i); put(?T y(e → (e → t)));
make(h↓1 i); go(h↓1 i);
put(T y(e → (e → (e → t))), F o(Give′ ), [↓]⊥);
go(h↑1 i); make(h↓0 i); go(h↓0 i); put(?T y(e))
go(h↑0 ih↑1 i); make(h↓0 i); go(h↓0 i); put(?T y(e))
ELSE Abort
ELSE Abort
This is all we need to account for such so-called syntactic alignment, given that
it is the macro of update actions for a word which, once retrieved for parsing (or
production), gets added to the context. For once either of these specifications gets
selected, it is that pairing of word and update-specification that will get entered
into the context, to be retrieved by the second occurrence of the word. As with
instances of repetition, it is the stack of lexical actions that gets searched first to
see whether there are items that can project the required subsumption relation to
the goal tree; and in all cases, as long as one sub-case of the lexical specification is
entered in that active space, the other case will never need to be retrieved, exactly
analogous to the case of semantic alignment with ambiguous expressions.20
Such a recall of actions might, indeed, be made use of for a whole sequence
of words, giving rise to what Pickering and Garrod call ‘routinization’, so that an
extended sequence of actions may still be stored and retrieved as a unit. The per-
spective in which the speaker has no choice but to manipulate the same actions
as the hearer yields the alignment facts directly. The additional significance of
this result is that the multi-level alignment of apparently discrete levels of syn-
tax, semantics and lexicon, which are indeed taken by Pickering and Garrod as
evidence that these constitute independent levels, is here transformed into a single
phenomenon. The very success of the account thus confirms the contrary assump-
tion that syntactic, semantic, and lexical forms of specification do not constitute
discrete forms of representation.
Finally, and perhaps most significantly, given the problem it poses for other
frameworks, we expect that utterances can be shared. Both in parsing and produc-
ing an utterance, a single tree is constructed. If before completion, the hearer can
leap to the conclusion of what is being said, she can complete the partial tree she
has up till then constructed as a parser, and so shift into being a producer where
the task is to complete that tree against an already completed goal tree. Equally,
the speaker can shift into being a parser, as she, too, has a partial construction of
that tree. On this view, this is merely a shift from providing the input for updat-
ing some partial structure, to analysing some provided input with which to update
that very same partial structure. There is no major shift from one performance
mechanism to some entirely distinct mechanism. Unlike in other frameworks, the
20 Ferreira (1996) provides confirming evidence for preferring incremental accounts of production
over threshold activation accounts in which putative strategies are simultaneously activated but
progressively eliminated.
336 CHAPTER 9. GENERAL PERSPECTIVES
is notably confirmed by current research on production, both in outline and in detail: see the
ongoing work of V.Ferreira and colleagues, and also Pickering and Branigan (1999), Branigan et
al (2000).
9.4. CONTEXT AND WELL-FORMEDNESS 337
with a given position in a hierarchical semantic structure should ever occur at some
initial (or final) edge, from which its interpretation is not optimally projected. We
would then have to return to some equivalent of the movement account, with a
heterogeneous listing of different types of anaphoric expression, different types of
relative clause, and different left and right periphery effects. We would, in fact, be
back at our point of departure, with no reason to expect interaction of anaphoric
and structural processes. On the DS view, as we have seen, these various phenom-
ena emerge as by-products of the adopted stance.
This relative success of the parsing-oriented perspective reflects a deeper dif-
ference between the two approaches. On the Phillips view, there is no concept
of compositionality definable for natural language strings. With no concept of
structural underspecification to hand, constituency on his view is not a property
retainable at the output level of logical form (LF). Assignments of constituency
are established immediately upon their left-to-right projection on the basis of the
partial string under analysis; but these are then revised, with no necessary trace
of that constituent at the output LF. On the DS view, to the contrary, there is
a well-defined concept of compositionality, albeit distinct from the traditional one
(see section 9.1). It is is the cumulative contribution to the building up of a logical
form over a left-to-right sequence to a result in which no requirements as introduced
during the derivation of the remain outstanding. And so, as promised at the outset,
we take the arguments presented here to demonstrate to the contrary that parsing
is the basic mechanism.
(9.32) a. is a man
b. Wendy is thinking of
c. reviewing a chapter on those
d. is to be too
These strings all provide mappings from some appropriately defined partial tree to
an output partial tree; and the sequence of words provides the appropriate lexical
update actions that make such a tree update process possible. So, relative to some
partial tree as context, they can contribute as a sub-part of a well-formed sequence
of words.
This characterisation of a well-formed sub-string already distinguishes these
from impossible strings.
(9.33) a. *Man the died.
b. *Died man.
c. *is the often man.
In (9.33a), the computational actions of the system do not provide the appropriate
trigger for parsing man, and the lexical actions induced by the parsing of man
do not provide as output the appropriate node specification for parsing of the. So
there is no partial tree with a pointer at an appropriate node to provide appropriate
input for the parsing of died. A similar basis underpins the lack of well-formedness
of (9.33b,c). No general discourse-based concept of context is needed to distinguish
such strings such as those in (9.33) from those in (9.32): all that is required is the
mapping from one partial tree to another.
in the application of Elimination. This rule is subject to the constraint that no mother node can
have Formula decorations attributed at that node unless all requirements on the daughter nodes
are complete (see the discussion of the Right Roof Constraint, chapter 5, section 5.4).
340 CHAPTER 9. GENERAL PERSPECTIVES
(9.36) Some newlyweds have arrived. She isn’t looking too happy, though.
This concept of well-formedness preserves the unified concept of anaphora that
we espouse. Consider the sentences in (a)-(b) as a sequence:
(9.37) a. John came in.
b. He sat down.
The well-formedness of each of these strings can be established as follows. We begin
with a null context (an impossibility but a simplifying assumption) and derive the
context {T(9.37a) } consisting solely of the tree derived from parsing the string in
(9.37a). This forms the input context to the parsing of (9.37b):24
−−−−→ −−−−→ −→
(9.38) ∅ (9.37a) {T(9.37a) } Axiom {T(9.37a) , T?Ty(t) } he {T(9.37a) , The }
Substitution operates to replace the metavariable projected by the pronoun from
this new context to give: {T(9.37a) , TJohn }. Parsing the rest of string derives the
context {T(9.37a) , T(9.37b) } in which both trees are complete and so (9.37b) can be
considered well-formed relative to having parsed (9.37a).
For a text like (9.39(a,b)) in which the anaphor identifies its substituend inter-
nally, the reasoning goes analogously.
(9.39) a. I saw John yesterday.
b. Janet thinks she’s pregnant.
We begin with the null context again and progressively induce some context {T(9.39a }
through the parse of the string in (9.39a). We then progress through the parse of
the second string incrementally building up the context: {T(9.39a) , TJanet thinks she }.
At this point the only suitable substituend is derived internally to the current tree
being developed to give {T(9.39a) , TJanet thinks Janet }. The rest of the string is parsed
to give a context {T1 , T(9.39b) } in which both trees are complete. (9.39b) is thus well-
formed as a string. Notice that for this characterisation of well-formedness to hold,
the string must be complete, and a fully specified interpretation assigned; hence
at the juncture of its evaluation, the tree T which constitutes its interpretation is
itself part of the derived context.
On this characterisation we are not saying that she is pregnant is a well-formed
utterance relative to the context provided by the parsing of the incomplete sentence
Jane thinks that, even though the parsing of this substring provides the context
structure whereby the metavariable provided by she is identified, for she is pregnant
is a substring of Jane thinks that she is pregnant. Well-formedness of complete
strings is a property of utterances relative to a particular kind of context, one
where the trees are complete. This restriction to complete trees correctly yields the
result that right-node raising conjuncts cannot be uttered in isolation from some
string that provides the requisite structure.
(9.40) A: John likes
B: And Bill dislikes, that doctor from Washington.
Neither A’s nor B’s utterances are well-formed as complete utterances, even though
they are well-formed as substrings. In A’s utterance, the result is not a complete
tree. In B’s utterance, the context upon which its interpretation depends is not a
complete tree. (9.40) is only interpretable as a collaborative utterance, as we would
expect (see section 9.3 for discussion of such shared utterances). Similarly in the
phenomenon of mixed utterances:
24 T
X is a diagrammatic shorthand for a partial pointed tree, where X indicates how far the
parse has proceeded through the string from which T is being constructed.
9.4. CONTEXT AND WELL-FORMEDNESS 341
Is this a problem? We do not think so, given that what we are characterising
are well-formed utterances and not well-formed strings. Such utterances may not be
canonical and have a very restricted distribution, but once the concept of context is
taken seriously such a conclusion is unavoidable. Furthermore, since such strings are
only acceptable in very restricted contexts, it is possible in principle to capture this
by specifying that any tree that results from using the actions used in constructing
some other tree involves a non-canonical string. Nothing, however, follows from this.
Our definition of well-formedness allows an utterance to be well-formed whether or
not the number of contexts within which it is well-formed is very small. It may be
possible to provide a notion of degree of acceptability according to the nature of
the contexts with respect to which some string is uttered. However, it is not clear
that such a move is either possible or, indeed, necessary.
Through all this exploration of different concepts of well-formedness, one prop-
erty has remained constant. The concept of well-formedness does not express appro-
priacy of uttering one sentence after another: there is no attempt to express what
is an appropriate answer to a question, or what is a cohesive sequence of dialogue.
We are concerned solely to articulate what is intrinsic to ability in a language, in-
dependent of any such constraints. It is in this respect that the framework is not
providing a model of pragmatics. Accordingly, the concept of well-formedness is
expressed solely in terms of being able to derive a complete propositional formula,
with no requirements outstanding. Despite its context-dependence, it does not in-
corporate any implementation of pragmatic constraints, nor any reflection of such
constraints.
Thus the decision to define a concept of what it means for a utterance to be
well-formed relative to some fixed context has led to a richer overall account of
well-formedness for natural languages.
What differs in the two types of derivation that the speaker and hearer might
construct is the context that they severally bring to the communication task: the
speaker has much richer information about the individual in question than the
hearer. In the event, neither speaker nor hearer cares, because each, by analysis,
is making decisions relative to their own context; and in both cases a successful
result is derived. In being asked to make judgements in isolation, on the other
hand, informants are generally asked to do so without any context provided. In
evaluating such a sentence, they can nevertheless see that as speaker they might
have one context, which as hearer they would not. Now, as competent speakers of
the language, they “know” that the overwhelmingly general pattern in successful
dialogue is for speaker and hearer to share a common context in virtue or parsing
or producing by the same mechanisms.27 Accordingly, informants assume that the
sentence will give rise to problems in any utterance situation, and hence not be
fully well-formed. And so it is that when the speaker’s and hearer’s contexts for
the actions that they take are judged to come apart, judgements of well-formedness
become less secure. Notice how our characterisations of parsing and production
in terms of constructing partial representations relative to the individual’s own
context are providing a basis both for expressing both the successful use of such
strings, and yet nevertheless the judgement by informants that these strings are
not acceptable. Furthermore, given that what we require our concepts of well-
formedness to reflect is successful use of language, we take the characterisation of
such strings as well-formed to be correct.
at this juncture it is the threat of a gap between on-line parser and the grammar
formalism which is of concern.) Even in parsing some first constituent, say that
receptionist, from what we have set out in previous chapters, the parser has at least
three alternatives, to parse the noun phrase as decorating a fixed subject node, as
in That receptionist distrusts me, to parse it as some unfixed node to be updated to
a fixed position later, as in That receptionist, I distrust, or to parse it as decorating
the topnode of some structure, to be treated as linked to what is constructed as some
remainder, as in That receptionist, I distrust her. Because there is no lexical basis
for differentiating the choices to be made at this first step, it might look as though
we have to posit the construction not just of one, but at least three structures
in tandem, progressively decorating each one of them until such time later in the
parse at which all but one of them will have been rejected as putative candidates
by some filtering process yet to be formally modelled. Unless the window in which
parallel structures are sustained is very small, it looks as though we will have a gap
opening up between parsing on-line and the grammar formalism which threatens to
undermine the central claim of the theory. But it turns out that contributory factors
determine swiftly the selection to be made, for the implementation of pragmatic
constraints is not applied to the output of what is provided by the grammar (as in
other frameworks), but at each step in the construction process as provided by the
system.
Consider the difference in uttering that receptionist as subject and as a displaced
object. These are immediately distinguished by intonation: if an unfixed node is
to be built, as in That receptionist, I distrust, there will be a marked intonational
effect distributed across the entire expression indicating, at the very least, that the
process by which the resulting tree is to be built is not a canonical one. Moreover,
intonation is a para-linguistic feature, distributed right across a constituent and
identifiable from the outset of the parse. By contrast an utterance of That recep-
tionist distrusts me with neutral intonation will induce the canonical strategy (for
English) of Introduction and Prediction. Consider also the difference between
uttering that receptionist to be construed as subject, and uttering that receptionist
to be construed as decorating a structure related to the remainder only through
anaphora (That receptionist, I distrust her ). Such Hanging Topic Left Dislocation
Structures are invariably reported as requiring a distinctive intonation pattern.
Whether the intonation itself is sufficient to distinguish the effect of building an
unfixed node from that of building and decorating a linked structure is surely un-
certain, but this is not necessary, given that all utterances are interpreted relative
to context. The hallmark of unfixed nodes is that they are constructed in order
to provide some update to what may be independently identifiable as in context, a
distinctiveness from the remaining structure to be constructed which their assigned
intonation buttresses.29 The characteristic use of sentence-initial LINKed struc-
tures, on the other hand, is to provide the context relative to which what follows
is the update. The relation of these two to the context in which they are uttered
is not the same. Hence the combination of intonation and their distinct relation to
how they add to their context is sufficient to determine that, even at the first step
in constructing a representation of content, the hearer has either one, or maximally
two alternative strategies to construct. And with each next presented word, it can
be expected that this choice of alternatives will get narrowed down to the limit.30
29 See Steedman (2000) for exploration of effects of intonation within CCG; and also Kiaer
(in preparation), where this account of topic and focus is substantiated in detail. Kiaer (in
preparation) argues that intonation plays a critical role in enabling a fully incremental account
of Korean. See also Kurosawa (2003) for a discussion of the role of intonation in parsing certain
constructions in Japanese.
30 Of course it might be argued that what this disjunction of strategies shows is that we have
346 CHAPTER 9. GENERAL PERSPECTIVES
The claim we are making, then, is that the choice mechanism, however it is
formalised, does not apply to the output of the grammar. Indeed, the output of
the grammar is the output of the parser also. Rather, it applies to each individual
action of tree growth licensed by the grammar. It has to be granted, however,
that the challenge of modelling any such choice mechanism within Dynamic Syntax
remains entirely to be done.31
ble et al. 1997, Otsuka and Purver 2003, Purver and Otsuka 2003, Purver and Kempson 2004
a,b,c); but, in all cases, these have focused on testing the framework as a grammar formalism,
setting aside the disambiguation challenge.
32 See Tomasello (1999) for the role of imitation in joint attentional activities in language acqui-
sition.
9.6. CODA: ACQUISITION, CHANGE AND EVOLUTION 347
the child’s own lexicon stock is securely in place. So the phenomenon of apparent
mimicry is not so much the child copying whoever is talking to them, as using the
immediate context, given what they have just parsed. As we have already seen,
there is no need to construct higher-order hypotheses about the mental state of
either hearer or producer in successful dialogue, and this means that the child’s
system does not need the facility for representing such complex structures to be in
place before the child can be said to demonstrably have acquired language. These
can emerge later in more sophisticated types of communication, inducing indirect
pragmatic effects.33
What is it that the child has to learn, on this view? Like every other formalism,
the answer is, at least in part, a lexicon. Given the commitment to a grammar
formalism that reflects general parsing capacity, one thing is certain. No procedure
for developing semantic representations along a growth dimension can be available
to one language but not to another. As a parsing architecture, such principles
of growth are arguably hard-wired, and certainly available by definition. For the
analyst, it is only a matter of knowing where to look in each individual language
to find a strategy which is generally applicable in one language, and apparently
not available in another. For the child, what is required is to learn the relative
weighting of the devices the system makes available, the particular balance between
computational and lexical actions as expressed in the language to which he/she is
exposed. But, because of the ability to manipulate the immediate context, this
stock of information does not need to be securely in place before any production
activity whatever can get started, and we allow that the process of establishing this
appropriate balance can be gradual.
When it comes to the possibility of parameter settings determining language
acquisition in an individual language (see Chomsky 1986), these have no place in
this framework. The property of tree growth, construed as the basis of both compu-
tational and lexical actions, delimits the range of cross-linguistic variation. Strate-
gies for establishing interpretation may vary across languages only with respect to
whether they are defined as a general computational action (relative to some range
of constraints), or as a lexical action, triggered only by specific forms. No general
strategy is excluded in a language. Put bluntly, there are no parameter switches
that get established early on in language acquisition. In other respects, however,
with its broadly lexicalist stance, the assumptions to be made about language ac-
quisition by DS are not so very far from those of more orthodox frameworks.
When it comes to language change, the difference in perspective dictated by
Dynamic Syntax is more radical, at least in comparison with approaches to lan-
guage change articulated within current syntactic frameworks.34 It is perhaps not
surprising that the problem of language change has not had more attention from
syntacticians. Diachronic explanations, like performance phenomena, depend, ac-
cording to the orthodoxy, on the viability of an independently provided synchronic
account of a language system, which means turning to one’s favourite syntactic the-
ory; but the well-known formal linguistic frameworks are not well set up to explain
language change. The problem with language change is that it is obviously gradual,
and equally obviously involves a complex interaction between system-internal and
pragmatic, hence system-external, considerations. Neither of these properties is
33 As is widely known, such higher-order hypotheses do not emerge until about the age of 6
(see Tomasello (1999) for discussion). However, what the dialogue patterns suggest is that, even
in adult language use, such higher-order hypotheses are constructed only as required, and not
invariably. This point is made in Pickering and Garrod (2004).
34 Most work on syntactic change has been done within the Principles and Parameters or, more
naturally expressible within orthodox formalisms. But with the shift into a system
which articulates linguistic phenomena in terms of alternative strategies for con-
struction of semantic structure within a non-encapsulated system, we have a real
chance to explain language change in a way that is commensurate with psycholin-
guistic considerations.
On the one hand, from a system-internal stand-point, we can see the flexibility
of alternative strategies which the framework makes available in any individual
language as an advantage in language change:35 these allow alternative ways for
either interlocutor to recover the same content without necessarily noticing any
difference in the strategies they have severally employed. We posited just such a
case to account for the Greek specificity effects with clitic doubling in chapter 4.
Flexibility such as this provides a basis for gradual change in a system over time,
without any damage to the effectiveness of an individual system.
However, it is the bringing of dialogue considerations into the type of data which
the grammar can characterise that really opens up the spectrum of possibilities for
providing a formal basis to processes of language change. We can now explore ways
in which dialogue pressures on production choices can trigger syntactic change. To
take the shift from Latin to the modern Romance languages as a case study, we
suggested in passing in chapter 4 that the preposing of clitic pronouns might be
analysed as a lexical calcification of what had been earlier a free process of Local
*Adjunction.36 We can now substantiate this somewhat. It is a very well-known
observation that in Latin, the strong forms ille and ipse regularly occurred early on
in a sequence of noun phrases preceding the primarily final position of the verb; and
this positioning is taken to be the point of departure for what came to be the clitic
pronouns’ pre-verbal position. What was previously a puzzle was how this relatively
left-peripheral positioning could be explained, other than by the general apparent
desideratum that items which are in some sense ‘given’ in the discourse occur before
items which have not, which are ‘new’.37 But we can now see that this is due to the
alignment pressures on lexicalisation choices in production. Strong pronouns are
prototypical examples of context-dependent items, and it is these, as argued above,
which can be used by the speaker to minimise his production costs by using them
to be construed as an item already presented in the dialogue context, saving himself
the task of having to find the precise word. If the context against which the search
for this item is made as small as possible, the speaker has the possibility of further
minimising his production costs, by placing the anaphoric expression he has elected
to use as close to that context as possible. For if he were to do anything else and
place some expression indicating a term new to the discourse before the a context-
dependent expression, the context relative to which this expression is licensed would
no longer be minimal: it would contain not merely the structure containing the
antecedent from which the anaphoric expression is to pick up its interpretation,
but also the new term. So with this constraint of minimising production costs,
directly consonant with relevance constraints, we expect that context-dependent
items will be placed at the left periphery of any clausal sequence. The result is that
there does not have to be a rule or convention determining the positioning of these
strong pronouns, or indeed other anaphoric expressions: it is simply performance
pressures of a regular sort. Such pressures apply to all speakers in every utterance
35 It is independently attested as an advantage in the production task (Ferreira 1996).
36 This is of course a well-established view of clines of grammaticalisation (Hopper and Traugott
1993), but it has to date lacked an overall grammar formalism which provides the background for
sustaining it.
37 The phenomenon of ‘given’ where ordering is not otherwise determined is reported on a broad
cross-linguistic basis. See Uszkoreit (1987) for the same observation in German, Kiss (1987) in
Hungarian, etc.
9.6. CODA: ACQUISITION, CHANGE AND EVOLUTION 349
nomenon of clitics occurring after the verb is called enclisis. See Bouzouita (2001) for a setting
out of the change in the distribution of object clitic pronouns from medieval to modern Spanish
in Dynamic Syntax terms.
40 Though see Fischer (2002) for recorded occurrences of initial clitics in interrogatives in Old
French and Old Italian and a devastating critique of current Minimalist phonological and syntactic
accounts.
41 Relative clauses are taken to project an independent LINKed structure, so relative pronouns
fall into this set as the first structure-building operation in that structure.
350 CHAPTER 9. GENERAL PERSPECTIVES
first identification within the structure to which the enclitic pronoun contributes
its update actions.
Despite the functional motivation of placing ‘given’ items before items which
are ‘new’, there is a benefit to the learner in such distribution becoming encoded,
for, in so doing, it becomes deterministic, hence learnable by successive generations
without any of the hit and miss property of general dialogue pressures. ¿From
a system-internal perspective, the resultant situation is nevertheless non-optimal.
A complex disjunction has to be learned as a list by each succeeding generation.
We would therefore expect that simplification would take place, and quite rapidly.
This is exactly what has happened. The shift from Medieval to Renaissance Spanish
and the shift from there to modern Spanish can both be analysed in DS terms as
instances of progressive loss of lexically stipulated restrictions.42 The optimal state
would be a fully determined single position, as an agreement device, syntactically
adjacent to the verb, and phonologically defined as allowing nothing intermediate.
The current Romance languages are not quite there, but, in having clitics in finite
clauses invariably preverbal and invariably cliticising onto the verb, they are getting
a lot closer.
This account effectively reconstructs the well-known grammaticalisation cline
from anaphora to agreement, but with the new twist of being from a formal basis.
Lack of reversibility, which is a current point of controversy in diachronic syntax
(Campbell 2001, Hopper and Traugott 1993), is expected on this view: syntactic
change is expected to be progressively towards some optimum state relative to
a learnability criterion, on the assumption that such an optimum will act as an
attractor for shift from an earlier more complex specification to a finally transparent
one.43 There is a lot more to be said: indeed this gives little more than a hint of a
new way one might formally model syntactic change, but it is notably directly in
line with traditional informal views of language change.
Finally, and most broadly, this takes us to language evolution. With the trans-
formation of syntax into the progressive construction of semantic representation
following time-linear dynamics of parsing, with production essentially the same de-
vice, the evolution of language begins to have the air of inevitability. Of course this
is a vast area to explore, and some questions remain the same: how the capacity to
recognise and produce signals emerged, how and whether the construction of com-
plex concepts is independent of language itself; and so on. Other questions, though,
are new. In particular there is the role of alignment in the co-emergence of produc-
tion and parsing skills.44 And a number of other questions melt away. In particular,
there is no question to answer about how a capacity independent of parsing and
production comes to be in place. There is no longer any need to explain how the
separate and mismatching properties of syntactic and semantic mechanisms of such
a capacity could have emerged, and whether the requisite sub-systems could have
been acquired by a general reasoning device in the first place and then become
innate. Even the issue of how some encapsulated vocabulary for syntax consti-
tutive of the language faculty could be in place prior to any language emergence
evaporates, and, too, the question, for the alternative inferentialist view, of how
specialised vocabulary for syntax could have emerged from some general inferential
mechanism. On the new view, the vocabulary in terms of which structural proper-
42 As Bouzouita (2001) demonstrates, given the DS form of lexical specification, these forms of
there needs to be evidence of movement towards some justified optimum (see Sober 1991).
44 See Arbib forthcoming for the exploration of how mirror-neurone properties of the brain led,
ties of natural language are expressed is simply that required to express structural
properties of the language of thought (LOT ), with the added dimension of defining
properties of growth of such structures, a dimension which is in any case general
to the interpretation of all forms of input, not merely language. And this means
of representing LOT formulae and their structure would be part of the representa-
tional device for any inferential mechanism, whether general or specialised. And,
if the recursive complexity of thought is not contentious,45 then it follows that the
emergence of a language capacity reflecting this property is much less of a surprise.
On this view, all that has to evolve is the manipulation of sound-sequences, and
what systematic clusters of such sequences represent.46 At this point we leave all
these issues entirely open. All we can say – but with confidence – is that a new
story of language evolution is begging to be told.
45 See Hauser et al. (2002) for arguments that it is the recursive complexity of natural language
syntax which is the sole irreducible characteristic of the human capacity for language.
46 There are many points of contact between this view of language evolution and that proposed
REFERENCES
353
354 CHAPTER 10. REFERENCES
lection of Essays on Definite and Indefinite Descriptions and Other Related Phe-
nomena (pp. 558-84). Oxford: Oxford University Press.
Kempson, R., Meyer-Viol, W. and Gabbay, D. (2001). Dynamic Syntax. Oxford:
Blackwell.
Kempson, R., Meyer-Viol, W. and Otsuka, M. (2003). Growth of logical form: the
dynamics of syntax. In J. Peregrin (Ed.), The Semantic Turn. Oxford: Elsevier.
Kiaer, J. (in preparation). Incremental Parsing in Korean: At the Syntax-Phonology
Interface. Doctoral dissertation, King’s College London.
Kibble, R., Meyer-Viol, W. and Marten, L. (1997). Incremental interpretation us-
ing a tree description language. Amsterdam Colloquium 11, 199-204.
Kim, J. (1969). Events and their descriptions: some considerations. In N. Rescher
et al. (Eds.), Essays in Honor of Carl G. Hempel (pp. 198-215). Dordrecht: Rei-
del.
Kim, J. (1976). Events as property exemplifications. In M. Brand and D. Walton
(Eds.), Action Theory (pp. 159-177). Dordrecht: Reidel.
King, T. H. (1995). Configuring Topic and Focus in Russian. Stanford: CSLI.
Kiss, K. (1987). Configurationality in Hungarian. Dordrecht: Reidel.
Kiss, K. (2003). Argument scrambling, operator movement and topic movement in
Hungarian. In S. Karimi (Ed.), Word Order and Scrambling (pp. 22-43). Oxford:
Blackwell.
Knill, D. and Richards, W. (Eds.) (1996). Perception as Bayesian Inference. Cam-
bridge: Cambridge University Press.
Koopman, H. and Sportiche, D. (1983). Variables and the Bijection Principle. The
Linguistic Review 2, 139-160.
Kracht, M. (2002). The combinatorics of cases. Research in Language and Com-
putation 1, 59-97 .
Krifka, M. (1995). Swahili. In J. Jacobs, A. v. Stechow, W. Sternefeld, and T. Ven-
nemann (Eds.), Syntax: Ein internationales Handbuch zeitgenössischer Forschung,
Vol. 2 (pp. 1397-1418). Berlin, New York: Walter de Gruyter.
Kural, M. (1997). Postverbal constituents in Turkish and the Linear Correspon-
dence Axiom. Linguistic Inquiry 28, 498-519.
Kuroda, S. Y. (1999). Head internal relative clauses in Japanese. In M. Muraki
and E. Iwamoto (Eds.), Linguistics: In Search of the Human Mind. A Festschrift
for Kazuko Inoue (pp. 414-429). Tokyo: Kaitaku-sha.
Kurosawa, A. (2003). A Dynamic Syntax Account of the Syntactic and Pragmatic
Properties of Japanese Relative Clauses. Unpublished doctoral dissertation, King’s
College London.
Ladd, R. (1996). Intonational Phonology. Cambridge: Cambridge University Press.
Ladusaw, W. (1983). Logical form conditions on grammaticality. Linguistics and
Philosophy 6, 389-422.
Lamarche, J. (2003). Be the one. In M. Weissberger (Ed.), Proceedings of Sinn
und Bedeutung 7.
Lambek, J. (1961). On the calculus of syntactic types. In R. Jakobson (Ed.),
Structure of Language and Its Mathematical Aspects, Proceedings of the Symposia
in Applied Mathematics XII, American Mathematical Society (pp. 166-178).
Landman, F. (1986). Towards a Theory of Information. Dordrecht: Foris.
Landman, F. and Grosu, A. (1998). Strange relatives of the third kind. Natural
Language Semantics 6, 125-170.
Lasersohn, P. (1995). Plurality, Conjunction and Events. Dordrecht: Kluwer.
Lasnik, H. (1995). Case and expletives revisited. Linguistic Inquiry 26, 354-365.
Lasnik, H. and Stowell, T. (1991). Weakest crossover. Linguistic Inquiry 22, 687-
720.
360 CHAPTER 10. REFERENCES
Index
367
368 CHAPTER 11. INDEX
Ambiguity
Lexical
Processing
Quantifier-scope
Structural
Anaphor: see pronoun construal
Anaphora
Cross-sentential
Intra-sentential
Locative
Nominal
In interaction with long-distance dependency
In interaction with right-periphery effects
Interaction with structural processes: see crossover;
pronoun construal, resumptive;
dislocation, left;
dislocation ,right;
scrambling
See also: Pronoun
Antecedent, for anaphora
Arabic
Argument
Associate: see noun phrase, post-copular
Bantu
Bemba
Binding principles, see pronoun, locality constraint on
Bottom restriction, loss of : see pronoun, expletive
Case, relation
Accusative
Constructive
Dative
Morphological
Nominative
Requirement
Specification
Cataphora : see pronoun construal, expletive
Categorial grammar
Combinatory (CCG)
Type-logical (TLG)
Type-theoretic (TTG)
Chichewa
Clitic
Indirect object
Object
Subject
Competence
Complement clause
Complementiser
Compositionality
Constraints
Complex NP
Island
369
Local
Proper Binding
Right Roof
Content
Name
Partial
Propositional
Representation of
Context
Cognitive
Definition of
Dependence
Immediate
Lexical
And relative-clause construal
And wellformedness
Coordination
Analysis
Swahili
And agreement
And LINK
And Right node Raising
Copula
Equative
Existential
Intransitive
Locative
Predicational
Presentational
Specificational
Crossover
Extended strong
Strong
Weak
Weakest
Declarative unit
Definite
Article: see determiner, definite
Epithet
Noun phrase
Demonstrative
Dependency, long distance
Determiner
Definite
Indefinite
Dialogue
Alignment
Parsing in
Production in
Discourse representation theory (DRT)
Dislocation
Clitic Left
370 CHAPTER 11. INDEX
Intonation
Iota term
Irish
Italian
Japanese
Knowledge of language, see linguistic knowledge
Korean
Labels
Formula
Scope
Treenode
Type
Language
Acquisition
Change, syntactic
Evolution
Formal
Free word-order
Logical
Of thought
Of tree description
Verb-final
Verb-initial
Verb-medial
Language processing, dynamics of
Latin
Left-peripheral: see typology, left-dislocation
Left-periphery
Lexical actions, see actions, lexical
Lexical functional grammar (LFG)
Lexical item
Lexical meaning
Lexicon
Linear order: see word order
Linear precedence, see word order
Linguistic knowledge
Linked structure
Locality
Logical form
Luguru
Malayalam
Meaning, representation of
Metavariable
Formula
Predicate
Term
Minimalist Program
Movement processes
Modality, underspecified see tree relation, underspecified
Names
Arbitrary
Proper
372 CHAPTER 11. INDEX
Natural deduction
Negation
New (information)
Node
Address
Argument
Completed
Description
Functor
Locally unfixed
Mother
Non-terminal
Predicate
Restriction, terminal
Root
Subject
Terminal
Unfixed
Nominal
Noun
Lexical entry
English
Japanese
Arabic
Swahili
Japanese
Class, Swahili
Plural marking
Singular marking
Noun Phrase
Conjoined
Definite
Indefinite
Left-dislocated
Post-copular Quantified
Referential
Right-dislocated
Subject
Operator
Epsilon
Iota
Kleene*
Lambda
Modal
Tau
Parsing: see processing, dynamics
Perception
Performance
Pointer
movement
Pragmatics
Predicate
373
Predicate logic
Formulae
Proof
Prepositional Phrase
Presupposition
Pro-drop
Full
Subject
Production
Context-dependence of
Costs
Dependence on parsing
Task
See also: Generation
Pronominal, see: pronoun
Resolution
VP
Pronoun construal
Clitic, object/accusative
Subject
Doubling Expletive Interrogative
Intrusive
Locality constraint on
Locative
Reflexive
Relative
Resumptive
Sloppy
Strong
Quantification
Mixed
Strong
Quantifier
Existential
Generalised
Scope, indefinite
Universal
Quantifying expression
Questions
Functional
Wh
Reconstruction
Relative clause construal
Correlative
Head-initial
Head-internal
Head-final
Non-restrictive
Restrictive
Relativiser, see pronoun construal, relative
Relevance
Cognitive prinicple
374 CHAPTER 11. INDEX
Communicative prinicple
Relevance theory
Representation
Levels of
Semantic
Representation language
Requirement
Predicate
Propositional
scope
Modal
Tree node
Formula
Type
Right dislocation: see right periphery
Right Node Raising
Right-periphery
Right roof constraint
Romance
Romanian
Scope: see quantifier, scope
Statement: see transition rule: Scope evaluation
Scrambling
Local, short
Long-distance
Multiple
And quantifier construal
Semantics
Montague
Of formal languages
Truth-conditional
Sentence meaning
Spanish
Medieval
Modern
Specificity
Stress, contrastive
String
Natural-language
Subject
Suffix, role of
Swahili
Tau term
Tense
Specifications
Marker, Swahili
Japaneese
Time-linear processing
Parsing
Production
And grammar
Topic
375
Background
Contrastive
Marking
Topicalisation, see dislocation, left
Transition rules
*Adjunction
Anticipation
Appositive LINK Evaluation
Completion
Elimination
Generalised Adjunction
Introduction
Inverse Local*Adjunction
Late*Adjunction
LINK Adjunction
LINK evaluation
Link Evaluation(conjunction)
Local*Adjunction
Merge
Prediction
Recapitulation
Scope evaluation
Substitution
Term Resolution
Thinning
Tree
Description
Description language
Partial
LINKed
Complete
Growth, goal-directed
Logic, LOFT
Location, underspecified
Relation
Underspecified
Mother
Daughter, functor
Daughter, argument
Modal
Tree adjoining grammar
Truth conditions
Type
Logical
Typology,
Relative
Left-dislocation
Right-dislocation
Underspecification
Content
Formula
Lexical
376 CHAPTER 11. INDEX
Structural
Treenode Update of
Verb
Ungrammatical, see wellformedness
Update
Context
Monotonic
Incremental
Variable
Bound
Meta-variable
Variable-binding
Verb
Auxiliary
Transitive
Lexical entry
English
Japanese
Irish
Greek
Spanish
Swahili
Wellformedness, utterance
In a fixed context
In an arbitrary context
Judgements
Wh expression
Word meaning: see lexical meaning
Word order