Parametric Functional
Parametric Functional
of syntactic universals*
Martin Haspelmath
*I am grateful to Frederick J. Newmeyer, Bernhard Wälchli, and two anonymous referees for
useful comments on an earlier version of this paper.
Martin Haspelmath
structures, and for a whole range of syntactic domains linguists have a good initial
idea of what the limits on structural variation (i.e., the syntactic universals) are.
But why should languages be limited in these particular ways in choosing their
structures? Generative and functional linguists have given very different answers
to this question, but these answers are rarely compared to each other, and the dif-
ferences in the methodological foundations of the two research programmes are
not well understood by most linguists. The present paper aims to elucidate this
situation. I will not attempt to hide my personal preference for the functional ap-
proach, but I will highlight strengths and weaknesses of both approaches.
I should note at the outset that the goal of explaining language universals is
not usually taken as the main goal of generative research. The following quotation
is typical:
The comparative approach in the generative tradition addresses the following
questions: (i) what is knowledge of language? (ii) how is this knowledge acquired?
. . . In order to answer these questions we have to identify which linguistic properties
can vary across languages and which are constant. (Haegeman 1997: 1)
. The main reason for this assessment is the vast gulf separating linguists’ views, as reflected
in current textbooks on different cognitively oriented approaches to syntax, e.g., Taylor (2002)
(on Cognitive Grammar) and Adger (2003) (on generative grammar). It could, of course, turn
out that one of the two approaches is exactly on the right track and the other approach is totally
wrongheaded. But I find it much more likely that, while both capture a few aspects of the truth,
they are both groping in the dark, equally distant from their common goal of mentally realistic
description, which is simply too ambitious.
Parametric versus functional explanations of syntactic universals
It has sometimes been asserted that linguistic universals should not be con-
strued as properties of observable systems of speaker behaviour, but as properties
of mental systems. For example, Smith (1989: 66–67) writes:
Greenbergian typologists try to make generalizations about data, when what they
should be doing is making generalizations across rule systems.
This apparently means that Greenbergians generalize over abstract systems (regard-
less of their cognitive realization), whereas true universals will only be found by
generalizing over cognitive systems. It is easy to agree that a systematic comparison
of cognitive systems would be a worthwhile task, but unfortunately it is totally im-
practical, because we know so little about what these systems might be (see note 1).
Even for well-studied languages like English or Japanese, there are very few things
that cognitively oriented syntacticians agree about, and even within a single research
tradition (such as the Chomsykan tradition), there may be new guiding ideas every
10 years or so that lead linguists to very different analyses of basically the same data.
Thus, cross-linguistic generalizations over known cognitive systems do not seem to
be on the order of the day for the present generation of syntacticians.
Instead, what all comparative linguists do in practice is compare abstract
(Platonic) rule systems that approximately model the observed behavior of speakers
in spontaneous language use and acceptability judgements. This is a much more
tractable task that is clearly within our reach. There are hundreds of descriptions
of languages from around the world that are accessible to any linguist because they
only use widely understood theoretical vocabulary (“basic linguistic theory”), and
these are typically the basis for typological generalizations in the Greenbergian tra-
dition. In the Chomskyan tradition, too, almost all comparisons are based on (what
are openly acknowledged to be) provisional and largely conventional analyses.2
Such analyses may be claimed to be about cognitive rule systems, but in practice this
has little relevance for the comparative syntactician’s work. Those generative syn-
tacticians that are considering a fairly large number of languages have necessarily
based their conclusions also on secondary sources from outside the Chomskyan
tradition (e.g., Baker 1988, 1996; Ouhalla 1991; Freeze 1992; Cinque 1999, 2005;
Johannessen 1998; Julien 2002; Neeleman and Szendrői, this volume), and this kind
of work has not (at least not widely) been rejected by generative linguists.
. The extent to which analyses are based on convention (rather than empirical or conceptual
motivation) is rarely explicitly recognized, but in private, many linguists will frankly say that the
theoretical framework they teach or work in is motivated as much by social convenience as by
their convictions. (A personal anecdote: When I was still a Ph.D. student, a well-known generative
linguist with an MIT background advised me to state my insights in generative terms, so that my
work would be more widely recognized.)
Martin Haspelmath
Most tellingly, generative linguists have usually not refrained from proposing
UG-based explanations for universals that are amenable to functional explanations.
. See Kiparsky (2008) for an attempt to identify a number of properties that distinguish non-
cognitive universals (“typological generalizations”) from cognitive universals (“universals”).
Parametric versus functional explanations of syntactic universals
If their only real interest were the characterization of UG, one would expect them
to leave aside all those universals that look functional, and concentrate on those
general properties of language that seem arbitrary from a communicative point
of view. But this is not what is happening. For instance, as soon as topic and focus
positions in phrase structure had been proposed, generativists began exploring the
possibility of explaining information structure (“functional sentence perspective”)
in syntactic terms, rather than leaving this domain to pragmaticists. Particularly
telling is Aissen’s (2003) discussion of Differential Object Marking (DOM), which
is known to have a very good functional explanation (e.g., Comrie 1981; Bossong
1985; Filimonova 2005). Aissen does not consider the possibility that this functional
explanation makes an explanation in terms of UG superfluous, stating instead that
the exclusion of DOM from core grammar comes at a high cost, since it means
that there is no account forthcoming from formal linguistics for what appears to
be an excellent candidate for a linguistic universal. (Aissen 2003: 439)
Thus, the reason that DOM was not tackled seriously before by generativists
seems to be the fact that the generative tools did not seem suitable to handle it,
until Optimality Theory and harmonic alignment of prominence scales entered
the stage.
I conclude from all this that the topic of this paper, the explanation of ob-
servable (phenomenological) syntactic universals, is highly relevant to both main-
stream generative linguistics and mainstream functional linguistics.4
. A final potential objection from an innatist point of view should be mentioned: If one thinks
that “all languages must be close to identical, largely fixed by the initial state” (Chomsky 2000: 122),
then one should seek explanations for those few aspects in which languages differ, rather than
seeking explanations for those aspects in which languages are alike (see also Baker 1996: ch. 11).
Again, while such an approach may be attractive in theory, it is not possible in practice: While it
is a meaningful enterprise to list all known universals (as is attempted in the Konstanz Universals
Archive, http://ling.uni-konstanz.de/pages/ proj/sprachbau.htm), it would not be possible to list
all known non-universals, and one would not know where to begin explaining them.
Martin Haspelmath
This approach does two things at the same time: First, it explains how children
can acquire language (Haegeman’s second goal), because “they are not acquiring
dozens or hundreds of rules; they are just setting a few mental switches” (Pinker
1994: 112). Second, it offers a straightforward way of explaining implicational
universals, the type of universals that comparative linguists have found the most
intriguing, and that are attested the most widely. If a parameter is located at a
relatively high position in the network of categories and principles, it will have
multiple consequences in the observed system. The simplest example is the order
of verb and object, noun and genitive, adposition and complement, etc.: If the pa-
rameter is located at the higher level of head and complement (assuming that verb/
object, noun/genitive, adposition/complement can be subsumed under these cat-
egories), setting the head-ordering switch just once gives us the order of a range of
lower-level categories, thus accounting for the implicational relationships among
their orderings. Thus, “small changes in switch settings can lead to great apparent
variety in output” (Chomsky 2000: 8; see also Baker 2001a: ch.3 for extensive dis-
cussion of this point).
This research programme proved very attractive, and it is usually described as
“highly successful, leading to a real explosion of empirical inquiry into language
of a very broad typological range” (Chomsky 2000: 8; see also Newmeyer 2005: 40,
in an otherwise very critical context). One even reads occasionally that it provided
the solution for Plato’s Problem (the problem of language acquisition despite the
poverty of the stimulus), so that deeper questions can now be asked (Grewendorf
2002: 99; Chomsky 2004; Fukui & Zushi 2004: 11).5
. But one also reads more cautious assessments. Chomsky (2000: 8) reminds the reader that
the P&P approach “is, of course, a program, and it is far from a finished product . . . one can
have no certainty that the whole approach is on the right track”.
Parametric versus functional explanations of syntactic universals
in the first column and the positive or negative settings in the second column.
There would not be many parameters (“a few mental switches”, Pinker 1994: 112,
“there are only a few parameters”, Adger 2003: 16), perhaps 20 (Fodor 2003: 734),
perhaps 50–100 (Roberts & Holmberg 2005), and at any rate a number compa-
rable to the number of chemical elements in the periodic table (Baker 2001a:
ch. 6). Reducing the description of the syntax of a language from 1300 pages to one
or two pages would truly be a spectacular success that is worth the effort.
Since carrying out the research programme is not an easy task, we may still be
a few decades (or more) away from this ultimate goal. But what progress has been
made over the last quarter century, since the P&P programme was inaugurated?
Certainly one might reasonably expect the following indications of progress:
Whether progress has indeed been made along these lines is not easy to say,
and one’s assessment will of course depend on one’s perspective. Baker (2001a)
offers a highly optimistic view of the progress of generative comparative syntax,
comparing it to the situation in chemistry just before the periodic table of elements
was discovered: “We are approaching the stage where we can imagine producing the
complete list of linguistic parameters, just as Mendeleyev produced the (virtually)
complete list of natural chemical elements” (p. 50). In the following subsection, I
will present a more sober assessment (see also Newmeyer 2004, 2005: 76–103).
the very least one might expect handbooks or textbooks to contain lists of well-
established parameters. But the voluminous Oxford Handbook of Comparative
Syntax (Cinque & Kayne 2005, 977 pages) contains no such list. It has language,
name and subject indexes, but no parameter index. The subject index just allows
one to find references to a handful of parameters. Likewise, textbooks such as
Culicover 1997, Biloa 1998, Ouhalla 1999 (all of them with “parameter” in their
title) do not contain lists of parameters. The only laudable exception is Roberts
(1997), who gives a short list of parameters at the end of each chapter. But in
general it is very difficult even to find out which parameters have been proposed
in the literature.
It seems to me that one reason for this lack of documentation of proposed
parameters is that most of them are not easy to isolate from the assumptions about
UG in which they are embedded, and these differ substantially from author to
author and from year to year. Even if all works stated the parameters they propose
explicitly in quotable text (many do not), these proposals would be difficult to
understand without the context. The highly variable and constantly shifting as-
sumptions about UG are thus a serious obstacle for a cumulative process of acqui-
sition of knowledge about parameters. One would hope that in general the best
proposals about UG are adopted, and that the general changes in assumptions
are in the right direction. But so far it seems that there is too little stability in our
picture of UG to sustain a cumulative process of parameter detection, where one
linguist can build on the discoveries (not merely on the generalizations and ideas)
of another linguist.
It is also not clear whether there are more and more parameters for which
there is a consensus in the field. In the absence of an authoritative list, I examined
16 textbooks, overview books and overview articles of generative syntax for the
parameters that they mention or discuss (Atkinson 1994; Biloa 1998; Carnie 2002;
Culicover 1997; Fanselow & Felix 1987; Freidin 1992; Fukui 1995; Haegeman 1997;
Haider 2001; Lightfoot 1999; Jenkins 2000; McCloskey 1988; Ouhalla 1999; Pinker
1994; Radford 2004; Smith 1989). The result is shown in Table 1.
There are only 7 parameters which are mentioned in these works, presum-
ably because the textbook and overview authors sensibly limited themselves to
less controversial parameters. However, it is difficult to argue that the number of
parameters for which there is a consensus has increased over the years. Let us look
at the main parameters in Table 1 one by one.
The head-directionality parameter is the easiest parameter to explain to
readers with little background knowledge, so it is the most widely used example
for parameters. However, since Travis’s work of the 1980s (e.g., Travis 1989) it has
been clear that the simplistic version of the parameter that the textbooks mention
can at best be part of the story, and Kayne’s (1994) wholesale attack on it has been
Parametric versus functional explanations of syntactic universals
very influential. A big problem is that the Greenbergian word order generaliza-
tions are only tendencies, and parameters can only explain perfect correlations,
not tendencies (Baker & McCloskey 2005). Another problem is that assumptions
about heads and dependents have varied. Determiners used to be considered
specifiers (i.e., a type of dependent), but now they are mostly considered heads.
Genitives used to be considered complements, but now they are generally con-
sidered specifiers. And since the 1990s, almost all constituents are assumed to
move to another position in the course of the derivation, so that it is the landing
sites, not the underlying order of heads and nonheads that determines word order.
Finally, Dryer (1992) and Hawkins (1994, 2004) provided strong arguments that
the factor underlying the Greenbergian correlations is not head-nonhead order,
but branching direction. As a result of all this, not many generative linguists seem
to be convinced of this parameter.
Martin Haspelmath
. The polysynthesis parameter, Baker’s (1996) prime example of a macroparameter, did not
make it on the list because it is hardly discussed in the literature apart from Baker’s work. This
is a great pity, because in my view, the correlations reported by Baker in his ch. 11 constitute the
most interesting typological hypothesis in the entire generative literature.
Parametric versus functional explanations of syntactic universals
(e.g., Vikner 1997). But this latter point is known to be very problematic and con-
troversial (see Roberts 2003 for an overview of verb movement).
The verb movement parameters also illustrate another widespread limitation
of parametric explanations: Very often the range of data considered come from a
single language family or subfamily, & there is no confirming evidence from other
families (this is also deplored in Baker & McCloskey 2005). Thus, Vikner (1997)
considers only Germanic languages plus French (another Indo-European lan-
guage with close historical ties to Germanic), and Roberts and Holmberg’s (2005)
showcase example of a parameter that captures valid cross-linguistic correlations
concerns exclusively the Scandinavian branch of the Germanic subfamily of Indo-
European. Kayne (2000) discusses a number of parameters that are relevant exclu-
sively to Romance.
Another widespread approach is to compare just two widely divergent lan-
guages, and to try to connect the differences observed between them by proposing
a small number of parameters. The most typical example of this approach is the
language pair Japanese/English (see, e.g., Kuroda 1988; Fukui 1995) (but also
French and English, Kayne 1981).
So far it seems that sophisticated parameters such as those that were devel-
oped on the basis of a single family or a pair of languages have rarely (if ever)
been confirmed by evidence coming from unrelated families in different parts of
the world.
Finally, I see no signs of an increasing amount of broad cross-linguistic re-
search in the principles-and-parameters approach. Works such as those by Johan-
nessen (1998), Julien (2002), and Cinque (2005) remain exceptional and have not
been particularly influential in the field. Baker (1988) was really the only truly
influential work that adopts a broad cross-linguistic approach, and although Baker
(1996) proposed a much more interesting parameterization hypothesis, it was
much less well received in the field.
. A similar quote can be found in Baltin (2004: 551): “I have never seen convincing evidence for
macroparameters.” And in a personal e-mail (May 2006), David Adger says: “. . . macroparameters
. . . nice idea but doesn’t seem to work.”
. Curiously, Kayne does not seem to be concerned that by limiting himself to closely related
languages, he runs the risk that he will discover shared innovations that have a purely historical
explanations, rather than properties that are shared because of the same parameter setting.
Parametric versus functional explanations of syntactic universals
The possible objection that OT is thus not radically different from P&P is
countered by advocates of OT by pointing out that “a parameter that is “off ” is
completely inactive. But a constraint that is crucially dominated can still be active”
(McCarthy 2002: 242). For example, in Italian the dominated constraint Subject is
not always irrelevant: When a subject does not have a topic antecedent, the subject
appears overtly to satisfy the constraint Subject. Thus, OT constraints can account
for between-language variation and for within-language variation, unlike param-
eters, which only govern between-language variation (McCarthy 2002: 110).
A big difference in practice between OT and P&P is that OT analyses must be
explicit about which constraints are assumed and how they are ranked. The con-
straints are given catchy names (to fit the column heading of a constraint tableau),
and their content is typically spelled out in a separate paragraph of the text, as in (5)
(from Legendre 2001: 5):
(5) FullInt: Lexical items must contribute to the interpretation of a structure.
. According to Lightfoot (1999: 259), “if there are only 30–40 structural parameters, then
they must look very different from present proposals . . . a single issue of the journal Linguistic
Inquiry may contain 30–40 proposed parameters” (see also Newmeyer 2005: 81–83).
Parametric versus functional explanations of syntactic universals
broader typological context” (McCarthy 2002: 108). This is actually quite similar
in spirit to the P&P approach, whose vision is the exhaustive description of a lan-
guage by simply determining the settings of all the parameters, once they have
been discovered (see §2.2). The difference is that P&P has never come particularly
close to this vision, not even in the description of small fragments of a grammar,
and it is common to find P&P works that make very little reference to parameters.
Although in principle every P&P analysis of a particular language makes predictions
about other possible languages, the analyses are typically so complicated that it is
difficult to see what exactly is predicted. OT is more straightforward: “To propose
a constraint ranking for one language in OT is to claim that all possible rerankings
of these constraints yield all and only the possible human languages” (Legendre
2001: 15).
However, in practice this has not led to a large number of broad cross-
linguistic syntactic studies in the OT framework, so the success of this approach
is not yet apparent. Legendre (2001: 15), in the introduction to Legendre et al.
(eds.) 2001, mentions only a single paper in the entire volume as an example of
“typology by reranking”. And Sells (2001: 10–11), in the introduction to Sells
(ed.) (2001), states: “Typological predictions are brought out most directly in the
papers here by Choi, Morimoto and Sharma, but are implicit in most of them.” So
in actual practice, OT tends to be like P&P in that authors mostly aim at insightful
descriptions of particular languages. There is of course nothing bad about lim-
iting oneself to a language one knows well, but the skeptical reader cannot help
being struck by the contrast between the ambitious rhetoric and the typically
much more modest results.
role in their theorizing. For instance, Bresnan (1997: 38) provides a functional mo-
tivation for the constraint ProAgr (“Pronominals have agreement properties”):
“The functional motivation for the present constraint could be that pronouns . . .
bear classificatory features to aid in reference tracking, which would reduce the
search space of possibilities introduced by completely unrestricted variable refer-
ence.” Aissen (2003) explicitly links the key constraints needed for her analysis
to two of the main factors that functionalists have appealed to: iconicity and
economy. The idea that OT constraints should be “grounded” in functional con-
siderations is even more widespread in phonology (e.g., Boersma 1998; Hayes
et al. 2004), but the trend can be seen in syntax, too. Another reflection of this
trend is the appearance of papers that explicitly take a functionalist stance and use
OT just as a formalism, without adopting all the tenets of standard generative OT
(e.g., Nakamura 1999; Malchukov 2005).
I believe that the affinity between OT and functionalist thinking derives
from the way in which OT accounts for cross-linguistic differences: through the
sanctioning of constraint violations by higher-ranked constraints. The idea that
cross-linguistic differences are due to different weightings of conflicting forces
has been present in the functionalist literature for a long time (cf. Haspelmath
1999a: 180–181). OT’s contribution is that it turned these functional forces into
elements of a formal grammar and devised a way of integrating the idea of conflict
and competition into the formal framework.
But if certain nonoccurring languages are ruled out directly by non-formal func-
tional considerations, then it is unclear why the elaborate system of indirectly
functionally derived constraints is needed in the first place (see also Newmeyer
2005: 223–224). The DOM universals are amenable to a straightforward and com-
plete explanation in functional terms, so adopting a UG-based explanation instead
only makes sense if one takes an antifunctionalist stance for independent reasons.
Martin Haspelmath
There are many different authors who use the label “functionalist”, so I will con-
centrate on my own approach here (e.g., Haspelmath 1999a, 2004b, 2005, 2008),
which is largely compatible with the approaches of other authors such as Green-
berg (1966), Comrie (1981), Givón (1995), Dryer (1992, 2006), Stassen (1997),
Croft (2001, 2003), Cristofaro (2003, to appear), Hawkins (2004). I will also discuss
other nongenerative approaches to typology here that do not necessarily claim to
have explanations for universals.
4.1 Th
e fundamental difference between the functionalist
and the generative approach
The difference between functional and generative linguistics has often been framed
as being about the issue of autonomy of syntax or grammar (Croft 1995; New-
meyer 1998), but in my view this is a misunderstanding (see Haspelmath 2000).
Instead, the primary difference (at least with respect to the issue of explaining uni-
versals) is that generative linguists assume that syntactic universals should all be
derived from UG, i.e., that unattested language types are computationally (or bio-
logically) impossible (cf. §2.5), whereas functionalists take seriously the possibility
that syntactic universals could also be (and perhaps typically are) due to general
ease of processing factors. For functionalists, unattested languages may simply
be improbable, but not impossible (see Newmeyer 2005, who in this respect has
become a functionalist).
This simple and very abstract difference between the two approaches has
dramatic consequences for the practice of research. Whereas generativists put
all their energy into finding evidence for the principles (or constraints) of UG,
functionalists invest a lot of resources into describing languages in an ecumenical,
. Aissen also refers to Jäger (2004), noting that his evolutionary approach “may explain more
precisely why such grammars are dysfunctional and unstable”. But Jäger’s functionalist approach
entirely dispenses with the formalist elements of Aissen’s system and is thus not compatible with
her analysis.
Parametric versus functional explanations of syntactic universals
terms by Georg von der Gabelentz (1901: 481), in the passage that contains the
first mention of the term typology:11
Aber welcher Gewinn wäre es auch, wenn wir einer Sprache auf den Kopf zusa-
gen dürften: Du hast das und das Einzelmerkmal, folglich hast du die und die
weiteren Eigenschaften und den und den Gesammtcharakter! – wenn wir, wie es
kühne Botaniker wohl versucht haben, aus dem Lindenblatte den Lindenbaum
construiren könnten. Dürfte man ein ungeborenes Kind taufen, ich würde den
Namen Typologie wählen.
Plank (1998) documents a large number of mostly forgotten early attempts to link
phonological properties of languages with nonphonological (morphological and
syntactic) properties. Klimov (1977, 1983) (summarized in Nichols 1992: 8–12)
proposed a typology that links a large number of diverse lexical, syntactic and
morphological properties. And while Greenberg (1963) was conservative in the
kinds of implications that he proposed, others following him made bolder claims
on correlations involving word order patterns. Thus, Lehmann (1973, 1978) links
“the VO/OV distinction to phonological properties. For example, he claims that
the effect of phonological processes is “progressive” in OV languages, “anticipa-
tory” in VO. OV languages tend to have vowel harmony and progressive assimila-
tion; VO languages tend to have umlaut and anticipatory assimilation” (Lehmann
1978: 23).
Thus, quite analogously with the generative notion of “deep” parameters (or
“macroparameters”), the nongenerative literature is full of claims of “deep” impli-
cational universals. While the nongenerative literature does not make a connec-
tion to ease of language acquisition, it seems that the idea that there are just a few
basic blueprints of human languages, and that linguists could discover the key to
all the tight interconnections between structural features, is attractive and hard to
resist independently of the Chomskyan way of thinking about language.
. “But what an achievement it would be if we were able to confront a language and say to it:
“You have such and such a specific property and hence also such further properties and such
and such an overall character” – were we able, as daring botanists must have tried, to construct
the entire lime tree from its leaf. If an unborn child could be baptized, I would choose the name
typology.”
Parametric versus functional explanations of syntactic universals
set up on the basis of just a few languages and no serious attempt was made to justify
them by means of systematic sampling. When systematic cross-linguistic research
became more common after Greenberg (1963) (also as a result of the increasing
availability of good descriptions of languages from around the world), none of the
holistic types were found to be supported. For example, the well-known morpho-
logical types (agglutinating/flectional) were put to a test in Haspelmath (1999b),
with negative results. It is of course possible that we have simply not looked hard
enough. For example, I am not aware of a cross-linguistic study of vowel harmony
and umlaut whose results could be correlated with word order properties (VO/OV)
of the languages.
But there does seem to be a widespread sense in the field of (nongenerative)
typology that cross-domain correlations do not exist and should not really be
expected. After the initial success of word order typology, there have been many
attempts to link word order (especially VO/OV) to other aspects of language struc-
ture, such as comparative constructions (Stassen 1985: 53–56), alignment types
(Siewierska 1996), indefinite pronoun types (Haspelmath 1997: 239–241), and
(a)symmetric negation patterns (Miestamo 2005: 186–189). But such attempts have
either failed completely or have produced only weak correlations that are hard
to distinguish from areal effects. Nichols’s (1992) large-scale study is particularly
telling in this regard: While she starts out with Klimov’s hypotheses about cor-
relating lexical and morphosyntactic properties, she ends up finding geographical
and historical patterns instead, rather than particularly interesting correlations.
The geographical patterning of typological properties was also emphasized by
works such as Dryer (1989), Dahl (1996), and Haspelmath (2001), and the publica-
tion of The World Atlas of Language Structures (Haspelmath et al. 2005) documents
the shift of interest to geographical patterns. According to a recent assessment by
Bickel (2005), typology has turned into
a full-fledged discipline, with its own research agenda, its own theories, its own
problems. The core quest is no longer the same as that of generative grammar,
the core interest is no longer in defining the absolute limits of human language.
What has taken the place of this is a fresh appreciation of linguistic diversity in
its own right . . . Instead of asking “what’s possible?”, more and more typologists
ask “what’s where why?”.
1. If a language lacks overt coding for transitive arguments, it will also lack overt
coding for the intransitive subject (Greenberg 1963, Universal 38).
2. Grammatical Relations Scale: subject > object > oblique > possessor If a lan-
guage can relativize on a given position on the Grammatical Relations Scale, it can
also relativize on all higher positions (Keenan & Comrie 1977).
3. Alienability Scale: kin terms/body-part terms > part-whole/spatial relations
> culturally basic items > others If a language allows juxtaposition to express a pos-
sessive relation with a possessed item from one of the positions on the Alienability
Scale, it also allows juxtaposition with all higher positions (Nichols 1988: 572)
Parametric versus functional explanations of syntactic universals
. Another kind of universal syntactic generalization that has been explained functionally
is the Greenbergian word order correlations (see Hawkins 1994, 2004). These could be seen as
“cross-domain” in that they make generalizations between noun phrase and clause structure,
but the explanatory account provided by Hawkins makes crucial reference to the efficient use of
noun phrases in clauses. Similarly, as a reviewer notes, there are probably correlations between
word order and case-marking, e.g., a tendency for SVO languages to show less case-marking
that distinguishes subject and object than SOV or VSO languages. Again, these are explained
straightforwardly in functional terms. So larger correlations may exist, but they are found espe-
cially where the functional motivation is particularly clear.
Martin Haspelmath
the functionalist and other nongenerative typological literature and are used to
create a series of constraints with fixed ranking that can describe all and only the at-
tested systems. However, as I pointed out in §3, OT cannot say why the constraints
are the way they are, and Aissen does not succeed in explaining how prominence
scales such as those in (1)–(7) come to play a role in the makeup of constraints.
One gets the impression that Aissen creates the constraints because they work, and
they work because they are derived from the valid intra-domain implications that
we have seen. Aissen’s approach does not bring us closer to understanding why the
implicational universals should hold.
. In many texts, the number of intransitive and transitive clauses is about equal. But the case
of the intransitive subject (nominative or absolutive) is virtually always also used for one of the
transitive arguments, so it is always more frequent than the case used for the other transitive
argument.
Parametric versus functional explanations of syntactic universals
i. The functional explanations have nothing to do with the absence of an au-
tonomy assumption. Syntax and grammar are conceived of as autonomous, and
a strict competence-performance distinction is made. But there is no assumption
that competence grammars are totally independent and isolated: Grammars can
be influenced by performance.
ii. The functional explanations do not contradict the idea that there is a Uni-
versal Grammar, even though they do not appeal to it. Functional explanation and
Universal Grammar are largely irrelevant to each other.14
iii. The functional explanations do not presuppose a cognitively real (i.e., de-
scriptively adequate) description of languages. Phenomenological (i.e., observa-
tionally adequate) descriptions of languages are sufficient to formulate and test the
universals in (1)–(7) (Haspelmath 2004a).
. A reviewer comments that the scales in §4.4 are formulated in terms of categories, and
that these categories must come from UG, so that implicitly, the functional explanations, too,
appeal to UG. However, notice that with the exception of the scale in (2), all of the categories
are semantic, or can easily be defined as semantic categories. (For the scale in (2), a redefini-
tion in terms of semantic categories should be on the functionalists’ agenda.) That semantic
or conceptual categories must be universal at some level is clear from the fact that translation
between languages is in principle possible. Recognizing that conceptual structure is to some
extent universal does not amount to claiming that there is a Universal Grammar that is specific
to language. Some universality of conceptual (and phonetic) categories is also necessary to be
able to compare languages, but universality of strictly linguistic (morphosyntactic and phono-
logical categories) is not necessary (see Haspelmath 2006b).
Martin Haspelmath
iv. The explanations do not involve the notion of “restrictiveness” of the descrip-
tive framework, appealing instead to factors external to the description and the
framework (cf. Newmeyer 1998: ch. 3 on external vs. internal explanations).
v. The functional explanations have no direct implications for language
acquisition,15 but otherwise they are more easily tested than the generative
explanations.
5. Summary
Explaining syntactic universals is a hard task on which there is very little agree-
ment among comparative syntacticians. I have reviewed two prominent approaches
. Of course, the same functional factors that are responsible for adult systems are likely to
play a role in language acquisition, but this is quite independent of the functional explanation
of universals.
Parametric versus functional explanations of syntactic universals
to this task here, the generative parametric approach and the functionalist ap-
proach in the Greenbergian tradition (as well as the generative Optimality Theory
approach, which diverges in interesting ways from the parametric approach).
The disagreements between them start with the kinds of descriptions that form
the basis for syntactic universals: Generativists generally say that cognitively real
descriptions are required, whereas functionalists can use any kind of observation-
ally adequate description. Generativists work with the implicit assumption that all
universals will find their explanation in Universal Grammar, whereas functional-
ists do not appeal to UG and do not even presuppose that languages have some of
the same categories and structures (see Haspelmath 2006b). Functionalists attempt
to derive general properties of language from processing difficulty, whereas gen-
erativists see no role for performance in explaining competence.
But I have identified one common idea in parametric generative approaches
and nongenerative approaches to syntactic universals:16 the hope that the bewil-
dering diversity of observed languages can be reduced to very few fundamental
factors. These are macroparameters in generative linguistics, and holistic types
in nongenerative linguistics. The evidence for both of these constructs was never
overwhelming, and in both approaches, not only external critics, but also insiders
have increasingly pointed to the discrepancy between the ambitious goals and the
actual results. Generativists now tend to focus on microparameters (if they are
interested in comparative grammar at all), and nongenerative typologists now
often focus on geographical and historical particularities rather than world-
wide universals.
However, I have argued that there is one type of implicational universal that
is alive and well: intra-domain implications that reflect the relative processing
difficulty of different types of elements in closely related constructions. Such im-
plicational universals cannot be explained in terms of parameters, and the only
generative explanation is an OT explanation in terms of harmonic alignment of
prominence scales. However, this is a crypto-functionalist explanation, and unless
one is a priori committed to the standard generative assumptions, there is no
reason to prefer it to the real functional explanation.
References
. Interestingly, this idea seems to be much less present in Optimality Theory.
Martin Haspelmath
Aissen, J. 2003. Differential object marking: Iconicity vs. economy. Natural Language and
Linguistic Theory 21(3): 435–483.
Atkinson, M. 1994. Parameters. In The Encyclopedia of Language and Linguistics, Vol. 6,
R.E. Asher (Ed.), 2941–2942. Oxford: Pergamon.
Baker, M. 1988. Incorporation: A Theory of Grammatical Function Changing. Chicago IL:
University of Chicago Press.
Baker, M. 1996. The Polysynthesis Parameter. Oxford: OUP.
Baker, M. 2001a. The Atoms of Language: The Mind’s Hidden Rules of Grammar. New York NY:
Basic Books.
Baker, M. 2001b. Configurationality and polysynthesis. In Language Typology and Language Uni-
versals: An International Handbook, Vol. 2, M. Haspelmath, E. König, W. Oesterreicher &
W. Raible (Eds), 1433–1441. Berlin: Mouton de Gruyter.
Baker, M. 2003. Lexical Categories: Verbs, Nouns, and Adjectives. Cambridge: CUP.
Baker, M. & McCloskey, J. 2005. On the relationship of typology to theoretical syntax. Paper pre-
sented at the LSA Symposium on Typology in the U.S. (Downloadable at http://www.rci.
rutgers.edu/~ mabaker/papers.html)
Baltin, M. 2004. Remarks on the relation between language typology and Universal Grammar:
Commentary on Newmeyer. Studies in Language 28(3): 549–553.
Bickel, B. 2005. Typology in the 21st century: Major current developments. Paper presented at
the 2005 LSA Meeting (Symposium on typology in the United States).
Biloa, E. 1998. Syntaxe générative: La théorie des principes et des paramètres. Munich: Lincom.
Boersma, P. 1998. Functional Phonology: Formalizing the Interactions between Articulatory and
Perceptual Drives. The Hague: HAG.
Bossong, G. 1985. Differenzielle Objektmarkierung in den neuiranischen Sprachen. Tübingen: Narr.
Bresnan, J. 1997. The emergence of the unmarked pronoun: Chichewa pronominals in Optimal-
ity Theory. Berkeley Linguistics Society 23: 26–46. (Special Session on Syntax and Semantics
in Africa).
Bybee, J. 1988. The diachronic dimension in explanation. In Explaining Language Universals,
J.A. Hawkins (Ed.), 350–379. Oxford: Blackwell.
Bybee, J. 2003. Mechanisms of change in grammaticization: The role of repetition. In Handbook
of Historical Linguistics, R. Janda & B. Joseph (Eds), 602–623. Oxford: Blackwell.
Carnie, A. 2002. Syntax: A Generative Introduction. Oxford: Blackwell.
Chomsky, N. 1988. Language and Problems of Knowledge: The Managua Lectures. Cambridge
MA: The MIT Press.
Chomsky, N. 1991. Some notes on economy of derivation and representation. In Principles and
Parameters in Comparative Grammar, R. Freidin (Ed.), 417–454. Cambridge MA: The MIT
Press.
Chomsky, N. 2000. New Horizons in the Study of Language and Mind. Cambridge: CUP.
Chomsky, N. 2004. Beyond explanatory adequacy. In Structures and Beyond: The Cartography of
Syntactic Structures, Vol. 3, A. Belletti (Ed.), 104–131. Oxford: OUP.
Chomsky, N. & Lasnik, H. 1993. The theory of Principles and Parameters. In Syntax: An Inter-
national Handbook of Contemporary Research, Vol. 1, J. Jacobs et al. (Eds), 506–569. Berlin:
Mouton de Gruyter.
Cinque, G. 1999. Adverbs and Functional Heads: A Cross-Linguistic Perspective. Oxford: OUP.
Cinque, G. 2005. Deriving Greenberg’s universal 20 and its exceptions. Linguistic Inquiry 35(3):
315–332.
Parametric versus functional explanations of syntactic universals
Cinque, G. & Kayne, R. (Eds), 2005. The Oxford Handbook of Comparative Syntax. Oxford: OUP.
Comrie, B. 1978. Ergativity. In Syntactic typology, W.P. Lehmann (Ed.), 329–394. Austin TX:
University of Texas Press.
Comrie, B. 1981. Language Universals and Linguistic Typology. Oxford: Blackwell.
Comrie, B. 1990. Holistic versus partial typologies. In Proceedings of the Fourteenth International
Congress of Linguists, Vol. 1, W. Bahner, J. Schildt & D. Viehweger (Eds), 139–148. Berlin:
Akademie-Verlag.
Cristofaro, S. 2003. Subordination. Oxford: OUP.
Cristofaro, S. To appear. Language universals and linguistic knowledge. In J.J. Song, (Ed.),
Handbook of Typology. Oxford: OUP.
Croft, W. 1995. Autonomy and functionalist linguistics. Language 71: 490–532.
Croft, W. 2000a. Explaining Language Change: An Evolutionary Approach. London: Pearson.
Croft, W. 2000b. Parts of speech as language universals and as language-particular categories.
In Approaches to the Typology of Word Classes, P. Vogel & B. Comrie (Eds), 65–102. Berlin:
Mouton de Gruyter.
Croft, W. 2001. Radical Construction Grammar. Oxford: OUP.
Croft, W. 2003. Typology and universals. Cambridge: CUP.
Croft, W. & Cruse, A. 2004. Cognitive Linguistics. Cambridge: CUP.
Culicover, P.W. 1997. Principles and Parameters: An Introduction to Syntactic Theory.
Oxford: OUP.
Dahl, Ö. 1996. Das Tempussystem des Deutschen im typologischen Vergleich. In Deutsch –
typologisch, Lang, E. & Zifonun, G. (Eds), 359–368. Berlin: Mouton de Gruyter.
Dixon, R.M.W. 1997. The Rise and Fall of Languages. Cambridge: CUP.
Dryer, M. 1989. Large linguistic areas and language sampling. Studies in Language 13: 257–92.
Dryer, M. 1992. The Greenbergian word order correlations. Language 68: 81–138.
Dryer, M. 1997a. Are grammatical relations universal? In Essays on Language Function and
Language Type, J. Bybee, J. Haiman & S.A. Thompson (Eds), 115–143. Amsterdam: John
Benjamins.
Dryer, M. 1997b. Why statistical universals are better than absolute universals. Chicago Linguis-
tic Society 33: 123–145. (The Panels).
Dryer, M. 2006. Descriptive theories, explanatory theories, and basic linguistic theory. In
Catching Language: The Standing Challenge of Grammar Writing. F.K. Ameka, A. Dench &
N. Evans (Eds), 207–234. Berlin: Mouton de Gruyter.
Enrico, J. 2003. Haida syntax, 2 Vols. Lincoln NB: University of Nebraska Press.
Fanselow, G. & Felix, S. 1987. Sprachtheorie, Vol. 1. Tübingen: Francke.
Filimonova, E. 2005. The noun phrase hierarchy and relational marking: Problems and counter-
evidence. Linguistic Typology 9(1): 77–113.
Finck, F.N. 1909. Die Haupttypen des Sprachbaus. Leipzig: B.G. Teubner. (Reprinted: Darmstadt,
Wissenschaftliche Buchgesellschaft, 1965).
Fodor, J.D. 2003. Setting syntactic parameters. In The Handbook of Contemporary Syntactic The-
ory, M. Baltin & C. Collins (Eds), 730–767. Oxford: Blackwell.
Freeze, R. 1992. Existentials and other locatives. Language 68(3): 553–595.
Freidin, R. 1992. Foundations of Generative Syntax. Cambridge MA: The MIT Press.
Fukui, N. 1995. The principles-and-parameters approach: A comparative syntax of English and
Japanese. In Approaches to Language Typology, M. Shibatani & T. Bynon (Eds), 327–372.
Oxford: Clarendon Press.
Martin Haspelmath
Fukui, N. & Zushi, M. 2004. Introduction. In The Generative Enterprise Revisited: Discussions
with Riny Huybregts, Henk van Riemsdijk, Naoki Fukui and Mihoko Zushi, 1–25. Berlin:
Mouton de Gruyter.
Gabelentz, G. von der. 1901. Die Sprachwissenschaft, ihre Aufgaben, Methoden und bisherigen
Ergebnisse. Leipzig: Weigel.
Gilligan, G. 1987. A Cross-Linguistic Approach to the Pro-drop Parameter. Ph.D. dissertation,
University of Southern California.
Givón, T. 1995. Functionalism and grammar. Amsterdam: John Benjamins.
Givón, T. 1980. Ute Reference Grammar. Ignacio CO: The Ute Press.
Greenberg, J.H. 1963. Some universals of grammar with particular reference to the order of
meaningful elements. In Universals of grammar, J.J. Greenberg (Ed.), 73–113. Cambridge
MA: The MIT Press.
Greenberg, J. 1966. Language Universals, with Special Reference to Feature Hierarchies. The
Hague: Mouton.
Grewendorf, G. 2002. Minimalistische Syntax. Tübingen: Francke.
Grimshaw, J. & Samek-Lodovici, V. 1998. Optimal subjects and subject universals. In Is the Best
Good Enough?, P. Barbosa et al. (Eds), 193–219. Cambridge MA: The MIT Press.
Haegeman, L. 1997. Introduction: On the interaction of theory and description in syntax. In The
New Comparative Syntax, L. Haegeman (Ed.), 1–32. London: Longman.
Haider, H. 1994. (Un-)heimliche Subjekte – Anmerkungen zur Pro-drop Causa, im Anschluß
an die Lektüre von Osvaldo Jaeggli & Kenneth J. Safir (Eds), The Null Subject Parameter.
Linguistische Berichte 153: 372–85.
Haider, H. 2001. Parametrisierung in der Generativen Grammatik. In Language Typology and
Language Universals, M. Haspelmath, E. König, W. Oesterreicher & W. Raible (Eds), 283–
293. Berlin: Mouton de Gruyter.
Hale, K. 1983. Warlpiri and the grammar of non-configurational languages. Natural Language
and Linguistic Theory 1(1): 5–47.
Hale, K. & Reiss, C. 2000. Phonology as cognition. In Phonological Knowledge: Conceptual and
Empirical Issues, N. Burton-Roberts, P. Carr & G. Docherty (Eds), 164–184. Oxford: OUP.
Haspelmath, M. 1993. A Grammar of Lezgian. Berlin: Mouton de Gruyter.
Haspelmath, M. 1997. Indefinite Pronouns. Oxford: OUP.
Haspelmath, M. 1999a. Optimality and diachronic adaptation. Zeitschrift für Sprachwissenschaft
18(2): 180–205.
Haspelmath, M. 1999b. The Agglutination Hypothesis: A belated empirical investigation. Paper
presented at the ALT Conference, Amsterdam.
Haspelmath, M. 2000. Why can’t we talk to each other? A review article of Newmeyer, F. 1998.
Language Form and Language Function. Cambridge MA: The MIT Press. Lingua 110(4):
235–255.
Haspelmath, M. 2001. The European linguistic area: Standard Average European. In Language
Typology and Language Universals, M. Haspelmath, E. König, W. Oesterreicher & W. Raible
(Eds), 1492–1510. Berlin: Mouton de Gruyter.
Haspelmath, M. 2004a. Does linguistic explanation presuppose linguistic description? Studies
in Language 28(3): 554–579.
Haspelmath, M. 2004b. Explaining the Ditransitive Person-Role Constraint: A usage-based
account. Constructions 2004(2). (www.constructions-online.de)
Haspelmath, M. 2005. Argument marking in ditransitive alignment types. Linguistic Discovery
3(1): 1–21.
Parametric versus functional explanations of syntactic universals
Haspelmath, M. 2006a. Universals of causative verb formation. Paper presented at the Sympo-
sium Current Topics in Typology, Freie Universität Berlin.
Haspelmath, M. 2006b. Pre-established categories don’t exist – consequences for language
description and for language typology. Paper presented at the DGfS Annual Conference,
Bielefeld.
Haspelmath, M. 2008. Creating economical morphosyntactic patterns in language change. In
Language Universals and Language Change, J. Good (Ed.), 185–214. Oxford: OUP.
Haspelmath, M. To appear. A frequentist explanation of some universals of reflexive marking.
Haspelmath, M., Dryer, M.S., Gil, D. & Comrie, B. (Eds). 2005. The World Atlas of Language
Structures. Oxford: OUP.
Hawkins, J.A. 1994. A Performance Theory of Order and Constituency. Cambridge: CUP.
Hawkins, J.A. 2004. Efficiency and Complexity in Grammars. Oxford: OUP.
Hayes, B., Kirchner, R. & Steriade, D. (Eds). 2004. Phonetically Based Phonology. Cambridge: CUP.
Jackendoff, R. 1977. X Bar Syntax: A Study of Phrase Structure. Cambridge MA: The MIT Press.
Jaeggli, O. & Safir, K. (Eds). 1989. The Null Subject Parameter. Dordrecht: Foris.
Jäger, G. 2004. Learning constraint sub-hierarchies: The Bidirectional Gradual Learning Algo-
rithm. In Pragmatics in OT, R. Blutner & H. Zeevat (Eds), 251–287. Houndmills: Palgrave
MacMillan.
Jelinek, E. 1984. Empty categories, case, and configurationality. Natural Language and Linguistic
Theory 2: 39–76.
Jenkins, L. 2000. Biolinguistics: Exploring the Biology of Language. Cambridge: CUP.
Johannessen, J.B. 1998. Coordination. Oxford: OUP.
Joos, M. (Ed.), 1957. Readings in Linguistics: The Development of Descriptive Linguistics in Amer-
ica since 1925. New York NY: American Council of Learned Societies.
Julien, M. 2002. Syntactic Heads and Word-formation. Oxford: OUP.
Kager, R. 1999. Optimality Theory. Cambridge: CUP.
Kayne, R. 1981. On certain differences between French and English. Linguistic Inquiry 12:
349–371.
Kayne, R. 1984. Connectedness and Binary Branching. Dordrecht: Foris.
Kayne, R. 1994. The Antisymmetry of Syntax. Cambridge MA: The MIT Press.
Kayne, R. 1996. Microparametric Syntax: Some introductory remarks. In Microparametric Syn-
tax and Dialect Variation, J. Black & V. Montapanyane (Eds), ix–xvii. Amsterdam: John
Benjamins. (Reprinted in Kayne 2000: 3–9).
Kayne, R. 2000. Parameters and Universals. Oxford: OUP.
Keenan, E. & Comrie, B. 1977. Noun phrase accessibility and universal grammar. Linguistic
Inquiry 8: 63–99.
Keller, R. 1994. Language Change: The Invisible Hand in Language. London: Routledge.
Kiparsky, P. 2008. Universals constrain change, change results in typological generalizations. In
Language Universals and Language Change, J. Good (Ed.). Oxford: OUP.
Kirby, S. 1999. Function, Selection and Innateness: The Emergence of Language Universals.
Oxford: OUP.
Klimov, G.A. 1977. Tipologija jazykov aktivnogo stroja. Moskva: Nauka.
Klimov, G.A. 1983. Principy kontensivnoj tipologii. Moskva: Nauka.
Kuroda, S.-Y. 1988. Whether we agree or not: A comparative syntax and English and Japanese.
Linguisticae Investigationes 12: 1–47.
Legendre, G., Grimshaw, J. & Vikner, S. (Eds). 2001. Optimality-theoretic Syntax. Cambridge
MA: The MIT Press.
Martin Haspelmath
Plank, F. 2003ff. Das grammatische Raritätenkabinett: A leisurely collection to entertain and in-
struct. (Online collection of rare features in the world’s languages: http://ling.uni-konstanz.
de:591/universals/ introrara.html)
Prince, A. & Smolensky, P. 2004. Optimality Theory: Constraint Interaction in Generative Gram-
mar. Oxford: Blackwell. (Originally circulated in 1993).
Radford, A. 2004. Minimalist Syntax. Cambridge: CUP.
Rizzi, L. 1982. Issues in Italian Syntax. Dordrecht: Foris.
Roberts, I. & Holmberg, A. 2005. On the role of parameters in Universal Grammar: A reply to
Newmeyer. In Organizing Grammar: Linguistic Studies in Honor of Henk van Riemsdijk,
H. Broekhuis, N. Corver, M. Huybregts, U. Kleinhenz & J. Koster (Eds), 538–553. Berlin:
Mouton de Gruyter.
Roberts, I. 1997. Comparative Syntax. London: Arnold.
Roberts, I. 2003. Head movement. In The Handbook of Contemporary Syntactic Theory, M. Baltin &
C. Collins (Eds), 113–147. Oxford: Blackwell.
Sapir, E. 1921. Language. An Introduction to the Study of Speech. New York NY: Harcourt, Brace
and Jovanovich.
Sells, P. (Ed.), 2001. Formal and empirical issues on Optimality Theoretic syntax. Stanford: CSLI
Publications.
Sells, P. 2001. Introduction. In Formal and Empirical Issues on Optimality Theoretic Syntax,
P. Sells (Ed.), 1–16. Stanford CA: CSLI.
Sharma, D. 2001. Kashmiri case clitics and person hierarchy effects. In Formal and Empirical
Issues on Optimality Theoretic Syntax, P. Sells (Ed.), 225–256. Stanford CA: CSLI.
Siewierska, A. 1996. Word order type and alignment type. Sprachtypologie und Universalien-
forschung 49(2): 149–176.
Silverstein, M. 1976. Hierarchy of features and ergativity. In Grammatical Categories in Austra-
lian Languages, R.M.W. Dixon (Ed.), 112–171. Atlantic City NJ: Humanities Press.
Skalička, V. 1966. Ein typologisches Konstrukt. Travaux du Cercle Linguistique de Prague 2:
14–28. (Reprinted in: Hartmann, P. (Ed.), Typologische Studien, 335–341. Braunschweig:
Vieweg).
Smith, N. 1989. The Twitter Machine: Reflections on Language. Oxford: Blackwell.
Stassen, L. 1985. Comparison and Universal Grammar. Oxford: Blackwell.
Stassen, L. 1997. Intransitive Predication. Oxford: OUP.
Taylor, J.R. 2002. Cognitive Grammar. Oxford: OUP.
Travis, L. 1989. Parameters of phrase structure. In Alternative Conceptions of Phrase Structure,
M. Baltin & A. Kroch (Eds), 263–279. Chicago IL: University of Chicago Press.
Vikner, S. 1997. V°-to-I° movement and inflection for person in all tenses. In The New Compara-
tive Syntax, L. Haegeman (Ed.), 189–213. London: Longman.