Modality in Grammar and Discourse
Modality in Grammar and Discourse
Editorial Board:
Wallace Chafe (Santa Barbara) Ronald Langacker (San Diego)
Bernard Comrie (Los Angeles) Charles Li (Santa Barbara)
R.M.W. Dixon (Canberra) Andrew Pawley (Canberra)
Matthew Dryer (Buffalo) Doris Payne (Oregon)
John Haiman (St Paul) Frans Plank (Konstanz)
Kenneth Hale (Cambridge, Mass.) Jerrold Sadock (Chicago)
Bemd Heine (Köln) Dan Slobin (Berkeley)
Paul Hopper (Pittsburgh) Sandra Thompson (Santa Barbara)
Andrej Kibrik (Moscow)
Volumes in this series will be functionally and typologically oriented, covering specific
topics in language by collecting together data from a wide variety of languages and language
typologies. The orientation of the volumes will be substantive rather than formal, with the
aim of investigating universals of human language via as broadly defined a data base as
possible, leaning toward cross-linguistic, diachronic, developmental and live-discourse data.
The series is, in spirit as well as in fact, a continuation of the tradition initiated by C. Li (Word
Order and Word Order Change, Subject and Topic, Mechanisms for Syntactic Change) and
continued by T. Givón (Discourse and Syntax) and P. Hopper (Tense-Aspect: Between
Semantics and Pragmatics).
Volume 32
JOAN BYBEE
University of New Mexico
SUZANNE FLEISCHMAN
University of California, Berkeley
Introduction 1
Joan Bybee
University of New Mexico
Suzanne Fleischman
University of California, Berkeley
This volume is the result of a symposium on Mood and Modality held at the
University of New Mexico in 1992, the goal of which was to bring together
linguists whose research has targeted this area of grammar but whose ap
proaches to it reflect differing perspectives on functional linguistics. The
symposium was planned so as to include a diversity of languages, of foci
(synchronic and diachronic), and of theoretical orientations, especially with
regard to the interaction of morphosyntactic, semantic and discourse-prag
matic factors.
Given the complexity of this linguistic domain, the many and diverse
ways it comes to be expressed in different languages, and the tendency of
researchers to work narrowly within the confines of their own theoretical and
methodological frameworks, there was reason to fear that symposium partici
pants might not find adequate common ground nor a sufficiently common
metalanguage to be able to communicate with one another. However, just as a
similar symposium on tense and aspect a decade earlier (Hopper 1982)
confirmed the status of those categories as valid cross-language categories of
grammar—what we refer to, following Bybee & Dahl (1989), as 'gram
types'—, so too the current symposium succeeded in demonstrating that
despite differences in terminology, language areas, and theoretical perspec
tives, we were in effect all examining similar phenomena and could partici
pate in meaningful dialogue about our data and analyses. Thus while the
papers in this volume present a wide range of topics and perspectives, they
2 Modality in Grammar and Discourse
A second issue we wish to clarify at the outset of our discussion concerns the
relationship between domains of modality and their expression in natural
languages. In some approaches to modality, function is studied for its own
sake. In this volume we take the position that analysis of function should
explain distribution of form. That is, the modal categories we operate with do
not lead an autonomous existence in some abstract logical or semantic space;
rather, they correspond to—indeed are determined by—(a) the formal distinc
tions made in particular languages, (b) documented pathways of language
change, and (c) prominent cross-language patterns of form-function correla
tion.
In the area of modality, however, cross-language comparison has been a
difficult task, for several reasons. First, because the semantic/functional do
main of modality is so broad; second, because modality, as we have discov
ered, lends itself best to investigation in social, interactive contexts
(elaborated below); third, and conceivably most important, because of the
extent to which languages differ in their mapping of the relevant semantic
content onto linguistic form. A case in point that came up repeatedly in the
course of our discussions concerns the category irrealis and the nature of the
realis/irrealis distinction. The languages that came under our scrutiny differed
in terms of what they classify as realis and what they classify as irrealis,
prompting us to question whether the distinction is cross-linguistically valid
at all, and if so, whether it corresponds to a gram-type distinction, such as
perfective/imperfective, or whether it more closely resembles a supercategory
such as mood.
As noted above, it also became clear over the course of our discussions at
the symposium that many of the functions of modality are inextricably em
bedded in contexts of social interaction and, consequently, cannot be de
scribed adequately apart from their contextual moorings in interactive
discourse. Several papers in this volume explore particular modalities specifi
cally as they occur in contexts of face-to-face communication. But even in
those that do not make explicit reference to contexts of social interaction, this
factor is nonetheless in evidence.
4 Modality in Grammar and Discourse
A problem with the deontic notion for linguists, however, is the imperfect
nature of its fit with the corresponding linguistic categories that we encounter
in the world's languages as well as in language change: i.e., it is at once too
broad and too narrow. For one thing, unlike 'agent-oriented' modality—a
supercategory label that will be used by most papers in this volume in
preference to 'deontic' 3 —deontic modality fails to distinguish subcategories
that are expressed inflectionally (i.e. as grammaticalized moods), such as
imperative, from lexical ox periphrastic (i.e. auxiliary) expressions of obliga
tion or permission. Furthermore, deontic modality as traditionally understood
excludes certain semantically related notions such as ability (physical and
mental) and desire that have linguistic expression similar to that of permission
and obligation. While one argument for the category 'deontic' might be the
well-documented pathway of change whereby deontic modals over time come
to acquire epistemic functions, in actual fact this change affects a broader
range of meanings than the term 'deontic' indicates.
The traditional division of modality into epistemic and deontic reveals
some interesting cases of polysemy in which the same form can be used for
both types of modality. Thus English may can express either deontic permis
sion (you may come in now) or epistemic possibility (this may be your lucky
day!), while must can express deontic obligation (you must be here by seven)
as well as inferred probability (that must be the mailman at the door). A
diachronic view of this polysemy yields the observation, documented in many
languages, that so-called deontic meanings typically evolve into epistemic
meanings. 4
With regard to the difficulties attaching to 'deontic' as a supercategory
label, even this change from deontic to epistemic meaning in fact affects a
broader range of categories. While it is true that obligation markers may come
to be used for epistemic functions such as probability or inference, in the case
of permission markers it is not 'permission' per se that licenses a meaning of
epistemic possibility (e.g. in the case of may). Virtually all permission mark
ers can be traced back to expressions of ability, permission being just one
sense of a more generalized 'root-possibility' meaning that arises from ability
(Bybee, Perkins and Pagliuca 1994). Root possibility predicates general ena
bling conditions (e.g. it can take three hours to get there). These include
permission, which is a social enabling condition. Bybee (1988) has shown
that it is the root possibility sense that gives rise to epistemic possibility.
A second instance of categories other than narrowly-defined deontic
6 Modality in Grammar and Discourse
The paper by Suzanne Romaine takes a diachronic look at the Tok Pisin
particle bai, which now functions primarily as a future marker but which also
has a variety of modal functions (as do most future markers). Drawing on an
extensive corpus of data (synchronic and diachronic, spoken as well as
written), Romaine traces the stages through which the clause-initial time
adverb baimbai 'by and by' grammaticalizes into the reduced pre-verbal
marker bai and in the process acquires a set of future and 'irrealis' functions.
Her study also points out the striking conformity of these developments to
cross-linguistically established trends.
Irrealis modality is also a primary focus of Suzanne Fleischman's paper,
summarized in the section below on the interaction of modality with other
categories of grammar.
As noted above, there are certain similarities between Irrealis categories
and the Subjunctives of European languages. Patricia Lunn's paper is repre
sentative of recent work on the Spanish Subjunctive in its appeal to pragmatic
considerations—contextual and interactional factors—to account for the use
of Subjunctive vs. Indicative forms in several varieties of discourse. Lunn
shows that Subjunctive coding is not limited to unreal and non-assertive
propositions, as suggested in traditional accounts of the Spanish Subjunctive;
this mood can also be used to signal background information in literary texts,
and in journalistic discourse to mark particular information as 'common
knowledge'.
In the final section of this volume we group together four papers that in
different ways explore the interaction between modality and other domains of
grammar, specifically: negation, complementizers, past tense, and imperfec-
tive aspect. In these papers we see once again the crucial role that interactive
contexts play in shaping the meanings that result from the combinations of
grammatical categories.
Frank Palmer's contribution investigates the systematic irregularity
(this is not an oxymoron) that we find across languages in the behavior of
modals and in the meanings that emerge when modals appear under the scope
of negation. A particularly widespread irregularity involves the strategies
languages use to express the semantic notions of 'necessary-not' and 'not-
necessary'. This may involve use of a different verb altogether from the one
12 Modality in Grammar and Discourse
NOTES
* We are grateful to Greg Thomson for assisting the editors and authors in manuscript
preparation and copy editing. The indexes were prepared by Lisa Dasinger and Jacki
Trademan.
1 In this volume we avoid the term 'mode' because of the problematic ambiguities it
presents, being used with widely different meanings in the grammars of different
languages. In many European languages it translates what is here referred to as 'mood',
while in the grammars of certain non-European languages it is used to label categories
whose meanings fall under the headings of tense and aspect. We are aware, of course,
that languages often bundle tense, aspect, and mood information into portmanteau
morphology, thereby making it difficult to decide how to label such categories.
2 The term 'evidential' was first introduced by Jakobson (1957) as a tentative label for a
verbal category that indicates the source of the information on which a speaker's
statement is based. As currently understood, evidentiality covers a range of distinctions
involved in the identification of the source of one's knowledge. Various languages have
grammaticalized evidential markers indicating whether or not the speaker vouches
personally for the information contained in a statement. (See Chafe and Nichols 1986,
Willett 1988).
3 In place of the traditional distinction of linguistically-relevant modals into 'epistemic'
and 'deontic', Bybee (1985) recategorizes the modals into 'agent-oriented,' 'speaker-
oriented', and 'epistemic'. These categories will be defined and elaborated on below.
Other categories distinguished in modal logic, e.g. 'dynamic' and 'alethic' modalities
(cf. Lyons 1977:791, Palmer 1986:102-103), will not be discussed here, being less
germane to the analysis of modality in natural language.
4 The earlier 'deontic' meanings may or may not be preserved. English must, for example,
retains its obligative meaning, whereas might has lost its earlier abilitative meaning.
REFERENCES
Bybee, Joan L. 1985. Morphology: A Study of the Relation between Meaning and Form
(=Typological Studies in Language, 9) Amsterdam & Philadelphia: John Benjamins.
Bybee, Joan L. 1988. "Semantic Substance vs. Contrast in the Development of Grammati
cal Meaning." Proceedings of the Berkeley Linguistic Society. 14.247-264.
Bybee, Joan & Östen Dahl. 1989. "The Creation of Tense and Aspect Systems in the
Languages of the World". Studies in Language 13.51-103
14 Modality in Grammar and Discourse
Bybee, Joan L., William Pagliuca, & Revere D. Perkins. 1991. "Back to the Future".
Approaches to Grammaticalization. ed. by E. C. Traugott &. B. Heine, vol. 2, 17-58.
Amsterdam & Philadelphia: John Benjamins.
Bybee, Joan L., Revere D. Perkins & William Pagliuca. 1994. The Evolution of Grammar:
Tense, Aspect and Modality in the Languages of the World. Chicago: University of
Chicago Press.
Chafe, Wallace L., & Johanna Nichols. 1986. Evidentiality: The Coding of Epistemology
in Language. Norwood, N.J.: Ablex.
Gerhardt, Julie. 1985. "On the Use of will and gonna: Toward a Description of Activity
Types for Child Language". Discourse Processes 8.143-75.
Gerhardt, Julie. 1990. "The Relation of Language to Context in Children's Speech: The
Role of hafta Statements in Structuring 3-Year-Old's Discourse". IPrA Papers in
Pragmatics A. 1-57.
Hopper, Paul, ed. 1982. Tense-Aspect: Between Semantics and Pragmatics. Amsterdam:
John Benjamins.
Jakobson, Roman. 1957. Shifters, Verbal Categories, and the Russian Verb. Cambridge,
Mass: Russian Language Project, Department of Slavic Languages and Literatures,
Harvard University. (Repr. The Selected Writings of Roman Jakobson. Vol 2.130-147
[1971] The Hague: Mouton.)
Lyons, John. 1977. Semantics , vol. 2. Cambridge: Cambridge University Press.
Palmer, Frank R. 1986. Mood and Modality. Cambridge: Cambridge University Press.
Willett, T. 1988. "A Cross-Linguistic Survey of the Grammaticization of Evidentiality".
Studies in Language. 12.51-97.
I
Agent-Oriented and Epistemic Modality
Agent-Oriented vs. Epistemic Modality
Some Observations on German Modals1
Bernd Heine
University of Cologne
1. Introduction
2. German modals
A number of authors have pointed out that German modals differ from their
English counterparts essentially in being more verb-like (Abraham
Agent-Oriented vs. Epistemic Modality 19
1989:357); some (cf. Jenkins 1972:9-12; Steele et al. 1981; Abraham 1992)
go so far as to argue that a categorial boundary separates German and English
modals, the latter belonging to a category called AUX and the former to that
of Verb. German modals such as können 'can' or müssen 'must', etc. differ
from their English counterparts, e.g., in the following ways:
2.2. A survey
In the present section, eight German modals are looked at with a view to
determining how and to what extent they are associated with agent-oriented/
deontic and epistemic modality, respectively, or, to use the terminology of
German grammarians, with "objective" and "subjective" modality. The mo
dals in question are:
müssen 'must'
sollen 'should'
können 'can'
wollen 'want, will'
mögen 'like, may'
dürfen 'be allowed to, may'
möchten 'want to, would like to'
werden 'will'
One may wonder what justification there is to include werden, essentially
a future tense marker, within the paradigm of modal auxiliaries. While I do
not wish to argue that werden is a modal, as has been done elsewhere (see
especially Vater 1970), the main reason for including it here is that like "true
modals" it exhibits a distinction between non-epistemic and epistemic uses.
Note furthermore that können, like English can, has two non-epistemic
senses: root possibility and permission (cf. Bybee, Perkins, and Pagliuca
1994). These two senses will not be distinguished in the quantitative analysis
that follows; our concern will be essentially with the distinction between non-
epistemic and epistemic modality, rather than between agent-oriented and
epistemic modality. These distinctions will be clarified below.
In the association of the eight modals with these two categories of
modality in given utterances, the following three situations will be distin
guished, where "A" stands for agent-oriented and "E" for epistemic modality:
Agent-Oriented vs. Epistemic Modality 21
In what follows, quantitative data are presented that bear on the question
of whether and to what extent certain standardized utterances are associated
with the two kinds of modality. To this end, index values are calculated on the
basis of the three-fold classification proposed above: a value of 1.00 is given
to an utterance whenever a focal sense is involved, 0.66 in the case of a non-
focal sense, and 0.33 in the case of a marginal sense. I illustrate this procedure
with reference to examples (1) and (2) above: the utterance in (1) (Er muß
mindestens 1.80 m sein) would receive an index value of 1.00 for its focal
epistemic sense and of 0.66 for its non-focal agent-oriented sense. Utterance
(2), on the other hand, would have an index value of 1.00 for its focal agent-
oriented sense and of 0.33 for its marginal epistemic sense. Values are
calculated for declarative (e.g., Er muß kommen 'He must come'), interroga
tive (Muß er kommen? 'Does he need to come?'), and negative utterances (Er
muß nicht kommen 'He need not come'), for main verb perfect forms (Er muß
gekommen sein 'He must have come'), auxiliary past tense forms (Er mußte
kommen 'He had to come'), as well as for first- (Ich muß kommen 'I must
come'), second- (Du mußt kommen 'You must come'), and third-person
subject utterances (Er muß kommen 'He must come'). The sum totals are
divided by the total number of utterances considered, thereby establishing
average index values for agent-oriented and epistemic modalities. The maxi
mum value attainable is 1.00, the minimum value 0 (zero). For example, a
modal will receive the maximum value of agent-oriented modality if in all
utterances considered it has a focal agent-oriented sense, and the minimum
value if it has neither focal, non-focal or marginal agent-oriented senses; see
APPENDIX for details. 6
A number of calculations have been carried out on the basis of selected
parameters; the results obtained are summarized in Tables 1 through 4. In
Table 1 the eight modals are compared in terms of the degree to which they
correlate with each of the two kinds of modality. As the index values suggest,
the modals can be arranged along a scale for each of the two modalities. At
Agent-Oriented vs. Epistemic Modality 23
one end of the scale are möchten and dürfen, which have the highest values for
agent-oriented modality and exhibit no epistemic uses, at least not in the
corpus considered. [FN 7] At the other end we find können and sollen, which
have the lowest values for agent-oriented and the highest values for epistemic
modality.
The extent to which these modals are associated with agent-oriented and
epistemic modality appears to correlate with their relative degree of grammat-
icalization in general and with prototypical 'verbiness' in particular: thus, the
highest A values and the lowest E values are found with the items most
prototypically verb-like. Wollen 'will', for instance, is more verbal (has more
verbal properties) than modals like müssen 'must', können 'can', or sollen
'should', in that it can still take subordinate clauses introduced by daß 'that'
as complements and, accordingly, does not require subject identity between
the auxiliary and the main verb, as in (3). Also it can be passivized, as in (4)
(Vater 1970).
(3) Hans will, daß Anna zu Hause bleibt.
Hans wants that Anna at home stays
'Hans wants Anna to stay home.'
(4) Von Hans wird gewollt, daß Anna zu Hause bleibt.
by Hans is wanted that Anna at home stays
'Hans is requested to leave Anna at home.'
24 Bernd Heine
Table 2. Index of modality of four German expressions (on the basis of eight German
modals; see Appendix)
Index value of modality
Agent-oriented Epistemic
a. Sie kommt. 'She is coming.' 0.87 0.02
b. Sie weiß es. 'She knows it.' 0.77 0.32
c. Sie hat viel Geld. 'She has a lot of money.' 0.67 0.27
d. Sie kennt ihn. 'She knows him.' 0.37 0.59
The behavior of the modals also differs greatly in accordance with the
type of main verb figuring in the utterances. Thus, verbs of action and telic
verbs associate predominantly with agent-oriented modality; cf. utterance (a)
in Table 2, involving the motion verb kommen 'come'. The situation is more
complicated in the case of stative verbs: while some, like wissen 'know
(information)' and haben 'have', gravitate toward agent-oriented uses,
kennen 'know, be acquainted with', has a majority of epistemic uses.
The modal behavior of the eight items considered also varies in accord
ance with the type of proposition and the tense and aspect in which the modals
occur, as can be seen in Table 3. According to this table, in Perfect construc
tions the modals are confined to agent-oriented uses (a). Even with verbs such
as kennen, which generally associate strongly with epistemic modality, an
epistemic reading is unlikely once the modal auxiliary has a Perfect form.
Furthermore, in Interrogative (b) and Negative (c) propositions and in the
Past tense (d) the modals correlate primarily with agent-oriented rather than
with epistemic modality. Epistemic uses prevail, however, if the main verb is
in the Perfect (f), and Progressive constructions8 are almost entirely associ
ated with epistemic modality (g).
Finally, the interpretation of an utterance is also affected by the gram
matical person of the subject. As Table 4 suggests, an utterance is most likely
to have an agent-oriented interpretation when the subject referent is first-
person and least likely when it is third-person; conversely, epistemic modality
correlates most strongly with third-person and least strongly with first-person
subjects.
3. Context
These are but a few of the factors that may be relevant to the evocation of
contextual frames, possible additional factors being frequency and/or inten
sity of experience, perceptual salience, propensity for memory storage, or the
stereotypes an utterance tends to evoke.
In addition to contextual frames, the choice between the two kinds of modality
depends crucially on certain conceptual properties. Agent-oriented uses of
German modals are associated with the following properties:
a. There is some force (F) that is characterized by an "element of
will" (Jespersen 1924:320-1), i.e., that has an interest in an event
either occurring or not occurring.9
b. The event is to be performed typically by a controlling agent (C).
c. The event is dynamic (D), i.e., it involves the manipulation of a
situation and is conceived of typically as leading to a change of
state.
d. The event has not yet taken place at reference time, i.e., its occur
rence, if it does in fact take place (see (e) below), will be later than
the reference time (L).
e. The event is non-factual (Palmer 1986:96), though there is a certain
degree of probability that it will occur (P).
Take utterance (9), for example, where F is the speaker (9a) or some
other force the speaker has in mind (9b), and C is the sentence subject er. The
event is dynamic (D), as can be concluded from the use of the action verb
kommen, and is supposed to take place later than at reference time (L).
Finally, the modal muß expresses a high degree of probability (= strong
obligation) that the event will in fact take place (P).
(9) Er muß kommen.
he must come
'He has to come.'
a. [A: "I insist that he comes"]
b. [A: "His boss insists that he comes"]
Differences between the various modals exist with regard to the degree of
probability that the event will take place: the probability is low in the case of
30 Bernd Heine
können and dürfen (permission) but high in the case of müssen (strong
obligation), with the remaining modals being intermediate. But the modals
differ also in other respects. There are essentially three kinds of relevant
participants: the speaker (S), the modal force (F), and the agent (C), the last of
these being typically, though not necessarily, coded as the sentence subject.
While F is different from C in the case of modals such as müssen 'must',
sollen 'shall, should' or können 'can' (= "subject-external modality"), as can
be seen, e.g., in (9), it is identical with C in the case of the modals wollen,
mögen, and möchten (= "subject-internal modality"), as in (10), where the
subject ich 'I' is both the modal force F and the agent C, and in this instance
also S.
While we can distinguish the two kinds of modality on the basis of the
conceptual properties C, F, D and L, these properties are not of equal impor
tance. Consider, for example, utterance (12b). The use of muß does not entail
L; that is, there is no later-than-reference-time constraint involved: the action
of schlafen can be assumed to take place over an interval that includes
reference time, yet (12b) has a focal agent-oriented sense.
(12) a. Warum ist Klaus nicht hier?
why is Klaus not here
'Why is Klaus not here?'
b. Er muß schlafen.
he must sleep
'He has to sleep.'
It might seem as if F is also irrelevant in (12b) since no explicit force is
mentioned. Yet one could inquire about a possible F by asking: Wér ist dafür
verantwortlich, daß er schlafen muß? 'Who is responsible for the fact that he
has to sleep?' Thus, F may be said to be potentially present in (12b), albeit
backgrounded.
In examples like (12b), where L does not apply, the difference between
agent-oriented and epistemic modality becomes minimal: While (12b) has a
focal sense of agent-oriented modality, it may also be understood to convey an
epistemic meaning, i.e. 'he must be sleeping.' In such cases we are dealing
with what Coates refers to as merger: the agent-oriented and the epistemic
senses of (12b) are in a both/and relationship, they are mutually compatible;
in order to understand (12b) it is therefore not necessary to decide which of the
two senses is intended since they are not mutually exclusive (see Section 4).
32 Bernd Heine
Furthermore, there are contexts in the use of müssen 'must' where both C
and L are absent and D is largely irrelevant, as in (13). Verbs that behave in a
similar way to frieren 'freeze, be cold' are leiden 'suffer', weinen 'cry', or
gähnen 'yawn'.
(13) Er muß frieren.
he must freeze
'He is freezing/feeling cold.'
In examples like (13), F is some unidentified force, and we are dealing
with an instance of focal agent-oriented and non-focal epistemic modality.
Once F is eliminated, however, an agent-oriented interpretation is ruled out:
no F is implied in (14), which therefore expresses exclusively epistemic
modality.
(14) Ihn muß frieren.
him must freeze
'He must be freezing/cold.'
What the observations made in this section suggest is that of the four
properties considered, only F is obligatorily connected with agent-oriented
modality: modal utterances that lack F lack the "element of will" to which
Jespersen (1924:320-21) referred. In such cases we are dealing with an
epistemic rather than an agent-oriented sense.
Typically, the lack of F will leave all other components of the modal
concept unaffected. Thus, P remains the same whether or not F is present: the
degree of probability that the situation described in the utterance will obtain is
high in the case of müssen 'must' and low in the case of können 'can',
irrespective of F. Take, for example, the modal sollen 'shall, should': in the
agent-oriented interpretation of (15), F may refer either to the speaker (15a)
or to another person (15b), while in the epistemic interpretation of (15), the
speaker is excluded as source of the modal force (15c); the modal source is
associated with someone other than the speaker, as suggested by the fact that
(15d) would not be an appropriate paraphrase of (15).
3.5 Discussion
The distinction between the two categories of modality has been associated
with a number of factors. First, as pointed out above, one major distinguishing
property is that agent-oriented senses are likely to refer to situations that are
supposed to obtain later than at reference time (= property L). Thus, an
utterance like (17) (= (5)) has future reference when used in its agent-oriented
sense (17a) but present reference when used epistemically (17b).
(17) Das Bier sollte kalt sein.
the beer should cold be
T h e beer should be cold.'
a [A: "I want the beer to be cold (so you'd better put it into the
fridge again")]
b [E: "I have reason to assume that the beer (standing in front of
me) is cold"]
Similar observations have been made for English modals. Coates
(1983:235), for example, notes that, with the exception of root can, English
modals in their agent-oriented uses always have future reference (see also
Palmer 1986:97; Bybee, Perkins, and Pagliuca 1994):
[...] one of the conditions of using Root MUST, SHOULD or OUGHT (=
'Obligation') or Root MAY or CAN (= 'Permission') is that the speaker
believes that the action referred to in the main predication has not already
been achieved. In other words, commands, recommendations and permis
sion-granting utterances all refer to an action which will be carried out at a
time subsequent to the utterance (Coates 1983:233).
34 Bernd Heine
4. On models
grammaticalization chains (Heine 1992). In the case of be going to, the shift
from physical motion to grammatical function proceeded in a chain-like
manner. This is suggested on the one hand by historical evidence (cf. Pérez
1990) and on the other hand, by the present use patterns of the construction.
That conceptual shift in the process of grammaticalization is chain-like
and continuous is due to the particular circumstances giving rise to new
grammatical meanings: such meanings are derived from existing meanings
via context extension (Aijmer 1985), and through the conventionalization of
invited inferences or conversational implicatures (Traugott and König 1991).
This reasoning has been advanced with reference to the transition from agent-
oriented to epistemic meanings. Thus, Traugott (1989:50-51) argues that from
'permission' one can implicate 'expectation': thus, if I say You may go, I may,
in the right circumstances, implicate that I want you to go, from which you
may infer that you have some obligation to go. With reference to the modality
of obligation, Traugott and König observe:
[...] must in the epistemic sense of 'I conclude that' derived from the
obligative sense of 'ought to' by strengthening of conversational inferences
and subjectification. If I say She must be married in the obligation sense, I
invite the inference that she will indeed get married. This inference is of
course epistemic, pertaining to a state of affairs that is anticipated to be true
at some later time (Traugott and König 1991:209).
Sweetser (1982) on the other hand suggests that the conceptual shift from the
agent-oriented to the epistemic domain is a clear instance of a metaphorical
process. Bybee, Pagliuca and Perkins (forthc.) again argue that the English
modal auxiliaries must and should both have experienced an extension from
agent-oriented to epistemic modality, but whereas the extension of must is
suggestive of metaphorical transfer, that of should might be due to the
conventionalization of implicature and, hence, would have to be dealt with in
terms of the context model.
The different behavior of must and should had already been pointed out
by Coates (1983:14ff.). In her work on English modal auxiliaries, she ob
served that in the transition from one kind of modality to another there are
three types of indeterminacy, which she refers to, respectively, as gradience,
ambiguity, and merger. Gradience concerns the nature of the continuum of
meaning, e.g., in the transition from the core of ABILITY to the periphery of
POSSIBILITY of English can. Ambiguity may be described in terms of the
following properties (cf. Coates 1983:15-16):
Third, such observations suggest that there are reasons to regard the transition
from agent-oriented to epistemic concepts as being metaphorically structured,
as has been argued for independently by Sweetser (1982; 1988), Bybee and
Pagliuca (1985), and Heine, Claudi and Hünnemeyer (1991): it involves a
transfer from one domain of human experience to another, from the
sociophysical world to the epistemic world (Sweetser 1982:492-5), the former
being a dynamic world of willful human beings who act and are capable of
imposing their will on other agents, the latter essentially a static world, one
that may but need not be associated with human participants (Heine, Claudi
and Hiinnemeyer 1991:176-178).
Agent-Oriented vs. Epistemic Modality 43
The result is that temporal distance serves as a metaphorical vehicle for social/
interpersonal distance as well as for a number of other notions relating to
evidentiality, speaker subjectivity, etc.
A different explanation is volunteered by Bybee (This volume) with
regard to English modal verbs used in the Past tense. She discusses some of
the same evidence adduced by Fleischman but describes the phenomenon
without reference to metaphor; rather, her parameters are the particular se
mantics characterizing modals on the one hand, and the use of past tense on
the other:
[...] the use of wanted to in present time in Modern English is possible
because it implies that certain conditions on carrying out the wanted predi
cate may not be met. The so-called polite or remote uses of Past tense as in
(21) / wanted to ask you a question.
arise in the same way. That is, (21) implies that there might be conditions
that are unmet. Among these implied conditions is the question of whether
the addressee wants to be asked a question — thus the deferential use of the
Past. (Bybee, this volume).
a. First is the fact that modal verbs are stative verbs. When modals are
used in the Past what is asserted is that a state existed before the
moment of speech, but not that the state necessarily still exists in
the present.
b. Since the state may still exist in the present, a modal in the Past
tense may be used in contexts where it has present relevance: it
may receive the interpretation "that the modality is still in effect,
and [that] the predicate action will be carried out if the right
conditions are met" (Bybee, this volume).
c. The combination of the modal sense and the Past sense is said to
produce a hypothetical reading; compared to Present tense modals,
modals in the Past tense may then express hypothetical situations
in conditional apodosis, as can be seen in (30a), as opposed to
(30b):
(30) a. If I saw Judy, I would tell her the news.
b. If I see Judy, I will tell her the news.
d. In the context of a dialogic discourse (typically involving first and
second person referents), the hypothetical sense may then be ex
ploited to express a deferential or polite sense, as in (29) above,
e.g., to tone down or weaken the illocutionary force of an otherwise
assertive speech act, as described in the above quotation by
Fleischman.
As is to be expected in such processes of grammaticalization, with the
development of new senses, old uses may gradually lose in significance. For
example, the more the hypothetical sense gains ground, the more the past
sense disappears. Stage (a) is no longer relevant for English Past tense modals
such as would and could', for should, Bybee (This volume) adds that "one
could argue that there are no past uses at all".
I do not wish to delve into the question as to whether the development
sketched in (a) through (d) is appropriate only to languages with a fuzzy/
overlapping zone between the morphological categories of past tense and
subjunctive, or to languages lacking a subjunctive category altogether. In
German, for example, Bybee's scenario does not seem to apply in cases where
there is a well marked formal distinction between Past tense and Subjunctive:
Agent-Oriented vs. Epistemic Modality 45
It would seem, however, that this question does not touch the main point made
by Bybee. What is obvious, is that Bybee's account offers a legitimate
alternative to that of Fleischman, one that rests on a series of inferences that
can be described in terms of context-induced reinterpretation (Heine, Claudi
and Hiinnemeyer 1991, Ch. 3).
Bybee (this volume) argues that Fleischman's metaphorical explanation
of a transfer from temporal distance to social distance in certain politeness
forms does not account for the fact that it is always modal verbs that are
involved in this transfer. Yet Bybee's account does not explain how it is that
modals which express a situation involving unmet conditions are also used for
marking a certain kind of social relationship (politeness). What I wish to argue
here again (cf. Heine, Claudi and Hiinnemeyer 1991), is that an explanation
for a conceptual shift in the process of grammaticalization must have two
components: one relates to the macro-effects of the process which are sugges
tive of a shift from one domain of human conceptualization to another, in this
case from the domain of time to that of social relations; the other relates to the
micro-effects which are suggestive of a gradient sequence of contextual and
inferential extensions leading to the emergence of chain-like use patterns of
the linguistic items concerned.
While Fleischman's analysis highlights the macro-level of the process,
Bybee's approach perspectivizes the micro-level. The way in which the two
models are interrelated is sketched graphically in Figure 1 (cf. Heine, Claudi
and Hiinnemeyer 1991, Ch. 4). Thus, rather than contradicting or excluding
one another, the two models complement each other and are both required for
a better understanding of grammaticalization.
46 Bernd Heine
Domain
TIME SOCIAL RELATIONS
Context
Figure 1. A metaphor-and-context model for some of the senses of English would and
should.
5. Conclusions
In previous research it has been established that epistemic uses of modals tend
to develop out of agent-oriented uses; that is, the latter are older than the
former (see especially Shepherd 1982; Bybee and Pagliuca 1985; Traugott
1989; Bybee, Pagliuca and Perkins 1994.). From this it follows that contexts
associated with agent-oriented modality reflect more conservative or less
grammaticalized use patterns than contexts associated with epistemic modal
ity. While there may be exceptions, this conclusion seems to hold true for
most of the data considered here; it is also confirmed by diachronic analysis.14
With reference to German this would mean in particular that the most con
servative behavior is encountered if
a. modals such as möchten 'would like to', dürfen 'be allowed to,
may', or wollen 'want', rather than modals like sollen 'should' or
können 'can' are involved,
b. the main verb is an action verb like kommen 'come',
c. the modal is used in the perfect or the past tense,
d. the modal occurs in interrogative rather than in declarative utter
ances,
e. the modal occurs in negative rather than in affirmative utterances, or
f. if the subject is a first or second person, rather than a third person
referent.
Agent-Oriented vs. Epistemic Modality 47
Some related observations have also been made for English modals. Traugott
(1989:52), for example, notes that older meanings tend to be maintained
longer in negative environments, and draws attention to the relative paucity of
epistemic (as opposed to agent-oriented) uses of must not and mustn't in
British English and to the maintenance of the volitional sense of will in We
won't go.
One main claim made in this paper is that it is not possible to describe the
transition between agent-oriented and epistemic modality in German modals
exclusively in terms of linguistic categorization. What has to be taken into
consideration in addition are the contextual frames with which particular uses
of a modal are associated. Utterances having human agents as their subject
and dynamic main verbs may still have a focal epistemic sense if associated
with a contextual frame that discourages an agent-oriented interpretation.
More important, however, is the presence vs. absence of the modal force F in
determining which variety of modality a given modal utterance receives.
Finally, as we saw in Section 4, the transition from one kind of modality
to another has elements of discontinuity as well as continuity, and can thus be
accounted for by appeal to both the metaphor model and the context model of
grammaticalization. In this respect, the problem looked at in this paper does
not differ significantly from other instances of grammaticalization studied so far.
NOTES
1 I wish to express my gratitude to Werner Abraham, Joan Bybee, Eithne Carlin, Wallace
Chafe, Ulrike Claudi, Zygmunt Frajzyngier, Tom Givón, Christa Kilian-Hatz, Christa
König, Andreas "Donald" Lessau, Frank Lichtenberk, Franz Potyka, Heinz Roberg,
Dan Slobin, Thomas Stolz, Eve Sweetser, as well as a number of other participants of the
Albuquerque symposium on Mood and Modality for valuable discussions on this paper,
most of all to Suzanne Fleischman. I am also indebted to the Deutsche Forschungs
gemeinschaft (German Research Society) for its financial support.
2 In the following, y will use the terms "agent-oriented" and "deontic" as defined by
Bybee, Pagliuca and Perkins (1991, 1994), with slight modifications to be specified
below. Note that the distinctions referred to above are not only terminological; for
Coates (1983), for example, "root modality" embraces "deontic" and "dynamic catego
ries", and "epistemic modality" also includes "alethic" modality.
3 See Steele et al. (1981:260-264) and Heine (1993) for additional exemplification.
4. Note, however, that, compared to full-fledged verbs, they have a reduced paradigm of
personal inflections, lacking e.g. the present tense 3sg. suffix -t; see below.
48 Bernd Heine
5 Square brackets "[]" are used in this paper for contextual information relevant for the
semantic interpretation of an utterance.
6 While it is hoped that the observations made in the course of the survey will reveal some
salient semantic characteristics of German modals, the survey is biased in several ways.
For example, the three informants, all students of the University of Cologne having some
acquaintance with linguistics, had to take the three-fold classification proposed here for
granted in their responses.
7. The situation would be different if subjunctive uses of modals were included.
8 I am referring to the non-standard progressive of the form Er ist am Essen 'He is eating'
which, although not acceptable in High German, is nevertheless found in some form or
other in many modern German dialects.
9 Cf. the notion of preference as defined by Givón (1990:529).
10. For further evidence on the metaphorical nature of conceptual shift in grammaticaliza-
tion, see Heine, Claudi and Hünnemeyer 1991.
11 The examples provided by Traugott (1989:207) involve such developments as that from
GO to future, from COME to perfect, or from BE AT/IN to progressive. The process
concerned is described by Traugott as one involving "spatio-temporal metaphors".
12 In addition to these two models, a third model has to be distinguished, namely the
bleaching or containment model (Tom Givón, p.c.; see Heine, Claudi and Hiinnemeyer
1991:108ff.). We will not further deal with this model here, essentially since it is
confined to semantic substance while our primary concern is with the cognitive and
pragmatic strategies underlying the process of grammaticalization (but see below).
13 As noted above, this does not necessarily apply to werden, whose status as a modal is controversial.
14 The present paper is based exclusively on synchronic findings.
REFERENCES
Bybee, Joan L., Revere D. Perkins, and William Pagliuca forthc. The Grammaticization of
Tense, Aspect, and Modality in the Languages of the World. Chicago, London:
University of Chicago Press.
Bybee, Joan L. This volume. The Semantic Development of Past Tense Modals in
English. Paper prepared for the Symposium on Mood and Modality, University of
New Mexico, Albuquerque, 8 - 1 0 May, 1992.
Calbert, Joseph P. 1971. "Modality and Case Grammar." Working Papers in Linguistics
(Ohio State University), August 1971:85-132.
Claudi, Ulrike and Bernd Heine 1986. "On the Metaphorical Base of Grammar." Studies
in Language 10,2:297-335
Coates, Jennifer 1983. The Semantics of the Modal Auxiliaries. London/Canberra: Croom
Helm.
Fisiak, Jacek, ed. 1985. Historical Semantics, Historical Word Formation. Berlin:
Mouton de Gruyter.
Fleischman, Suzanne 1989. "Temporal Distance: A Basic Linguistic Metaphor." Studies
in Language 13,1:1-50.
Givón, T. 1990. Syntax: A Functional-Typological Introduction. Volume II. Amsterdam/
Philadelphia: John Benjamins.
Haiman, John 1985. Iconicity in Syntax. (Typological Studies in Language, 6.) Amster
dam/Philadelphia: John Benjamins.
Heine, Bernd 1992. "Grammaticalization Chains." Studies in Language 16,2:335-68.
Heine, Bernd 1993. Auxiliaries. New York: Oxford University Press.
Heine, Bernd, Ulrike Claudi and Friederike Hünnemeyer 1991. Grammaticalization: A
Conceptual Framework. Chicago: University of Chicago Press.
Helbig, Gerhard and Joachim Buscha 1988. Deutsche Grammatik. Ein Handbuch für den
Ausländerunterricht. Eleventh edition. Leipzig: VEB Enzyklopädie.
Jacobs, R. and P. Rosenbaum, eds 1970. Readings in English Transformational Gram
mar. Waltham, Mass.: Ginn and Co.
Jenkins, L. 1972. Modality in English Syntax. Indiana University Linguistics Club:
Mimeograph.
Jespersen, Otto 1924. The philosophy of grammar. London: Allen and Unwin.
Lakoff, George & Mark Johnson 1980. Metaphors We Live by. Chicago/London: Univer
sity of Chicago Press.
Langacker, Ronald W. 1985. "Observations and Speculations on subjectivity." In Haiman
1985. Pp. 109-150.
Levinson, Stephen C. 1983. Pragmatics. Cambridge: Cambridge University Press.
Palmer, Frank R. 1986. Mood and Modality. Cambridge: Cambridge University Press.
Pérez, Aveline 1990. "Time in Motion: Grammaticalisation of the Be Going to Construc
tion in English." La Trobe University Working Papers in Linguistics 3:49-64.
Perlmutter, D. M. 1970. "The Two Verbs 'Begin'." In Jacobs, R. and P. Rosenbaum 1970.
Pp. 107-119.
Ramat, Anna Giacalone 1991. "Grammaticalization Processes in the Area of Temporal
and Modal Relations." To appear Studies in Second Language Acquisition 14 (1992).
Shepherd, Susan C. 1982. "From Deontic to Epistemic: An Analysis of Modals in the
History of English, Creoles, and Language Acquisition." In: Ahlqvist, Anders 1982.
Pp. 316-323.
50 Bernd Heine
Steele, Susan M., Adrian Akmajian, Richard Demers, Eloise Jelinek, Chisato Kitagawa,
Richard Oehrle, and Thomas Wasow 1981. An Encyclopedia of AUX: A study in
Cross-Linguistic Equivalence. (Linguistic Inquiry Monographs, 5.) Cambridge,
Mass./London: MIT Press.
Stephany, Ursula 1989. "Modality in First Language Acquisition: The State of the Art."
To appear in Proceedings of the Symposium on Modality in Language Acquisition,
Berlin, Free University, 1-3 May, 1989.
Stolz, Thomas 1991. Von der Grammatikalisierbarkeit des Körpers. I: Vorbereitung.
Typescript, University of Bochum.
Sweetser, Eve Eliot 1982. Root and Epistemic Modals: Causality in Two Worlds."
Berkeley Linguistics Society 8: 484-507.
Sweetser, Eve Eliot 1984. Semantic Structure and Semantic Change: A Cognitive Linguis
tic Study of Modality, Perception, Speech Acts, and Logical Relations. Ph. D. disserta
tion, University of California at Berkeley.
Sweetser, Eve Eliot 1988. Grammaticalization and Semantic Bleaching. Berkeley Lin
guistics Society 14:389-405.
Traugott, Elizabeth C. 1989. "On the Rise of Epistemic Meanings in English: An Example
of Subjectification in Semantic Change." Language 65,1:31-55.
Traugott, Elizabeth C. & Bernd Heine, eds 1991.1, 1991.2. Approaches to Grammatical
ization. 2 volumes. Amsterdam/Philadelphia: John Benjamins.
Traugott, Elizabeth C. and Ekkehard König 1991. "The Semantics-Pragmatics of Gram
maticalization Revisited." Traugott, E. C. and B. Heine 1991.1. Pp. 189-218.
Vater, Heinz 1970. On the Generation of Modal Verbs. Paper read at the Ethno-Linguistic
Seminar, Bloomington, Indiana.
ABBREVIATIONS
APPENDIX
The data presented below have been elicited from three native speakers of High German.
While a number of discrepancies were noted among these informants, the overall ratios
turned out to be essentially the same.
Agent-Oriented vs. Epistemic Modality 51
A. Linguistic data
1. Present
Er muß kommen A E2 Er muß Geld haben E Er muß sie kennen E
Er soll kommen A E Er soll G. haben E Er soll sie kennen E
Er kann kommen A E2 Er kann G. haben A E Er kann sie kennen E
Er will kommen A Er will G. haben A E Er will sie kennen E
Er mag kommen A E2 Er mag G. haben A2 E Er mag sie kennen E
Er darf kommen A Er darf G. haben A Er darf s. kennen
Er möchte kommen A Er möchte G haben A Er möchte s. kennen A
Er wird kommen A E2 Er wird G. haben A2 E Er wird sie kennen A2 E
2. Past
Er mußte kommen,. A Er mußte Geld haben. Er mußte sie kernten.
sollte A E sollte E sollte
konnte A konnte A konnte
wollte A wollte A wollte E2
mochte A mochte A2 mochte
durfte A durfte A durfte A
3. Perfect
hat kommen Er hat Geld haben Er hat ihn kennen
müssen. A müssen. A2 müssen. A2
sollen. A sollen. A2 sollen. A2
können. A können. können. A2
wollen. A wollen. A wollen. A2
4. Negation
Er muß nicht kommen. A Er muß kein Geld haben. Er muß sie nicht kennen. A2 E
soll A soll A soll Al E
kann A kann Al E kann E
will A will A will A E
mag A mag A2 mag A El
darf A darf A darf A
möchte A möchte A möchte A
wird A wird Al E wird E
52 Bernd Heine
5. Question
Muß er kommen? A Muß er Geld haben ? A2 Muß er sie kennen? A E
Soll A Soll A1 Soll
Kann A Kann A El Kann E
Will A Will A Will El
Mag A Mag Al Mag
Darf A Darf A Darf A
Möchte A Möchte A Möchte A
Wird A Wird Al Wird E2
0 = minimal value)
Index of modality
(1.00- maximal,
B Quantitative data (Based on A above)
modality
Value of
J
J
Agent-oriented
Agent-oriented
Epistemic
Epistemic
Kind of proposition
Kind of expression
Kind of modal
Dürfen 47 0 7 63 0.75 0
Möchten 41 1 6 54 0.76 0.18
Wollen 48 23 8 72 0.67 0.32
Können 38 30 8 72 0.53 0.42
Müssen 31 31 8 72 0.43 0.43
Mögen 23 25 7 63 0.37 0.40
Werden 25 32 6 54 0.46 0.59
Sollen 30 40 8 72 0.42 0.56
The Expression of Root and Epistemic
Possibility in English1
Jennifer Coates
Roehampton Institute
Root Epistemic
In an utterance such as she must go to bed now, F may refer to the speaker
(T insist that she goes to bed now') or to an absent parent ('her mother insists
that she goes to bed at this time') or to any other source of power the speaker
may have in mind. The pronoun she is the agent (A), the event — go — is
dynamic (D), the event has not yet taken place at the moment the utterance is
produced (L), and there is a high probability (P) that the event referred to will
take place.
As Heine (this volume) says: "While prototypical instances of agent-
oriented modality are characterised by the presence of the properties [i.e. F,
A, D, L and P], prototypical instances of epistemic modality lack all proper
ties except P". This means that the fewer of these properties there are in any
given instance, the weaker will be the semantic contrast between the two
interpretations of these sentences. If we look at the examples given in (1)
above, we can justify the claim that the semantic contrast between root and
epistemic meaning is strong here by applying Heine's criteria. In (la) and (lb)
all of the properties are present, while only P is present in (lc). In other words,
examples of root must in English would normally be classified as prototypical
examples of agent-oriented modality, while examples of epistemic must
would be classified as prototypically epistemic.
Let's turn to the contrast between root and epistemic possibility. Examples
(2) and (3) below are typical instances of the expression of root possibility:2
(2) well I think there is a place where I can get a cheap kettle
(S. 1.4.62)
(3) I am afraid this is the bank's final word. I tell you so that you may
make arrangements elsewhere. (W.7.9.37)
Can (see example 2) is the normal exponent of root possibility in English,
while may, as illustrated in example (3), is the exponent of root possibility in
more formal contexts (in this case, a letter from a Bank Manager). Properties
A, D, and L are present in both (2) and (3), but F (force) is absent.
In everyday discourse, can expressing root possibility is most commonly
found in examples like (4) and (5) below, general statements of possibilities
with impersonal subjects.
(4) certain things can be sex-linked to the Y chromosome (S.5b.2.54)
(5) first thing in the morning they come, you can hear the whistle
(S.1.14A.43)
58 Jennifer Coates
These two examples display none of Heine's properties apart from P. Even
more perplexing are archetypal examples of can such as (6) below, which
Palmer (1990:152-4) describes as 'existential':
(6) Lions can be dangerous
Palmer justifies his use of the term 'existential' by claiming that paraphrases
involving 'some' {some lions are dangerous) or 'sometimes' {lions are some
times dangerous) are more appropriate than paraphrases using 'possible for'.
What is intriguing about examples like (6) is that, besides lacking F, A, D, and
L (the properties associated with non-epistemic modality), they also lack P. In
other words, some examples of can lack all the properties which Heine claims
are normally associated with modal meaning.
Although we have seen that examples of root possibility vary in the
number of properties associated with them (examples (2) and (3) are associ
ated with properties A, D, L and P, examples (4) and (5) are associated only
with P, while example (6) is associated with none of the properties), they all
share one characteristic: absence of F. In his analysis of German modals,
Heine argues that only F is obligatorily connected with root modality: "once
the use of a modal is characterised by a lack of F then it lacks that 'element of
will' that Jespersen (1924:320-1) referred to, and we are dealing with an
epistemic rather than an agent-oriented sense" (Heine, this volume). On the
basis of this argument, (2), (3), (4), (5) and (6) all fail as instances of root
modality.
Let's look now at the expression of epistemic possibility in English.
Epistemic possibility has many exponents, notably maybe, perhaps, I think,
possibly, probably and the modal auxiliaries may, might and could. Examples
are given in (7), (8), (9) and (10) below.
la + + + + + root obligation
1b + + + + root necessity
lc + epistemic necessity
2/3 - + + + + root possibility
4/5 +
6 - - - - - 'existential'
7
8
9
-
+ +
+
+ + epistemic possibility
Figure 2.
+
Matrix showing presence or absence of Heine's properties
modal: I'm not sure in (7), don't know in (8), I think it's unlikely in (9), and the
only snag is in (10). (We can compare earlier examples of root possibility: I can
get a cheap kettle (2) and you can hear the whistle (5). Both are statements of
fact; subjectivity is not involved. The difference between I can get a kettle and I
may get a kettle is that in the latter the speaker's uncertainty is encoded too.)
We therefore need to add S to the matrix to show that this acts as the
criterial property where the expression of possibility is concerned, (see Figure 3)
If we ignore P, which is common to both root and epistemic modality
(except in unusual cases like (6)), we can see that the reason for the weakness
of the root/epistemic distinction in the expression of possibility is the absence,
in utterances involving root possibility, of properties normally associated with
root meaning. Exponents of root necessity (e.g. lb) differ from exponents of
epistemic necessity (e.g. lc) on five measures, whereas exponents of root
possibility (e.g. 2/3) may differ from exponents of epistemic possibility (e.g.
9) by as little as one property (presence or absence of S).
One of the reasons that the weakness of this distinction has been unprob-
lematic for speakers of English is that root and epistemic possibility are, by
and large, expressed by different linguistic forms. May is the only modal form
which regularly expresses both root and epistemic possibility, and when
expressing root possibility it is restricted to the most formal contexts (as in (3)
above). The extent of overlap between the two forms is small, as the following
statistics show (these record my analysis of a representative sample of 200
cases of can and 200 cases of may, all examples of spoken British English
taken from the Survey of English Usage):
la + + + + + - root obligation
lb + + + + + - root necessity
2/3 — + + + +
root possibility
4/5 + :}
6 - - - - - - 'existential'
lc - - - - + + epistemic necessity
7 - - - - +
8
9
-
-
-
+
-
+
+
+
+
+ + epistemic possibility
+
Figure 3. Matrix to show the distinction between root and epistemic meanings in English
+
The Expression of Root and Epistemic Possibility in English 61
Merger, as I have explained elsewhere (see Coates 1983; Leech and Coates
1980), refers to instances where two meanings co-exist in a both/and relation
ship. In other words, two readings are available for a given utterance, but
instead of having to choose one meaning and discard the other (as with
ambiguous examples), the hearer is able to process both meanings. Merger
occurs quite frequently in more formal texts (example 11) and is becoming
endemic in academic writing (as example (12) illustrates):
62 Jennifer Coates
(11) or the pollen may be taken from the stamens of one rose and
transferred to the stigma of another (W. 10.3.27)
(12) ... the process of simplification ... through which even forms and
distinctions present in all the contributory dialects may be lost
(Trudgill 1986:126)
In both these examples, the only property clearly present is P. Properties
associated with root meaning (F, A, and D) are absent, while S, normally
criterial for epistemic meaning, is not typically associated with this kind of
formal style with its passives and inanimate subjects. So instances such as
these lack clear markers of either root or epistemic meaning. The two mean
ings merge, and the reader is not required to choose one or the other:3
First, it provides the missing negative form in the epistemic must para
digm (see Palmer, this volume). The invariant form can't (not cannot or can
not) expresses 'it's necessarily the case that ... not ...' (nec ~ p) or 'it's not
possibly the case that' (~ poss p) in examples such as (14). (Note the stress on
can't and the fall-rise intonation contour.)
(14) [speaker describes friends arriving early]
I almost phoned them up and said come a bit later — and then I
thought oh they 've probably left by now — so I didn 't and — twelve
thirty, now that... can't be them, and it was (S.2.7.6)
Second, can is used in interrogative constructions to express epistemic
possibility. Example (15) is a cliché of pop music and Hollywood-style films,
but it makes the point clear:
(15) Can it be true?
['Is it possible that this is true? that she loves me?']
Such examples have a clear relationship with the use of can't discussed above,
in that (15) could be glossed it can't be true! (it must be false). Example (16)
comes from a radio discussion of Government policy on pensions:
(16) Can that be sensible ?
['Is it possibly the case that that is sensible?']
The speaker was clearly trying to make the point That can't be sensible!
In British English, these are still the only contexts in which a form of can
is used with epistemic meaning (though could is making headway as an
alternative to might in the expression of tentativeness — see example 10). But
in American English, can is starting to appear in other contexts. The following
example occurred during the Symposium on Mood and Modality (held at the
University of New Mexico in May, 1992) as a participant finished her presen
tation:
(17) we hope this coding system can be useful [to other linguists work
ing in the field]
This utterance meant something like 'we hope there's a chance that this
system will be useful'. For British speakers, this utterance is not possible: a
British speaker would have to say 'we hope this coding system will be useful',
thereby losing the subjective force. As the person who uttered (17) com-
64 Jennifer Coates
4. Conclusions
In this paper, I have argued that the distinction between root and epistemic
meanings is much weaker in the case of possibility than in other areas of
modal meaning. I have demonstrated that the weakness of this distinction
arises from the nature of root possibility, typical examples of which are not
associated with Heine's properties F, A, D and L (normally criterial for root
meaning). As a direct consequence of the weakness of the root/epistemic
distinction, instances of merger are common (in the case of may), and epis
temic readings are beginning to occur in declaratives with can. Speakers will
exploit the potentialities of the English modal system to say the things they
need to say. Whether this means that can, like the other English modal
auxiliaries, will develop the full range of epistemic meanings remains to be
seen.
NOTES
1 I would like to record my gratitude to Bas Aarts, Joan Bybee, Suzanne Fleischmann,
Talmy Givón, Bernd Heine and Charles Meyer for their comments on earlier drafts of
this paper.
2 Examples from this point onwards will be taken from the Survey of English Usage,
University College, London (prosodic information omitted).
3 Can is also beginning to be involved in merger, in contexts where the speaker/writer
wishes to hedge what they are saying. A nice example occurs on Inland Revenue (UK)
Tax Forms: False statements can result in prosecution.
REFERENCES
Bybee, Joan. 1988. "Semantic Substance Versus Contrast in the Development of Gram
matical Meaning". Proceedings of the Fourteenth Berkeley Linguistic Society 247-279.
Bybee, Joan & William Pagliuca. 1985. "Cross-Linguistic Comparisons and the Develop
ment of Grammatical Meaning". Historical Semantics and Historical Word Formation
ed. by J. Fisiak, 59-84. The Hague: Mouton.
Coates, Jennifer. 1983. The Semantics of the Modal Auxiliaries, London: Croom Helm.
Coates, Jennifer & Geoffrey Leech. 1980. "The Meanings of the Modals in Modern
British and American English". York Papers in Linguistics 8.23-34.
Guo, Jiansheng. This volume. "The Interactional Stucturing of Meaning: Children's Use
and Development of the Mandarin Modal neng 'can'".
66 Jennifer Coates
Haegeman, Liliane. 1983. The Semantics of Will in Present-day British English. Brussels:
Paleis der Academiën.
Heine, Bernd. This volume. "Agent-Oriented vs. Epistemic Modality — Some Observa
tions on German Modals".
Jespersen, Otto. 1924. The Philosophy of Grammar. London: Allen & Unwin.
Leech, Geoffrey & Jennifer Coates. 1980. "Semantic Indeterminacy and the Modals".
Studies in English Linguistics ed. by Sidney Greenbaum, Geoffrey Leech & Jan
Svartvik, 79-90. London: Longman.
Lyons, John. 1977. Semantics. Cambridge: Cambridge University Press.
Palmer, Frank. 1990. Modality and the English Modals. (2nd edition). London: Longman.
Stephany, Ursula. 1986. "Modality". Language Acquisition ed. by Paul Fletcher &
Michael Garman. (2nd edition) 375-400. Cambridge: Cambridge University Press.
Traugott, Elizabeth Close. 1989. "On the Rise of Epistemic Meanings in English: An
Example of Subjectification in Semantic Change". Language 65.31-55.
Trudgill, Peter. 1986. Dialects in Context. Oxford: Blackwell.
Contextual Conditions for the Interpretation
of 'poder' and 'deber' in Spanish*
Carmen Silva-Corvalán
University of Southern California
1. Introduction
This study deals with some aspects of the grammar of poder 'can, may' and
deber 'must' in Spanish. Poder and deber are considered to be modal verbs or
modal auxiliaries both on syntactic and semantic grounds (see, among others,
Marcos Marín 1975:211; Narbona 1989; Rivero 1977).1 Many grammars of
Spanish (e.g. Gili Gaya 1976; Hernández 1986; Real Academia Española
1973) include querer 'want', saber 'know', osar 'dare', soler 'be used to', etc.
as 'modal verbs' as well. Syntactically, however, poder, deber, osar and soler
are different in that they share the requirement to occur exclusively in con
struction with an Infinitive with an equivalent subject, a constraint which does
not apply to querer and saber (see Narbona 1989, and Rivero 1977 for further
discussion of the syntax and semantics of these and other periphrastic verbal
constructions). Semantically, poder and deber stand apart as the only modals
that can make a statement, in at least one interpretation, about the possibility
of p, as the paraphrases of examples (1 a-b), given in (2 a-b), show.
There are, then, syntactic and semantic arguments for treating poder and
deber as a class in Spanish. In contrast to modal verbs in English, whose
syntactic and semantic characteristics have been studied in depth (see Coates
1983, Perkins 1982 & 1983, Palmer 1977 & 1979, Halliday 1970, and numer
ous references in these studies and in the present volume), Spanish modals
have received little attention, perhaps due to the fact that their syntactic
characteristics have not appeared to be as distinct as those of the English
modals when compared with other verbs which may occur in auxiliary posi
tion in verbal periphrases.
Note, however, that poder and deber pose a number of special syntactic
and semantic questions of general and specific interest: What syntactic con
straints apply to these modal verbs? What is the most adequate way to
describe their semantics? Can a monosemantic approach account for the
various uses oí poder and deber? What is the relation between the messages
conveyed by poder and by deber (as in examples 3-4)? What role do the
linguistic and extra-linguistic contexts play in the interpretation of the mean
ing of these forms? Indeed, as can be seen in the suggested readings of the
following examples, poder and deber can convey different messages, which
include such notions as ability, permission, possibility, and necessity.
2. Theoretical frameworks
Sweetser further agrees with the observation made by other linguists (e.g.
Coates, Kratzer, Perkins) that pragmatic factors will determine which world
the modal is taken as operating in. Thus, the identification of a real-world
cause will determine a root reading, while the identification of a body of
premises will determine an epistemic one.6 In either case, the contextual
interpretation of such modals as can and may (as well as of verbs like let and
allow) appears to be that of "taking away a potential barrier" (485). Sweetser
acknowledges that there may be ambiguity between real-world force and
epistemic force, which in turn implies contextual ambiguity, but she leaves
open the question of the basic meaning or basic contribution that different
modals make to the interpretation of an utterance.
Like Coates, I feel that "the imprecision of our knowledge of the world
might be inherent" (p.ll), and that fuzzy set theory may be of use in the
analysis of modality (cf. Sweetser on contextual ambiguity). Contrary to
Coates (and perhaps also to Sweetser), however, I argue that modals have
basic, core meanings along the lines of the definitions proposed by Perkins
(1983), while fuzziness applies rather to the contexts, linguistic and extra
-linguistic, in which modals are used. Fuzziness or graded membership of
contexts allows for various interpretations of modalized propositions, and the
precise point at which one interpretation is no longer possible is usually
difficult to establish. Contextually inferred messages (e.g. ability, possibility),
in principle infinite, have been mistakenly considered to be the, or part of the,
meaning of the modal.7
The question of the basic meanings conveyed by linguistic expressions,
as opposed to the meanings, messages or pragmatic implicatures which these
forms appear to have in specific contexts of use, is a recurrent theme in
semantic-pragmatic studies. With particular respect to closed grammatical
systems (e.g. pronouns, verbal affixes), there seems to be general agreement
that a distinction between basic, invariant, or systemic meaning, and second
ary, implicated, or non-systemic meanings or functions must be made (Bello
1977; Bull 1971; García 1975; King 1992; Silva-Corvalán 1991). By contrast,
there is little agreement as to what exactly constitutes the semantic substance
of the linguistic units under analysis.
With respect to poder and deber, I propose that these modals may be
characterized as having invariant meanings which account for the use of the
modal instead of a non-modalized verb, or for the choice of one modal over
the other in particular discourse contexts. Poder and deber interact with other
Interpretation of 'poder' and 'deber' 73
characterized by the same common reason for using one form rather than
another. Thus, though there is in principle no end to the messages that may be
postulated, I would like to propose that CMs may cluster into "great classes of
messages" (the most frequent ones becoming the PMs) that underlie groups of
utterances where the modal is inferred to convey the same or similar meanings.
The term 'modality', as the expression of speaker attitude towards the
contents of a proposition, is a semantic notion covering a range of meanings:
certainty, probability, possibility, belief, obligation, necessity, permission,
volition, intention, doubt, prediction, and denial. The speaker's attitude may
be conveyed through verb morphology (mood), lexically, syntactically, pro-
sodically, or by a combination of these linguistic devices. Here, Ï focus on
lexical and affixal manifestations of epistemic and root modalities. For exam
ple, the proposition Juan viene mañana 'John is coming tomorrow' can be
modalized to express speaker uncertainty or prediction about the probability
of actualization of the event encoded in the proposition (henceforth p) by
using the modal verbs (henceforth m) poder or deber. Thus, Juan puede/debe
venir mañana, may be paraphrased, for instance, as 'Speaker believes that it is
possible for Juan to come/that it is very likely that Juan will come tomorrow'.
Marked with Future morphology, on the other hand, these modals may not
convey possibility, but rather dynamic or deontic modalities: Juan podrá/
deberá venir mañana 'Juan will be able to/will have to come tomorrow'.
I distinguish between epistemic (EP) and root possibility (RP) contextual
meanings (cf. Coates 1983). Epistemic and root possibility are concerned with
the speaker's assessment of or assumptions about the likelihood that the
content of a proposition is or may become true. EP and RP differ in that the
former involves the speaker in logical inference and has p and m in its scope,
while the latter has only p in its scope and is agent/event oriented, i.e., the
subject of the modalized infinitive is normally a willful agent and/or the
infinitive is a dynamic verb.9 This difference has consequences for negation,
such that negation affects p in the case of epistemic sense, but m and p in the
case of root sense. When an affirmative modal (example 9) interpreted to
convey RP is negated, the negative form is interpreted as a negative fact
(example 10).10 To preserve the possibility interpretation, therefore, root
modality must be negated on p (example 11), while epistemic modality may
be negated on m, as example (12) shows.
Interpretation of 'poder' and 'deber' 75
3. Poder
I assume that the formula proposed by Perkins (1982) for can captures the
meaning of poder. K (C does not preclude X). This formula incorporates both
the IM and the elements that contribute to the CM. Thus, poder presents the
IM 'does not preclude' and it relates actualization of X to a set of circum
stances C, such that with reference to a set of principles K, the circumstances
C do not preclude X. The values of K and C are recognized (and specified by
the analyst) in context; these variables (which include morphosyntactic, se
mantic and prosodic features) plus the modal chosen determine whether X is
interpreted under a dynamic, deontic or epistemic modality (Heine (this
volume) proposes similar variables in his stimulating account of German
modals).
Interestingly, the specific semantic contribution proposed for poder,
'does not preclude', appears to agree with the meaning postulated for can by
Interpretation of 'poder' and 'deber' 11
Bybee, Perkins & Pagliuca (1994: Ch. 6), i.e., 'enabling conditions exist', as
well as with the meaning suggested by Sweetser (1982) in terms of the
presence or absence of 'barriers'. The question might arise why I agree with
Perkins in assigning a negative meaning to poder. This is because it seems to
me that a negative meaning captures more appropriately than a positive
meaning (such as 'enables' or 'makes possible') the sense of 'difficulty
overcome' which appears to be incorporated in most of the CMs of poder.
This is taken up again later in this essay.
Example 16 illustrates how the definition of poder would lead to an
interpretation of ability assuming certain values for K and C. Different values
for K and C would obviously lead to different interpretations.
Pottier (1976) discusses the same three meanings for pouvoir 'can' in
terms of 'pouvoir physique/possibilité interne', 'pouvoir par autorisation/
possibilité externe', and 'probabilité/possibilité neutre' (pp.39-41). In addi
tion, Palmer (1977:2) observes that epistemic modality should be seen as a
more basic concept "since what is (epistemically) possible includes what can
(dynamically) be done". This led me to think, for a while, that the IM of poder
should explicitly include 'possibility'. It became obvious to me later, how
ever, that 'possibility' is a CM which may be derived from 'does not pre
clude', since not precluding X in fact implies (though not equivalently)
making X possible.
The data examined indicate that examples of poder in Present tense affirma
tive form group into five sets of CMs (or "great classes of messages"),
discussed below, with the following readings: permission, ability, mitigation,
root possibility, and epistemic possibility. It is in principle possible, however,
that other sets of CMs might be identified in a much larger corpus of data.
It must be emphasized that many cases are not easy to fit neatly into one
of these five CMs. The indeterminacy of many cases may be viewed as
stemming from possible indeterminacies of K and C in natural language use,
e.g. the strength of the deontic source, weak indication of evidentiality.
Analogous observations made by Coates in regard to the meaning of English
modals led her to develop a prototype model of analysis. As in her English
data, my data contain a number of examples with contexts which appear to
represent the core set of features associated with a certain CM, but many
examples fall somewhere in the skirt or towards the periphery of the set.
Let us look now at examples of poder in contexts whose features have led
me to propose five different modalities (sets of CMs).
3.1.1 Permission.
Whenever a strong deontic source, i.e. some person or institution that clearly
creates obligation or permission (Lyons 1977:843), is identified either explic
itly or implicitly, the CM inferred is permission, as in example (14). The most
clearly identifiable context for permission, then, includes a strong deontic
source, an agentive subject, and future time orientation. The transcribed
Interpretation of 'poder' and 'deber' 79
(18) Creo que ésta es una democracia y que uno puede hacer lo que
quiera. (M25,p.6)
T think that this is a democracy and one can do what one wants.'
In example (18) the deontic source is an abstract entity, 'democracy' (as
opposed to Franco's dictatorship), and X is not a specific event. In addition,
time orientation is generic. Given these values for C and X, example (18) may
lend itself to an interpretation of root possibility as well as permission.
That these inferred meanings are contextually determined is sup
ported by the fact that a change in the environment of the modal has conse
quences on its interpretation, as shown in (19).
80 Carmen Silva-Corvalán
3.1.2 Ability.
The IM 'does not preclude' is interpreted as 'ability' when the following
contextual factors obtain: (a) K: natural or rational principles that permit us to
infer either innate or learned capacities to do z. (b) C: (i) the speaker has
evidence that X has been previously actualized, or that at least z has been, and
the circumstances are such that z may be valid also for a 'new' y; (ii) y is
animate, agentive, and specific; z is an event; core time orientation is generic
or present; if y is 'new', it is future.
Examples (20) and (21) illustrate contextual conditions for the CM 'ability':
(20) Ella os puede decir todo, todo referente a la droga. (M1 l,p39)
'She can tell you everything, everything about drugs.'
(21) El árabe y el ruso, pues, no los ha llegado a hablar, pero vamos,
para conversación puede defenderse. (M26)
'Arabic and Russian, well, she hasn't managed to speak them, but,
in conversation she can get by.'
Note that saber 'to know' may also convey 'ability for z', as in Yo sé
nadar 'I know how to swim'. Saber, however, simply asserts knowledge or
ability for; it does not implicate precluding circumstances as poder does.
Therefore, saber is not compatible with circumstances that imply that X is
somewhat surprising or that it involves effort. Thus, when y performs an event
that involves effort, or the overcoming of a given barrier, or a certain degree
of difficulty, p may be modalized with poder (ex. 22) but not with saber
(example. 23):
(22) Juan puede nadar mariposa por horas.
'John can swim the butterfly (stroke) for hours.'
(23) *Juan sabe nadar mariposa por horas.
*'John knows how to swim the butterfly (stroke) for hours.'
Interpretation of 'poder' and 'deber' 81
3.1.3 Mitigation.
(32) ...que si hoy se cierra el plazo, dentro de tres meses ya puede ser el
examen. (M5,135)
'...if the deadline is today, then the exam may be within three
months.'
CM: Based on previous experience, given that today is the deadline to
register for the exam, I infer that it is possible that the exam may be
given in three months.
(33) Puede ser ésa una diferencia respecto a otros regímenes tota
litarios. (M5,250)
'That may be a difference with respect to other totalitarian re
gimes.'
In the case of epistemic possibility, K includes rational laws (people can
make predictions, deduce, infer, etc.). As for C, lack of evidentiality and
stative aspect are indeed crucial. If evidentiality exists, an example may then
be interpreted as conveying RP, as in (34) and (35):
(34) No es un clima muy estándar. Efectivamente, te puede llover y al
día siguiente hacer un sol espléndido. (M6,p2)
'It's not an even climate. In fact, it can rain one day and the
following day the sun may be shining brightly'
CM'. I have evidence that the climate is not even. I may infer with
confidence that it is possible for it to rain one day and the following
day be sunny.
(35) Luisa puede ser la Decana de Filosofía, porque ha renunciado a su
otro cargo.
'Louise can be the Dean of Philosophy because she has resigned
her other job.'
CM: Now that she has resigned her other job, I infer that it is possible for
Louise to be the Dean of Philosophy.
84 Carmen Silva- Corvalán
3.2 Conclusion
The data examined indicate that the most frequent CM of poder when used in
Present tense form is root possibility (58 of 110 cases studied). Ability, EP,
and mitigation account for 9%, 8%, and 4%, respectively. Furthermore, 9
Interpretation of 'poder' and 'deber' 85
examples are indeterminate between ability and RP, and 20 examples are
indeterminate between permission and RP.
Two further points must be emphasized: (1) Some of the 58 cases of RP
are in the periphery of this CM, close to readings of permission or ability; and
(2) The nature of the data (conversational discourse) must have consequences
on the frequency of occurrence of the CMs identified, such that other dis
course genres would most likely yield different percentages of distribution of
CMs.
It is clear that we are dealing with intersecting contextual sets. On the
other hand, it is not clear to me that a strict prototype approach is the most
appropriate one to account for the facts. Indeed, a prototype model does not
acknowledge the relevance or validity of necessary or sufficient conditions in
the interpretation of CMs. Nevertheless, there do appear to be certain neces
sary contextual features in every CM, and there are cases when the necessary
conditions are also sufficient to identify a CM as, for instance, 'permission': a
specific strong deontic source, a specific agent, direct speech, a dynamic
situation, positive consequences of p (ex. 40); or 'epistemic possibility':
non-agentive, non-intentional 'doer', stative situation, negative consequences
of p (ex. 41).
4. Deber
(44) Tener un coche allí [en USA] debe ser algo alucinante. (M25,p5)
'To have a car there [in the USA] must be out of this world.'
IM: K (C requires X)
(i) K: rational laws (capacity to infer from available evidence)
(ii) C: (a) absence of deontic source
(b) preceding discourse gives some indication that p is true
(c) y = inanimate subject; z = stative; time orientation: generic
(iii) X: having a car in the USA is out of this world
CM: given humans' rational capacity to infer the likelihood of p
from available evidence, I interpret that the speaker intends to
convey the message that y ('to have a car in the USA') is very
probably z (' be out of this world'). (Epistemic possibility
sense).
By postulating for deber a core meaning of 'requirement of X', it is
possible to account for such CMs as obligation, advice, and probability,
depending on whether a number of contextual circumstances prompt a read
ing of more or less forcible requirement. These circumstances include: (1)
Animacy and agentivity of y; (2) Nature of the deontic source, if any; (3)
Speaker approving or disapproving attitude towards p; (4) Adverse or positive
consequences of p.
Present tense, but appears to be more stable in other tenses (see Section 5 on
tense and modality). Deber de occurs in only 4 of the 24 examples of
epistemic possibility (see example 45). Furthermore, two examples with
deber de in the Imperfect Indicative do not convey possibility but rather
advice or obligation, as shown in (46).
(45) A: Es que se para [el coche] y luego-
B: xxx (incomprehensible)
A: Sí. Es que debe de estar estropeado el freno. (M24,p27)
A: 'It's that it stalls [the car] and then-'
B: xxx (unintelligible)
A: 'Yes. It's that the brake must be broken.'
(46) Yo exactamente no lo sé. Tendría que ser [el referendum] a prime
ros del año que viene o algo así. Debían de solucionarlo antes de
terminar la jefatura de ellos [los socialistas] .(M24,p29)
'I don't know exactly. It [the referendum] would have to be at the
beginning of next year or something like that. They [the socialists]
should resolve this question before the end of their term in office.'
Deber is more strictly modal in its semantics than poder in that it is in all
contexts non-factual, i.e., propositions modalized by deber cannot be inter
preted as 'actualized' ,15 This semantic difference accounts for the fact that one
of deber's CMs is EP, but not RP, which is related to ability and evidentiality.
In its EP contexts, deber may be preceded by negation but is not affected by it.
Examples (47) a-c are illustrative (see note 11).
(47) a. El alto debe ser Pepe.
b. El alto no debe ser Pepe.
c. *El alto debe no ser Pepe.
a. T h e tall one must be Pepe.'
b. 'The tall one can't be Pepe.'
c. *'The tall one must not be Pepe.'
The core contextual features associated with epistemic possibility are as
follows:
(i) K: rational laws
(ii) C: (a) evidence that p may be true
(b) absence of deontic source
Interpretation of 'poder' and 'deber' 89
4.1.2 Mitigation
Changes in the contextual circumstances of deber in (48b): animate and
agentive subject, a possible though weak deontic source, strong evidence that
p is true, adverse consequences of not-p, bring about a CM which is indeter
minate between high likelihood, categorical necessity or requirement, and
mitigation.
Note that example (48) illustrates intriguing uses of deber in utterances
which repeat the content of an immediately preceding non-modalized p. This
appears to indicate that in certain cases deber is used to convey a degree of
speaker uncertainty in the truth of p as a pragmatic strategy for mitigating the
degree of assertiveness of the utterance. The communicative purposes which
motivate the use of this strategy need to be further investigated. The choice of
90 Carmen Silva-Corvalán
(52) a. *Ese tío va tan lento que no tiene que sentirse muy bien.
b. *Ese tío va tan lento que tiene que no sentirse muy bien.
a. 'That guy's going so slowly that he doesn't have to be feeling
well'
b. 'That guy's going so slowly that he has to not be feeling well'
4.2 Summary
This section discusses the co-occurrence of poder and deber with verbal
markers of tense, mood and aspect other than 'Present tense'. Proposing IMs
for these verbal affixes is beyond the scope of this study. Therefore, my
analysis will consider only what appears to be their contextual contribution.
I have observed in the introductory section that EP and RP are not possible
CMs when the modals are in Infinitive form (ex. 53). This restriction extends
to the Present Participle (ex. 54). Furthermore, neither modal is interpretable
in the Imperative (ex. 55). Thus, one might say that, like English modals,
poder and deber are 'defective' in this respect: they lack CMs of possibility in
the Infinitive and the Present Participle, and lack the Imperative altogether. In
the Infinitive (53) and the Present Participle (54) only root modality CMs are
inferred (e.g. obligation, ability). With regard to the Past Participle, the
behavior of poder and deber differs: while deber appears to retain many of its
possible CMs, including EP (56), poder in the Past Participle may convey
ability (57), but cannot be interpreted to convey RP (58) nor permission.
92 Carmen Silva-Corvalán
example (66), which is counter-factual; the Preterite suffix indicates that this
is root possibility in the past.
I have proposed that the IM of poder implies that a certain amount of
effort must be expended in 'not precluding X'. The relation between this IM
and the CM of (66) is, then, easy to establish: speakers do not assume that
entities may spend effort in the actualization of an event whose consequences
are adverse, so the only possible reading for (66) is of non-actualization or
non-factuality. Thus, this CM does not appear to be fuzzy, but rather a
category with clear boundaries defined by three features: adverse conse
quences of p, speaker's disapproval of p, Preterite morphology. This analysis
is confirmed by the acceptability of (67) and (68 a), and the unacceptability
(under 'normal circumstances') of (68 b).
bility (unless, of course, the speaker had wanted to become extremely poor,
but I would not consider this to be a 'normal social circumstance').
(71) Pudimos quedarnos en Inglaterra.
'We could stay in England.'
(72) Pudimos quedarnos en la miseria. [La situación estaba muy mala.]
could-we stay-us in the poverty
'We could've gone broke.' [The situation was bad.]
In its actual, more extended context of occurrence, example (71) in fact
conveys root possibility: staying in England was possible but not actualized.
However, this has to be made explicit by the speaker (pero él no quiso 'but he
didn't want to'), as shown in the actual utterance (73).
(73) Pero pudimos quedarnos en Inglaterra, porque le ofrecieron una
permanencia, pero él no quiso. (M10,p4)
'But we could have stayed (lit: could stay) in England, because
they offered him a permanent position, but he didn't want to.'
Otherwise, possibility with Preterite poder may be expressed if the modal
is combined with the Perfect Infinitive, as in English, such that example (74)
does not then require any further context to be interpreted as a possibility
which existed in the past, i.e. as counterfactual in the present.18
(74) Pudimos habernos quedado.
'We could have stayed.'
There are no examples with Preterite poder plus Perfect Infinitive in the
data from Madrid. I note, however, that in my variety of Chilean Spanish, this
tense combination is required in examples (66), (70), (72), & (73). My
intuitions in this respect are confirmed by the responses of four native speak
ers from Chile and Argentina to an elicitation test given to determine the
meanings associated with poder and deber in various tense forms. This may
reflect a difference between Madrid and Latin American Spanish, but conver
sational data from this latter variety must be examined before any further
proposals can be made in this respect.
The Perfect Infinitive does occur, however, with Present, Imperfect, and
Conditional forms of poder. All examples have past time orientation and
stress the non-factuality or hypotheticality of p. This higher degree of hypotheti-
cality becomes more evident through a comparison of examples (60) and (75).
Interpretation of 'poder' and 'deber' 97
There are no occurrences of deber with future morphology in the data, and
only one of poder, with CM permission:
(82) Entonces esa persona que quiera abortar - podrá abortar siempre
que esto lo diga en las primeras semanas de su embarazo. (M10,p7)
'So a person who may want to have an abortion - will be able to
have one provided she requests it during the first weeks of preg
nancy.'
The translations provided for examples (83) a-b are intended to capture
the fact that poder, because it incorporates the notion of effort, makes p more
hypothetical than when it is modalized simply by Future morphology.
By contrast, deber in the Future does not appear to be compatible with an
epistemic modality context, as I show in (84). Only root modality CMs may
be inferred for deber with Future morphology (85).
(84) a. ?Si tú lo dices, deberá ser cierto.
b. Si tú lo dices, será cierto.
'If you say so, it must-fut be true.'
(85) La inauguración deberá ser mañana.
'The inauguration will have to be tomorrow'
There are 6 examples with poder in Conditional form; they all occur in
root possibility contexts. Indeed, since Conditional morphology conveys very
weak assertiveness, it is not surprising that in every case it communicates
probability of p, regardless of the context. That is, non-modal verbs modal
ized by Conditional morphology are interpreted to refer to hypothetical events
as well. Example (86) illustrates.
(86) Podrían hacerme una virguería - se suele decir una virguería, o
sea una cosa bonita -, arreglarme los dientes, quedarían perfectos.
(Mll,p27)
'They could do me a 'virguería' - they usually say 'virguería', you
know, something pretty -, fix my teeth, they would look perfect.'
On the other hand, deber in the Conditional may receive a deontic or an
epistemic interpretation depending on the same contextual features estab
lished for Present deber. The contribution of Conditional morphology is that
of adding weak speaker assertiveness which, in a deontic context, creates a
CM of mitigated obligation or advice (ex. 87), and in an epistemic context, a
CM of weaker confidence in the truth of p (ex. 88).20
(87) El gobierno debería dar una explicación. (M5,p21)
'The government should give an explanation.'
(88) No vamos a saber exactamente las consecuencias, ¿no? [de entrar
al Mercado Común Europeo] En principio, debería de ser bueno.
(M18,pl5)
'We won't know exactly what the consequences will be, right? [of
entering the Common Market] In theory, it should be good.'
100 Carmen Silva-Corvalán
5. Conclusions
(90) a. That place must make a good profit. They'll close it down if it
doesn't.
Contextual modification of (89) and (90) leads to indeterminacy between
root and epistemic interpretations of (89a) and (90a), i.e. to fuzzy CM of must,
Admittedly, Coates's analysis may in essence be the same as mine. Yet, it
seems to me more appropriate to acknowledge explicitly that certain forms at
least do not have multiple meanings. Rather, they have basic meanings which
make them compatible with a number of contexts; this form-context interac
tion may bring about different interpretations of the meaning of the form,
which I have referred to as CMs.
In regard to the questions posed at the start, I hope to have shown that a
monosemantic approach which goes beyond a level of IM and studies linguis
tic forms in specific contexts of use adequately captures the relation between
IM and CMs of modals, and indeed reflects this interrelatedness more appro
priately than a polysemantic analysis, which does not necessarily imply
affinity between various meanings. Given my view of language as a system of
communication and a particular instance of human behavior, I have sought
answers to linguistic questions within and beyond the specific linguistic
system under examination. Thus, crucial external factors which have rel
evance to the interpretation of modalized utterances have been identified:
identification and strength of deontic sources, evidentiality, negative conse
quences of propositions (which are likely to be culture specific), and speaker
attitude towards the content of propositions. This appears to be conclusive
evidence in support of a non-autonomous semantic approach to the study of
meaning.
NOTES
* I would like to thank Joan Bybee, Jenny Coates and Erica García for their extensive
comments and insightful criticism of an earlier version of this paper. My thanks as well
to Franco D'Introno, Wolfgang Klein, and Michael Perkins, who have contributed
helpful observations. Improvements in this present version also owe much to Erica, and,
likewise, to Flora Klein-Andreu, Roger Wright and especially to the editors of this
volume. I am grateful to all of them for their careful reading of my work and for their
penetrating comments. Errors and misinterpretations remain my sole responsibility.
1 Grammars note a difference between deber 'must' (obligation) and deber de 'must'
(possibility), but at the same time acknowledge that this opposition is being lost as the
form without the preposition de has extended to contexts of epistemic modality. I refer to
this change later.
102 Carmen Silva-Corvalán
I do not examine here the transitive verb deber 'to owe' (Te debo $1 'I owe you
$1'), because at this synchronic stage of the language it clearly is a different lexical item
from modal deber.
2 The corpus of spoken data consists of approximately six hours of recorded conversations
with seven speakers from Madrid. The data were collected by me and a group of
graduate students participating in a seminar which I conducted in Madrid in the summer
of 1985.1 would like to thank my students for sharing their data with me. Included in this
study are three adult women and four adult men. They are all middle class 'madrileños',
except one man (M9), who works as a handyman in one of the Colegios Mayores of the
University.
3 Halliday refers to these as modulations and notes that though modalities and modula
tions are not identical, they are semantically related, such that their partial reduction to a
single network "expresses the closeness of fit of the two systems" (347)] i.e. when they
do not convey the meaning of probability.
4 Haegeman deals only with will, and Kratzer with can and must, while Perkins examines
a wide range of modal expressions.
5 Coates includes the deontic and dynamic categories of modal logic in a root category,
and epistemic and alethic modalities in an epistemic one.
6 This analysis is similar to Lyons's (1977:843) proposal that the identification of a
deontic source (an entity with authority) is a prerequisite to the interpretation of a modal
as conveying obligation or necessity.
7 In his analysis of poder 'can, may', for instance, Narbona (1989:88) appears to identify
two verbs poder, a modal that does not occur with a compound Infinitive (no example is
offered), and a non-modal with the meaning of possibility that does admit this construc
tion (Puede habérselo prestado '(He) may have lent it to him). It seems to me that
Narbona's proposition that there is a dichotomous modal/non-modal opposition identi
fied by the possibility of co-occurrence with a compound Infinitive is not correct;
suggesting the existence of two verbs poder would force him to create at least two lexical
entries for this verb, a solution that he criticizes elsewhere in his work. Pottier (1976:39)
adopts a similar position when he argues that pouvoir 'can, may' and devoir 'must'
clearly represent a case of multiple polysemy. Likewise, Huot (1974, cited in Thibault
1991) states that there are two different auxiliary verbs devoir in French.
8 There is at least one other type of meaning which is also conveyed linguistically, namely
social meaning. I am not concerned with social meaning here.
9 'Agent-oriented' is here used in a similar, though not identical, way in which Bybee &
Pagliuca (1985) have used the term. Indeed, they propose to refer to the notions of
ability, obligation, desire and intention as 'agent-oriented modalities' because they
"predicate conditions of either an internal or external nature on a willful agent" (p. 63).
10 Miller and Kwilosz (1981) investigate the interaction of modality and negation in
English through a study of speakers' judgements of the meaning of sentences. Their
experiments show that "There is apparently a tendency for verbs of necessity to take de
re negation [i.e. only p is negated], and for verbs of possibility to take de dicto negation
[i.e. m and p are negated], but there are exceptions in both cases." This tendency also
seems to apply to poder and deber, but I do not examine this question in depth here.
Interpretation of 'poder' and 'deber' 103
11 For the EP reading to obtain, the Infinitive must receive stronger stress than the modal.
12 Perkins's definition is in turn modelled on Miller's (1978) discussion of Wertheimer's
(1972) characterization of the meaning of OUGHT.
13 Information given in parentheses identifies the tape and either page of the transcription
or tape counter number where example is located. Examples without this information are
made up by the author.
14 By 'new' y I mean a subject referent for whom the speaker infers that z is valid, as in M.
va a poder caminar después de la operación 'M. will be able to walk after the operation',
on the basis of the speaker's previous evidence that z has been valid for a different
subject referent under similar circumstances.
15 Compare poder and deber in the Preterite: Pudo cruzar el río 'He was able to [and did]
cross the river' (situation occurred in the past); Debió cruzar el río 'He must have
crossed the river' (the situation most likely occurred in the past).
16 Moliner (1981), for instance, notes that tener que may express necessity, obligation, and
strong resolution, but does not include 'possibility.'
17 The perfectivity of the Future (-rá) in Spanish is debatable, but there is evidence that
supports this analysis whenever this form fulfills its 'tense' function (Silva-Corvalán
1991). By contrast, in contexts where the Future is used to express hypotheticality in the
present, its aspect seems to be Imperfective. Note, however, that the expression of
hypotheticality is subject to some intriguing restrictions with dynamic verbs, as illus
trated in (i) and (ii):
(i) A: ¿Por qué no vino Pepe?
'Why didn't Pepe come?'
B: Estará enfermo.
'He may be/is probably ill'
(ii) A: ¿Por qué no está Pepe aquí?
'Why isn't Pepe here?'
B: *Trabajará hoy.
*'He'll work today.'
Estará trabajando hoy.
'He may be working today.'
18 The Perfect Infinitive by itself expresses conditionality (counterfactual protasis) in
certain special constructions of the type of (i) B:
(i) A: Me habría gustado ir contigo a Chile.
B: Haberlo dicho antes, te habría invitado.
have-it said before,
'Had you said it before, I would've invited you.'
19 This fact poses a challenging question in regard to the process of loss of deber de from
the system. The observation that past morphology appears to favor its retention, espe
cially with non-stative verbs, needs to be examined statistically in a larger corpus of
data. See, also, examples 46, 87, and 88, which give evidence of insecurity in the use of
deber de.
20 Note that in example 88 the speaker uses deber de, but in Chilean Spanish deber alone
would also be interpreted to convey EP.
104 Carmen Silva-Corvalán
REFERENCES
Bello, Andrés. 1977. Gramática de la lengua castellana (With notes from Rufino J.
Cuervo). 10th edition. Buenos Aires: Sopena.
Bosch, Peter. 1985. Lexical Meaning Contextualized. Meaning and the Lexicon.
ed. by G.Hoppenbrouwers, P. Seuren & A. Weijters, 251-258. Dordrecht: Foris.
Bull, William E. 1971. Time, Tense, and the Verb. 4th printing. Berkeley: University of
California Press.
Bybee, Joan. 1985. Morphology. Amsterdam: John Benjamins.
Bybee, Joan. (This volume). "The Semantic Development of Past Tense Modals in
English."
Bybee, Joan & William Pagliuca. 1985. "Cross-Linguistic Comparison and the Develop
ment of Grammatical Meaning." Historical Semantics. Historical Word Formation,
ed. by Jacek Fisiak, 59-83. Berlin: Mouton.
Bybee, Joan, Revere Perkins & William Pagliuca. 1994. The Evolution of Grammar:
Tense, Aspect and Modality in the Languages of the World. Chicago: University of
Chicago Press.
Coates, Jennifer. 1983. The Semantics of the Modal Auxiliaries. London: Croom Helm.
Fleischman, Suzanne. 1989. "Temporal Distance: A Basic Linguistic Metaphor." Studies
in Language 13.1-50.
García, Erica C. 1975. The Role of Theory in Linguistic Analysis: The Spanish Pronoun
System. Amsterdam: North- Holland.
Gili Gaya, Samuel. 1976, Curso superior de sintaxis española. Barcelona: Biblograf.
Givón, Talmy. 1982. "Evidentiality and Epistemic Space." Studies in Language 6.23-49
Haegeman, Liliane M.V. 1983. The Semantics of Will in Present-day British English:
A Unified Account. Brussels: Verhandeling Letteren, 45.103.
Halliday, M.A.K. 1970. "Functional Diversity in Language as Seen from a Consideration
of Modality and Mood in English. Foundations of Language 6.322-361.
Heine, Berndt. (This volume). "Agent-Oriented vs. Epistemic Modality: Some Observa
tions on German Modals."
Hernández, César. 1986. Gramática funcional del español. Madrid: Gredos
King, Larry D. 1992. The Semantic Structure of Spanish. Amsterdam: John Benjamins.
Klein-Andreu, Flora. 1986. "Speaker-Based and Reference-Based Factors in Language:
Non-past Conditional Sentences in Spanish." Studies in Romance Linguistics, ed. by
Osvaldo Jaeggli, & Carmen Silva-Corvalán, 99-119. Amsterdam: Foris.
Kratzer, Angelika. 1977. "What 'Must' and 'Can' Must and Can Mean. Linguistics and
Philosophy 1.337-355.
Lyons, John. 1977. Semantics. Cambridge: Cambridge University Press.
Marcos Marín, Francisco. 1975. Aproximación a la gramática española. 3rd edition.
Madrid: Cincel.
Miller, George A. 1978. "Semantic Relations Among Words." Linguistic Theory and
Psychological Reality, ed. by M. Halle, J. Bresnan, & G.A. Miller, 60-118. Cam
bridge, Mass.: MIT Press.
Miller, George A., & Donna M. Kwilosz. 1981. "Interactions of Modality and Negation in
English." Elements of Discourse Understanding, ed. by Aravind Joshi, Bonnie
Webber, & Ivan Sag, 201-216. Cambridge: Cambridge University Press.
Interpretation of 'poder' and 'deber' 105
Edith L. Bavin
La Trobe University
1. Introduction
2.1 Introduction
There are five languages in the Western Nilotic group: Acholi, Lango,
Dhopadhola, Alur and DhoLuo. With the exception of DhoLuo, the languages
are spoken in Uganda, with Acholi extending into the Sudan. DhoLuo is
spoken in Western Kenya, so it is separated physically from the other four
The Obligation in Western Nilotic 109
(6) Lango
a. (An) a-ngweco
I l:SG-run
'I ran.'
b. A-tye a-ngweco
l:SG-be l:SG-run
'I am running.'
c. Myero onwongo dong a-ngwec oko
must PI then l:SG-run already
T should have run.'
(7) Alur
Awiya omako o-dik-i i-paco tin
children must 3:SG-return-SUBJ LOC-home today
'The children must go home today.'
112 Edith L. Bavin
(8) Lango
Onwongo gin myero o-bol odilo
PI they must 3:SG-throw ball
They would have thrown the ball.'
(9) a. Lango
Te bedo kobo atin gwok-ere ca ni pe
NARR stay talk child dog-POSS DET COMP NEG
dong myero gwee
then must bark
'He then told his puppy that he mustn't bark.'
b. Lango
Onwongo o-punyo-wa ni pe myero o-mede
PI 3:PL-teach-1:PL COMP not must 1:PL-continue
kede kodi luwenyi
with way fighting
'We were being taught that we shouldn't fight.'
c. Acholi
Myero pe o-bol odilo
Must NEG 3:SG-throw ball
'He shouldn't have thrown the ball.'
Another property of the obligation construction concerns the shape of the
lexical verb following the particle. In all five languages, the Subjunctive form
of the verb is used. Most indicative verbs in the Western Nilotic languages are
disyllabic and end with a final vowel o (e.g. from Acholi and Lango: neno
'see', miyo 'give', ryemo 'chase', keto 'put, bino 'come', camo 'eat', mito
'want', ringo 'run'). There is also a small set of mono-syllabic verbs in each
language (e.g. from Acholi: coo 'wake up', aa start from', and oo 'arrive').
For the disyllabic verbs, the Subjunctive form that follows (o)myero is formed
by omitting the final vowel (e.g. nen from neno 'see'), and for some verbs a
final vowel i replaces the o. Driberg (1923) discussing Lango, and Crazzolara
(1955) discussing Acholi, specify that the Subjunctive verbs which add a final
i in place of the o are phonologically determined; however this is no longer
true as speakers now use the suffix i only on motion verbs, or not at all. 3 For
The Obligation in Western Nilotic 113
example, Acholi speakers vary as to whether they use i on the verb ngwec
'run' and dok 'return', but they do not use a final i on non-motion verbs.
The Subjunctive form also follows wek 'let, allow', and gwok 'prohibi
tion'. Crazzolara (1955:17) describes its function as hortative, and lists the
following meanings for the form: wish, invitation, exhortation, command,
prohibition. However, there are other uses. For example, in Lango and Acholi,
nen is used with the meaning 'to be seen'. The form is also used for the
imperative; in this function, also, there is variability in the use of the suffix i
(e.g. ngweci or ngwec 'run!'), although more Lango speakers than Acholi tend
to use the suffix.
Note that in (10) the Acholi speaker uses a final vowel i on the verb
following myero; he also produced (11), in which bed does not have a suffix.
A few verbs in each language are idiosyncratic since they always appear in the
Reflexive form, no matter what the context (e.g. medde 'continue, which is
illustrated in (12)).
(10) Myero a-ngwec-i
must l:sG-run-suBJ
T must run.'
(11) Myero a-bed piny
must l:SG-sit down
'I must sit down.'
(12) Myero a-med-de i-kwan iyonge mwaka aryo
should l:SG-continue-REF with-study after year two
'I must continue studying in two years/I intend to continue study
ing in two years.'
In examples (l)-(5), the lexical verbs following the Obligation markers
are ngol 'cross' and nen 'see' from Lango, tim 'do' and wil 'buy' from Acholi,
cithi 'go' and bed 'stay' from Alur, chiem 'eat' and kony 'help' from Dholuo,
and woth 'go' from Dhopadhola. These are all Subjunctive forms of the
respective verbs, and all have a Subject Prefix. In contrast, verb complements
of equi-verbs do not appear in the Subjunctive form, and it is the matrix verb
that carries the Subject Prefix, not the complement. For example, amito neno
means 'I want to see' in both Acholi and Lango. Mito has the Subject Prefix,
not neno. There are two modal verbs in Lango and Acholi {romo 'can', and
twero 'to be able') which behave like equi-verbs; they carry the Subject Prefix
114 Edith L. Bavin
and are followed by the Infinitive form of the verb, not the Subjunctive. These
properties are illustrated with romo in (13).
(13) Acholi
Ka i-tye i-peko i-romo penyo ngati-mo
if 2:SG-be with-problem 2:SG-can ask person-some
'If you have a problem, you can ask anyone.'
The position of the Negative pe shows up another difference between a
complement-taking verb and the modal particle (o)myero. As illustrated
above, pe usually appears between myero and the lexical verb. However, pe
precedes an equi-verb or a modal verb, as in the second clause of (14).
(14) Acholi
Gu-yabo wang ot ka ngiyo ka kwene ma
3: PL-open eye house PROG look place where REL
ogwalpok tye iye; pe gi-romo neno-ne
frog be LOC NEG 3:PL-can see-3 SG
They opened the window looking for the frog to see where it was;
they couldn't see it.'
A change in interpretation is made in both Lango and Acholi when the
Past Imperfective marker precedes the obligation marker. For example, in
(15) the Acholi speaker is indicating that while there was some obligation for
them to play ball they did not, whereas in (16), he is indicating that because it
is the most likely situation, he believes they did play ball. That is, when the
aspect marker follows the obligation marker, there is an epistemic interpreta
tion. Other interpretations for the obligation constructions will be discussed in
Section 4.
(15) Gin onongo myero gu-tuk odilo4
They PI must 3:SG-play ball
They should/would have played ball.'
(16) Gin myero onongo gu-tuk odilo
They must PI 3:PL-play ball
They must have played ball.'
The Obligation in Western Nilotic 115
3. Lexical sources
3.1 Introduction
The source for the Dhopadhola obligation affix is not evident. Ripo is not a
lexical verb in the language group and it will not be considered further.
However, the lexical sources of the obligation markers in the other four
languages are evident.
(17) Lango
Oluk o-myer-e jobi
Oluk 3:SG-match-REF buffalo
'Oluk was a match for the buffalo.' [Driberg, 1923: 391]
In form, omyero is the Past Third Person Singular of the verb myero.
Given that Acholi uses omyero or myero, it is hypothesized that that both
Lango and Acholi started with omyero, but Lango has dropped the prefix. This
seems the most likely direction of change, since forms generally become
reduced in form as they become grammaticized (Bybee 1985).
Assuming the Third Singular Past form is the source for the obligation
particle, one change to be accounted for is the change in the function of the
prefix o. In omyero 'obligation', o does not refer to a particular entity, and
116 Edith L. Bavin
thus the form omyero is impersonal in interpretation. That is, a Third Singular
form with past reference was reinterpreted as impersonal and as an on-going
state or condition ('It is fitting'). As discussed by Bybee (this volume), a
change from past form to present interpretation is common for modals. The
impersonal sense of 'It is fitting' indicates that it is desirable for an existing
situation to be maintained, that is, certain conditions that hold are appropriate
according to the current views of the community, the authority that deter
mines what is fitting or necessary. To maintain the values of this society, a
person is obliged to conform to these views.
Since 'It is fitting/suitable' has a non-agentive, intransitive meaning, it is
feasible that the short form of myero was first used with the impersonal
meaning. Recall that a distinction is made for a few verbs in the languages
between a short (intransitive) form and a long, transitive form which is
marked with the o suffix. The distinction might have been more common in
earlier stages of the languages, with the o suffix being added to the short stem
of verbs to mark transitivity. Since this is the same form as the Third Person
Subject prefix o, the most likely development is that o was used to mark a
third Person Object but was reinterpreted as a marker of transitivity, and later
as part of the verb stem for all but a few verbs. However, this development
could have taken place prior to myero becoming used as a modal particle.
The impersonal form ('It is fitting') would have propositional scope with
the source of the modality being society in general. However, changes have
taken place to allow the form to become a clause-internal grammatical marker
which predicates the modality onto the agent of a particular activity. One
necessary change is that the impersonal verb takes a verbal complement, as in
'It is fitting to run'. In this way, the impersonal lexical item becomes part of a
larger construction.
In order to personalize an utterance containing omyero and a verbal
complement, the speaker must direct the message to or about a particular
individual. This would necessitate adding a Subject Prefix. Because Lango
and Acholi are pro-drop languages, an overt subject nominal is not required,
but the bound form is. The fact that the Subject Prefix did not replace o on
myero indicates the shift was already in progress, with the complement being
analyzed as the main predicate. Thus there was a change from a propositional
scope lexical modality to a specific predication of the modality onto an agent,
the subject of the new predicate.
There is no evidence for discrete stages in the shift, either semantic or
The Obligation in Western Nilotic 117
syntactic, and it is likely that there was synchronic variability as the changes
took place. Thus the following list does not represent distinct stages in the
path of change. Rather it illustrates the starting point and the current construc
tion. Myero started as a lexical verb but it can no longer be used as a lexical
verb, only as a modal particle in a clause which has an inflected predicate.
(18) o-myero 'He/she/it fits/matches X' (transitive)
(OR o-myer 'It fits' (intransitive))
omyero 'It is fitting/suitable/appropriate' (impersonal)
omyero + verb 'It is appropriate to V'
omyero prefix+V 'It is appropriate/ necessary for X to V'
One further development has occurred. The prefix on omyero serves no
purpose as a subject marker and has been dropped in Lango, and also by some
Acholi speakers. For some of those Acholi speakers who use both forms, the
forms have developed different functions. For example, one male who used
both myero and omyero explained the difference as follows: omyero has more
force, but with myero the speaker is just telling someone what they must do.
This suggests that omyero is more formal, and this would fit in with the
hypothesis that it is the older of the two forms. Lango, then, is slightly more
ahead in the grammaticalization of the obligation marker in that no speakers
have been found to use the o prefix. Lango also shows more change in the
forms of verb prefixes generally, since o is used for third singular and all
plural persons.
3.3 Alur
The source for the Alur particle omako is the lexical verb mako 'hold, catch'.
The verb form mako is currently a lexical verb with the meaning 'to hold,
catch' in all five languages under discussion. In addition to its active meaning
with an agent subject, it is also used in Acholi in expressions showing a
situation has arisen in which a non-animate is the cause of a current state. For
example, 'He is mad' is literally 'madness holds/ captures him'. In Alur, the
meaning has been extended to the current meaning of obligation.
What is of interest is the form of the verb. As with omyero, omako has the
o prefix, but because the prefix does not refer to a particular person but society
at large, it is an impersonal form. Similar developments seem to have oc
curred with this verb as with omyero in Acholi and Lango, in that a verbal
118 Edith L. Bavin
complement with Subject Prefix must have been added, and the form omako
reinterpreted as a clause-internal grammatical marker, which shifts from
having propositional scope to predicating the modality onto the subject of the
predicate. In its lexical use, mako takes a nominal complement; for example,
omako winyo means 'He/she caught/held a bird'. But the verb mako does not
take a verbal complement, as is possible with English 'caught'. Consider the
sentence 'He/she caught a fish swimming in the river'; here 'fish' could be
analyzed as the object of 'caught' as well as the subject of 'swimming'.
However, in the Western Nilotic languages, mako would not be used here; the
verb gwok 'constrain' would be used instead.
The fact that mako (as a lexical verb) does not take a verbal complement
suggests the personal meaning of omako was not used as the source for the
shift to a modal meaning. It is likely that an impersonal meaning developed
for omako, as with omyero, and that as the semantic changes took place so did
the syntactic. Just as with omyero, omako is a Past form of a verb, and its
impersonal meaning 'It was held' has been extended to non-past contexts.
While physical contact is part of meaning of the lexical verb mako,
'catch/hold', the modal meaning of omako implies social constraints, which
are the views held by society as to what is appropriate. For example, the literal
interpretation of (3b) is 'If you want food, it is held you stay with money'.
Omako here does not convey any sense of physical constraint or support;
however it does convey the sense of social constraint. That is, a concrete
lexical item has been used to express a more abstract notion of constraint. A
similar extension has been discussed for English 'have', which once meant 'to
hold in one's hand', the meaning has changed so that the notion of physical
presence of a possession has been lost, although one can have an abstract
possession (Bybee and Pagliuca (1985:72). In Alur, the meaning of omako
'hold' has been metaphorically extended so that one may hold ideas as well as
physical objects. However, the interpretation is not associated with the per
sonal form of the verb. Since it is the impersonal form which has been
grammaticalized with a modality function, it is through this form that the
semantic change has spread.
3.4 Dholuo
For DhoLuo, Stafford (1967) lists the meaning of the form nyaka as 'compul
sion/ necessity'. Huntingford (1959), Omondi (1982), Gregerson (1962), and
The Obligation in Western Nilotic 119
Malo (1952) also cite the form nyaka, although they differ in the senses listed.
While Huntingford enters meanings of 'must', 'from' and 'since', Omondi
gives only 'must'. The form nyaka is also currently used in DhoLuo with a
spatial meaning 'up to' and this is a cognate of naka 'up to' in Acholi as in
example (19).
(19) Acholi
Gu-ci-kwalo dyangi naka Patong o
3:PL-go-steal cows up to Patongo
'They went to steal cows as far as Patongo.'
In discussing Lango, not DhoLuo, Driberg (1923) lists nak as the stem
for 'to bear fruit' with nyal or nywalo as 'to be fruitful'. In the related
language, Dhopadhola, nyalo is currently used for the epistemic modality of
Possibility ('might'). While it is beyond the scope of this paper to speculate
the possible relationship between the Dhopadhola form for 'might' and the
DhuLuo form for 'must' and the relationship to a verb meaning 'to be
fruitful', it is conceivable that there is one.
While nyaka is the form generally used in DhoLuo to mark the modality
of obligation, there are alternatives. Malo (1952) gives onego, owinjore and
oromo. Omondi (1982) lists both onego and owinjore with the meaning
'should', while Stafford (1967) lists onego and owinjore as meaning both
'should' and 'ought'. For the Reflexive verb winjore, he gives the meaning 'It
is convenient, fitting'. Thus the lexical source for winjore is similar in
meaning to the source which has developed into the Lango and Acholi
markers of obligation.
Omondi argues that while nego and winjo in DhoLuo are interchange
able, with the meaning 'fits into', there must be a meaning difference since
only winjo can be substituted by romo 'to deserve'. Writing on Alur, Ringe
(1948) lists the meaning of romo as 'fit'. However, in Lango and Acholi,
romo does not have either meaning ('deserve' or 'fit'); it functions as a verb in
these languages with the meaning 'to be enough, sufficient, equal', and it is
also used as a marker of epistemic modality with the meaning 'can'. This is
not a function for romo in the other three languages in the group.
Although there are alternative forms for marking obligation, one consult
ant pointed out that he interprets nyaka as more 'forceful' than nego, and
winjore as the least strong, being used in the sense 'It is sensible, reasonable'
or, 'You the listener should see the sense of what I'm saying'.
120 Edith L. Bavin
In summary, there are some similarities across the languages in the meanings
of the lexical sources for the obligation markers, even though the forms are
different. These similarities support the notion of a conceptual basis for
diachronic changes. For example, omyero, the impersonal of 'to be fitting' is
the source for the obligation marker in Lango and Acholi. For DhoLuo,
Stafford gives the meaning for winjore, a weak obligation marker, as the
impersonal 'It is fitting', although Omondi lists 'fits into'. The lexical mean
ing is retained alongside the obligation meaning in Dholuo, unlike the situa
tion in Lango and Acholi, where the lexical meaning has been lost.
Another form used in DhoLuo as a marker of obligation, romo, is listed
by Ringe (1948) with the meaning of 'fit' for a related language, Alur. This
form has developed into the marker of another modality in Lango and Acholi,
the epistemic modality of Possibility which, in contrast to the agent-oriented
modalities, has scope over the whole proposition.
4.1 Introduction
In (11), the Lango speaker is tired and wants to sit, so her physical condition
motivates the use of myero; the form here is interpreted as Necessity, as it is in
(20). Necessity 'reports the existence of physical conditions compelling an
agent to complete the predicate action' (Bybee et al. 1994). In (20) the
children need to lift up their clothes so they don't get wet; the motivation to
lift them is created by an external factor, the physical conditions.
(20) Ma peya wa-oo i-school omyero wa-ngol kulu,
REL not yet l:PL-arrived LOC-school must l:PL-cross river
ma peya wa-ngolo kulu omyero wa-lung bongo-wa
REL not yet l:PL-cross river must l:PL-lift clothes-1:PL
'Before we reached the school we had to cross a river; before we
crossed the river we had to lift up our clothes.'
Examples (21) and (22), from Acholi, are extracts from a riddle. The
listener is told that a man has some obligation to get his things across the river.
He has to take three things, a lion, a chicken and some seed, but he can only
take one thing at a time because the boat is too small to take them all. If he
leaves the lion with the chicken, it will it eat the chicken, and if he leaves the
chicken with the seeds, it will eat the seeds. Thus the items must be trans
ported in a particular order. Here the physical conditions create the necessity
to take the items in a particular order.
(24) Acholi
In omyero i-cam mot
YOU(SG) must 2:sG-eat slowly
'You should eat slowly.'
(25) Edimacta lacen o-tito ni pic myero gi-gwok
headmaster elder 3:SG-say COMP schoolfees must 3:PL-keep
pien kare me cul pud tye anyim, wa i-nino
because time for school still be ahead, up to LOC day
dwe 27 me Janwari
month 27 for January
'The headmaster said that they must save their money because
school (and school fees) were due on January 27th.' [p. 12]
(26) A-mito ni myero Hi ki woru myero en
l:SG-want COMP must 2:SG-tell to father must he
o-gwok cente pic maber in i-bi-kelo ne i-nino
3:SG-keep money fees properly you 2:SG-FUT-putthisLOC-day
The Obligation in Western Nilotic 123
Sentences with (o)myero often express something about the attitude of the
speaker. In (lb), the Lango speaker wants the listener to convey a message to
someone else; she is really indicating that it would be more convenient for the
other person to travel than her. In the negative sentence (9a), the child's
decision is imposed on the dog; the child has heard a noise and wants to
investigate the source. And in (28), the speaker is letting us know her disap
pointment with the non-runner. She could have added the equivalent of 'So
why didn't you?'
(28) Lango
Myero onwongo i-ngweci
must PI 2:SG-run
'You (SG) should have run.'
Thus (o)myero) can be interpreted in many contexts as the speaker's
attitude to a particular situation or a desire for a particular outcome. It can also
be used to mark the intention of the speaker, as in (12). The intention reading
is most common when an (o)myero clause is embedded in a matrix clause with
the verb tamo 'think' and a First Person Singular subject, as in (29); the
interpretation is that the predication expressed in the subordinate clause is
what the speaker intends to do, not what he must do.
Example (29) is taken from a passage in which an Acholi man is talking
about his plans for the future. In terms of cultural expectations he should own
land. Because he does not have any land, he intends to acquire it. Compari-
124 Edith L. Bavin
sons can be made here with English 'have to'. As discussed in Bybee et al.
(1994), the obligation expression 'have to' may be interpreted as 'intention'.
Here omyero is interpretated as 'intention'.
(29) A-tamo ni omyero a-nong kaka-mo ma-ber
l:SG-think COMP must l:SG-find land-some REL-good
ma-ayela o-bi-bedo peke
REL-trouble 3-SG-FUT-stay not
'I intend to find some good land where there is no trouble (i.e. no war).'
In (30), the speaker's intention is conveyed with an embedded omyero
clause, but here the nominal tarn 'idea' is used in the matrix clause rather than
the verb tamo. The speaker produced the utterance in answer to a question as
to whether he would accompany an acquaintance to a social function.
(30) Onongo a-bedo ki tarn ni omyero a-cit ki jo
PI l:SG-stay with idea that must l:SG-go with people
ma a-bedo kwed-gi i-floor-na-ni
REL l:SG-stay with-3:PL LOC-floor-l:SG-DET
T think I should go with the people from my floor.'
(='I intend to go with the people from my floor, not you')
In Section 2, we discussed the fact that the speakers of Lango and Acholi
may vary the word order of the modality particle and Past Imperfective
marker to mark the speaker's commitment to the proposition. That is, the
order with the Modal Particle following the Aspect marker is used when the
speaker is commenting on the most likely outcome. In English, 'You must
have done it' could be used if the speaker sees evidence that it has been done
already, or if this is the most likely situation (as in 'The room must be empty').
With emphasis on 'must', it could express a sense of disbelief that you did not
do it. In contrast, 'You should have done it' implies you did not, so has a
counter-factual interpretation. In Lango and Acholi, the difference between
expressing what the most likely situation is and what should have been done
(but was not) is marked by the position of the Aspect marker, and listeners use
this variation in ordering to interpret the intended message. Thus linguistic
context affects the interpretation of the form (o)myero.
In (33), only the lexical meaning appears, but one sentence may include
both meanings, aspectual and lexical, as in (34) and (35). Note that nwongo
'find' in both (33) and (35) lacks a Subject Prefix because it follows the
narrative linker te; no verb following te has a Subject Prefix. (O)nwongo is
similar to omyero in that it always precedes the lexical verb. However, there is
a difference: while (o)myero has lost its lexical meaning, (o)nwongo has not.
Myero has lost its prefix for Lango speakers and also for some Acholi
speakers, but onwongo has retained the o prefix in Lango and its presence is
variable in Acholi.
(37) Alur
a. Omito a-tim matek
need 1 : SG-work hard
'I need to work hard'.
b. Omito nango a-cam
need PAST l:SG-eat
'I should have eaten.'
c. A-mito ringo
l:SG-want run
'I want to run.'
d. Omito a-ringo
need l:SG-run
'I need to run.'
The Obligation in Western Nilotic 129
In DhoLuo, dwaro is the lexical verb for 'want', not mito, and dwaro can
also be used to express Necessity, but it is not grammaticalized for this
function, as in Lango and Alur, in which the impersonal form is used. The
context alone determines whether the interpretation is 'want' or 'need' since
the verb carries a Subject Prefix for both interpretations, as illustrated in (a)
and (b); the prefix is determined by the person and number of the subject of
the predication. In (a), the speaker is not reporting what his friend desires; he
is giving his own view that his friend has few possessions and needs to obtain
a few more basic necessities to improve his quality of life. The speaker in (b)
is not reporting what he wants to do; he has to tell a story because I asked him
to. This was part of a session for which the speaker would be paid for his time;
he had indicated that he did not want to tell a story since he couldn't remember
one.
Note that with both senses (want and need) the verb can have either a
nominal or a verbal complement. In (38a), a nominal complement is used,
while (b) contains a verbal complement. In other contexts, the verb in each
sentence could be interpreted as 'want'.
(38) a. O-dwaro gik mangeny
3:SG-need thing many
'He needs many things.'
b. A-dwaro gano-nu sigana moro ka
l:SG-need tell-2:SG story certain here
T need to tell you this story.'
We can compare the use of English 'want' to express what someone
needs to do. For example, in 'You want to go to the next counter', which is a
response that could be made when someone asks for a particular service at the
wrong counter, the interpretation is that you need to go to the next counter.
Another context in which 'want' is used with the interpretation 'need to' is a
school report on which a teacher might write that a student wants to work
harder, meaning that the student needs to. Similarly, when someone says T
want a pen', it would be interpreted as 'I have need of a pen' in a context in
which that person is required to sign a form. It is context which determines
whether there are physical conditions present which lead to the 'need' inter
pretation of 'want', or whether the utterance expresses someone's desire and,
perhaps, intention (see Bybee et al. 1994, chapter 6).
130 Edith L. Bavin
7. Conclusions
speaking to a subordinate, or when social customs are being discussed that the
obligation interpretation is made. In some contexts, the form is interpreted as
a marker of Necessity or of Intention, and it can also have an epistemic
interpretation. Such synchronic variation is indicative of change in progress
with the obligation marker gradually taking on other functions as its meaning
generalizes.
Bybee et al. (1994) discuss four semantic notions in the set of agent-
oriented modality: obligation, necessity, ability and desire. In the Western
Nilotic languages there is a close connection between the notions of obliga
tion, necessity and desire in terms of the forms used to express them. How
ever, the forms used to express ability in the languages are distinct. In Lango
and Acholi, ability is currently expressed with a subject-inflected verb which
takes an Infinitive complement, as illustrated in (13) and 14), unlike the
obligation marker, which is derived from an impersonal construction.
As illustrated above, several paths of development in a language may
result in different forms having interpretations associated with agent-oriented
modality. The lexical verb for 'want' is also used to express necessity or
obligation in three of the languages. While two of the languages use the
impersonal form, DhoLuo uses a verb inflected with Subject Prefix, and it is
the context alone that determines whether the verb is to be interpreted as
desire or necessity.
Language-specific patterns can be detected in the paths of development.
For example, the grammaticalization of particles is more advanced in Lango,
in that both mito and myero lack prefixes, although the former is still a lexical
verb when used with Subject Prefixes while myero is not.
ABBREVIATIONS
NOTES
1 Unless otherwise stated, the data reported in this paper were collected from native
speakers of the languages. Many of the examples are taken from discourse, rather than
from elicited sentences. The data were collected in Uganda in 1992, in Kenya in 1990,
and between 1981 and 1992 from students who have been studying in Australia for short
periods. Speakers have ranged in age from 20-38 years. Funding for the research has
been from an Australian Research Council grant.
2. Note that the Plural forms of the verb in Lango take o as the subject prefix (as does the
third singular) rather than wa '1 PL', wu '2 PL' and gi '3 PL' as used in Acholi. Thus
there is potential ambiguity if there is no overt subject, and Lango speakers often add a
third person plural pronoun plus a definite suffix ('this') following the verb as in the
examples given below. Thus a third option for sentence (8) is given in (c) below:
a. Ka o-poto gin-i i-pii-ni
when 3:PL-fal they-DET LOC-water-DET
'When they fell into the water.'
b. O-tye o-neno gin-i i-wang dirija
3:PL-be 3 PL-see they-DET LOC-eye window
'They were looking out of the window.'
c. Onwongo myero o-bol gin-i odilo
PI must 3:SG-throw they-DET ball
'They would have thrown the ball.'
The prefix o has generally replaced e as the form for third singular in Acholi and Lango
for past contexts (currently e is optionally used in non-switch-reference contexts only).
Non-past has a zero prefix in Acholi, but a is used in Lango. In past contexts, the form
gu is used for Third Plural in Acholi. This is derived from gi+o. For some Acholi
speakers, when the overt third plural pronoun (gin) is used, there is the option of using
the prefix o on the verb instead of gu.
3. Use of the Subjunctive form depends on the syntactic and semantic context. For a
number of lexical items the Subjunctive is used for resulting states (such as 'is stuck', 'is
dry' and 'is broken'), and for others, the short form is used as the intransitive. It can also
function as a nominal, so it is ambiguous as to whether amito cam is T want to eat' or I
want food'. For most verbs the long form is used for both transitive and intransitive
sentences. The final o is most likely an old marker of transitivity since it can be
analyzed, diachronically, as an object marker. Note that it takes the pronominal object
position. Synchronically, however, its function is a verbal marker, and is not restricted to
transitivity.
4. The anterior aspect marker is onwongo in Lango but onongo or nongo in Acholi. Past
tense is marked by three low tones on the verb in both languages.
5. In Dhopadhola nwango is used with past reference, and in Alur, nwango 'find' contrasts
with nango which has past reference. Compare (a) and (b) from Alur:
a. Koth o-daro bino
rain 3:SG-finish come
'The rain has come.'
The Obligation in Western Nilotic 133
REFERENCES
Bavin, E.L. 1982. "Aspects of Morphological and Syntactic Divergence in Lango and
Acholi". Studies in African Linguistics 13 (3).231-248
Bavin, E.L. 1990. "The ki-V Construction: Development of a Resultative". La Trobe
University Working Papers 2.29-42
Bybee, J. 1985. Morphology: A Study of the Relation between Meaning and Form.
Amsterdam & Philadelphia: John Benjamins
Bybee, J. & Pagliuca, W. 1985. "Cross-Linguistic Comparison and the Development of
Grammatical Meaning". Historical Semantics, Historical Word Formation ed. by J.
Fisiak. Berlin: Mouton.
Bybee, J., R. Perkins & W. Pagliuca 1994. The Evolution of Grammar: Tense, Aspect &
Modality, Ch. 6. Chicago: University of Chicago Press.
Coates, J. 1983. The Semantics of Modal Auxiliaries. London: Croom Helm
Crazzolara, J.P. 1955, [c 1938]. Acholi Grammar. Oxford: Oxford University Press.
Driberg, J.H. 1923. The Lango, a Nilotic Tribe of Uganda. London: J. Fischer Unwin.
Fleischman, S. 1989. "Temporal Distance: A Basic Linguistic Metaphor". Studies in
Language 13.1-50.
Gregerson, E.A. 1962. "Luo: a Grammar". Yale University PhD dissertation.
Huntingford, G.W.B.1959. Elementary Lessons in Dho-Luo. London: School of Oriental
and African Studies.
Kitching, Rev. A.L. 1907. An Outline Grammar of the Gang Language. London: Society
for the Promotion of Christian Knowledge.
Ladefoged, P., R. Grick & C. Criper 1968. Language in Uganda, Oxford: Oxford Univer
sity Press.
Malo, S. 1952. Dholuo Without Tears. Kisumu: Anyange Press.
Omondi, L. Ndong'a. 1982. The Major Syntactic Structures of DhoLuo. Berlin: Dietrich
Reimer Verlag.
Ringe, P.C. 1948, A Simple Alur Grammar. Eagle Language Study Series.
Stafford, R.L. 1967. An Elementary Luo Grammar. Nairobi: Oxford University Press.
Traugott, E. 1989. "On the Rise of Epistemic Meaning in English: An Example of
Subjectification in Semantic Change". Language 65.31-55.
The Gestural Expression of Modality in ASL
... The essence of language is human activity — activity on the part of one
individual to make himself understood by another, and activity on the part of
that other to understand what was in the mind of the first...
O. Jespersen, The Philosophy of Grammar (1924: 17)
For the most part, modern linguistic descriptions of American Sign Language
(ASL) have relied on formalist linguistic theories. The predominant effort has
been to demonstrate that ASL is indeed a language and can be described using
theories developed for spoken languages. Further, it is typically claimed that
the findings of ASL linguistic research lend support to these theories of
language.
These theories incorporate several powerful assumptions about the goals
and expectations of linguistic investigation (cf. Langacker 1991: 507-514).
One assumption is that grammar is independent of meaning. From this as
sumption follows what Givón (1989: 94-95) calls three "pre-empirical postu
lates":
(1) Language is a separate module of the mind/brain, not part of 'general
cognition';
(2) Structuralism in the analysis of language; that is, language structure
can be analyzed independently of its communicative function;
(3) The sign-relation between the linguistic code and its mental designa-
tum is arbitrary, unlike the obvious iconicity seen in pre-human com
munication.
136 Sherman Wilcox and Phyllis Wilcox
Another assumption is that the mental is independent from the physical. This
leads to the view that linguistic units are mental things that cannot be identi
fied with a set of articulatory, physical characteristics (Fowler 1986: 9). This
view is stated by Givón (1989: 95) as another pre-empirical postulate:
(4) Some abstract, idealized entity — be it langue or competence — is the
'object'of linguistic analysis.
2. Modality in ASL2
The expression of mood and modality has not been extensively explored in
ASL. Long (1918) in his chapter on auxiliary verbs briefly describes the
production and semantics of the following modals: CAN; CANT; MAY (MAYBE,
PERHAPS); MAY; SHOULD (OUGHT); MUST (NEED, HAVE TO); HAVE (FINISHED); WILL
(SHALL); WON'T.3
Fischer and Gough (1978) briefly mention modals in their discussion of
ASL verbs, but do not deal with them semantically. Padden (1988) also briefly
mentions the modals CAN, WILL, SHOULD, and MUST, but likewise does not
discuss in any depth the semantics of these words, focussing instead on what
types of nominais may precede modals. Finally, the semantics of modals is
typically dealt with in only a cursory way in most ASL textbooks. Questions of
strength, of which modals can express deontic and which can express epis-
temic modality, and other subtle semantic and pragmatic factors are left to
students to discover on their own.
Strong obligation in ASL is expressed with the lexical modal MUST (Figure
la). Weak obligation is expressed with the lexical modal SHOULD (Figure 2b).
Long (1918) describes the meanings and production of several ASL words.
This text is a rich source of etymological and historical information about
ASL (cf. Frishberg, 1975). We will cite Long throughout this paper, primarily
as evidence for phonological change.
Long (1918: 26) offers the following descriptions of SHOULD and MUST:
Should, Ought, indicating duty.— Press the crooked forefinger of the right
"G" hand against the lips and then move toward side and downward as in
"must" (see below). Or,
(2) Indicate by the signs "better" and "must." Better is signed as follows:
Place the end of the right open hand pointing toward the left, palm against
the mouth; draw away toward the side, assuming the position of "A" and
lifting it to a level with the head. "Must," see below.
Must, Need, Have to. — Crook the forefinger of the right "G" hand,
pointing it downward, and press the hand down some distance with more or
less force. Sometimes the motion is repeated several times.
The forms described by Long for SHOULD are no longer used. While MUST
is essentially unchanged from this earlier form, SHOULD is now made with
reduplicated flexion and extension of the wrist. This phonological relation
ship (single, punctual movement versus reduplicated, cyclic movement) will
reappear in other modals and will be explored in more detail in section 3.
According to Boyes-Braem (1981), the forms MUST and SHOULD are
related to the Old French Sign Language 4 form of NECESSARY (IL FAUT), in
which a G hand (extended index finger) pointed firmly down toward the
ground in front of the signer. This could also mean ICI (RIGHT-HERE) or THIS-
GROUND. Boyes-Braem argues that the more abstract concept 'necessary' is
assigned an abstract deictic handshape (the X, or bent index finger) in ASL.
Modals can occur in three positions: at the end of the verb phrase 5 as in
(1), preceding the verb as in (2), or preceding and at the end of the verb phrase
as in (3) (Humphries, Padden & O'Rourke, 1980).
t
(1) TELEPHONE NUMBER, WOMAN SHE-GIVE-ME SHOULD SHE
T h e woman should give me the telephone number.'
t
(2) TELEPHONE NUMBER, WOMAN SHOULD SHE-GIVE-ME SHE
T h e woman should give me the telephone number.'
(3) I MUST WIN RACE MUST I
'I must win the race.'
Necessity is indicated in ASL with the use of the word MUST. 6
The Gestural Expression of Modality in ASL 141
Figure 2: CAN
Figure 4: CAN'T
Epistemic possibility can be indicated in ASL with the lexical items POSSIBLE
(Figure 5a) and MAYBE (Figure 5b). Long (1918) related the word MAYBE
('may,' 'maybe,' or 'perhaps') to the physical act of comparing weights by
using the hands as a balance scale:
The Gestural Expression of Modality in ASL 143
May, Maybe, Perhaps, indicating probability.— Hold out both open hands
in front straight from the sides, palms up; balance the hands up and down
alternately several times like the balances of a pair of scales.
Figure 6: IMPOSSEBLE
In addition, there is the single lexical item IMPOSSIBLE (Figure 6). IMPOSSI
BLE is a stronger negation than NOT POSSIBLE; it indicates that the proposition is
not only not very likely but absolutely impossible.
There is also a variant of CAN'T (Figure 4) that means 'impossible'. It is
signed with two small, repeated movements, IMPOSSIBLE (Figure 6) is much
more commonly seen.
Epistemic modalities in ASL are expressed with lexical items such as
7
FEEL, SEEM and OBVIOUS (Figure 7) , in other constructions such as questions,
and with particular NMSs. For example, in discussing a missing item in one's
apartment, the dialogue in example (12) could occur (from Cokely & Baker
1980).
gaze If q
(12) FEEL SOMEONE 1 -CL-rt 'person walks by' NOTICE-TO GO-INTO SWIPE YOU 8
'Do you suppose someone walked by, noticed it, and just went in
and stole it?'
n nodding
NOT-KNOW SEEM+
'I don't know, apparently that's what happened.'
FEEL in the epistemic sense is related to the word FEEL used for the
physical sense. Typically, the latter is produced with a single upward move
ment, while the former uses shorter movements, often reduplicated, and a
distinct set of NMSs (this relationship is discussed in more detail in section 3).
SEEM is historically related to the words MIRROR and COMPARE, words
related to physical resemblance. In its old form SEEM retained some of its
"traditional" or lexical iconicity: "Bring the open right hand up in front
toward one side with the thumb edge toward self; turn the hand so as to
The Gestural Expression of Modality in ASL 145
present the palm toward self and fix the eyes upon if9 (Long, 1918: 63, our
italics). Compare this description of SEEM to Higgins' (1923: 92) almost
identical description of MIRROR: "vertical palm held at side of face and is
gazed into as if into a mirror." As we will see below, Frishberg (1975) claims
that this type of iconicity is eroding.
OBVIOUS is related to the ASL word BRIGHT ('bright' or 'light'). Com
pared to BRIGHT it is made with less stressed, reduced movement. In its
epistemic sense it appears in constructions such as:
(13) HE RICH, OBVIOUS
It is worth noting that all of these modals are related to ASL words which
denote physical activity and perception:
(i) physical strength (CAN)
(ii) judging the physical weight of an object (MAYBE)
(iii) the physical sense of feeling (FEEL)
(iv) physical resemblance, as reflected in a mirror (SEEM)
(v) the bright light needed for clear visual perception (OBVIOUS)
This relationship has been noted by others (Sweetser 1990), and clearly
relies on a metaphorical mapping between embodied, physically-grounded
source domains and mental, abstract target domains. The widespread use of
conceptual and experiential metaphor in ASL grammar is explored more fully
in P. Wilcox (1993).
Finally, we should point out that the deontic modals MUST and SHOULD are
not used epistemically in ASL. Thus, the sentence:
146 Sherman Wilcox and Phyllis Wilcox
cannot mean 'You must be tired' in the epistemic sense. Likewise, the
sentence, "We should be finished by 5 PM" in ASL would not use the lexical
item SHOULD.
ASL epistemic sentences may also be produced without any of the lexical
items discussed above. One way in which this may be accomplished is with
tag questions:
_q
(15) YOU WORK HARD, YOU TIRED YOU, RIGHT?
wh+y
RAIN
'Surely it's raining.'
NMSs (head nodding, eyes squinting, eyebrows squeezed together)
The NMSs that accompany (16) and (17) are quite similar to those that
accompany imperative sentences. For example, consider example (18) in
Figure 8. The imperative NMSs that accompany the verb GIVE include a sharp
head nod, a slight leaning forward of the torso, squinting the eyes, and
squeezing the eyebrow together (Humphries, Padden, & O'Rourke 1980). We
t
(18) TICKET, YOU-GIVE-ME!
'Give me the ticket!'
Figure 8: NMS's in imperatives
t
(19) TELEPHONE NUMBER, WOMAN SHE-GIVE-ME, SHOULD SHE.
The woman should give me the telephone number.'
Figure 9: NMS's in deontic modality
148 Sherman Wilcox and Phyllis Wilcox
whq
(20) YOUR CAR, WHERE?
'Where is your car?'
Figure 10: NMS's in WH-questions
also see similar NMSs in the expression of deontic modalities such as example
(19) in Figure 9. Clearly related are the NMSs that accompany WH-questions
(see example (20) in Figure 10).
Thus, we find quite similar NMSs used in the expression of the following
functions in ASL:
(A) imperative; obligation; request for information; epistemic prob
ability
In the first two (imperative and obligation), the NMSs must accompany
lexical material (in imperatives the focus of the NMSs is on the imperative
verb; in deontic modals the entire phrase). In WH-questions, the NMSs may
accompany a phrase containing a WH-word, as in (20), or a generalized
question word usually glossed as "WHAT" (it can mean 'what', 'where', 'who',
'huh' etc.) Given the proper context, NMSs alone (i.e. without a wh-word)
can signal a WH-question (21). As we have seen above, in the expression of
epistemic probability, these NMSs may be used with modals or alone.
(21) A: YESTERDAY ME BUY TICKET ME.
'Yesterday I bought tickets.'
B: TICKET?
'What tickets ... for what?'
We suggest that the relationships described in (A) are an expression of
interpersonal transitivity. This in turn is related to the grammatical expression
The Gestural Expression of Modality in ASL 149
Figure 11:WILL
of certainty, power, and politeness (Givón 1989; Brown & Levinson 1978).
Parallel to (A), the scale of interpersonal transitivity runs from (i) to (iv)
below:10
(i) I command you to do something: "Sit down!"
(ii) I obligate you to do something: "You must sit down."
(iii) I make a polite request (this indirect speech act is accomplished by
means of a request for information in form): "Why don't you sit
down?"
(iv) I infer something about you: "You must be tired."
The final modal we will discuss is WILL. Long (1918: 27) attributes only a
future sense to WILL:
Will and Shall, indicating future.— Hold the right open hand pointing
straight out and elevated to a level with the should, and push it straight
forward the length of the arm.
WILL can also be used in intentional (22), predictive (23), and volitional
senses11 (24):
(22) SEND-YOU LETTER, WILL ME.
'I will send you the letter.'
(23) HE RECOVER WILL HE.
'He will get well.'
(24) HE ACCEPT RESPONSIBILITY WILL HE.
'He will accept responsibility.'
150 Sherman Wilcox and Phyllis Wilcox
time line is a line along the horizontal plane near the signer's ear and cheek
used consistently for many signs indicating time other than the present, WILL
and TOMORROW move forward along the time line (see Figure 11); YESTERDAY,
BEFORE, LONG-AGO are examples of signs that move toward the signer's
back.13 This is not to say that WONT has acquired a temporal sense. It has
resulted, however, in WILL and WONT taking part in a nicely iconic relation
ship: volitional WILL moves forward (as does future WILL) while WONT jerks
backward.
Finally, it should be noted that the gestural component of old WONT and
modern WONT are the same, even though the change in location has resulted in
a change from upward movement to backward movement: both are produced
with forearm flexion (bicep contraction).14
3. Iconicity in ASL
Judging from the literature on ASL, the existence of iconicity in this language
has been viewed as something of an embarrassment to ASL linguists. This is
not hard to understand, given that from Saussure to Chomsky, formal linguis
tics has maintained that to the extent that a communication system exhibits
iconicity, it is less a human language. More recently, Givón (1984, 1989) and
Haiman (1984) argue against this position for spoken languages, suggesting
instead that linguistic form is often motivated by iconic principles. For exam
ple, Haiman (1984: 1) writes, "linguistic forms are frequently the way they are
because, like diagrams, they resemble the conceptual structures they are used
to convey."
It is difficult to deny the existence of iconicity in signed languages. Few
sign language linguists would, however, ascribe the deeply significant role to
iconicity that functional and cognitive linguists have in spoken languages.
Rather, many ASL linguists have taken a different approach, the intent of
which appears to be to explain away iconicity in ASL. They claim that
although iconicity may be present in ASL, it is eroding over time; it plays no
role in the acquisition, processing, or storage of language; and grammatical
processes routinely act to override and submerge it.
Frishberg (1975) argues that historical change acts to diminish iconicity
and heighten the arbitrary nature of ASL. We would point out, however, that
Frishberg's study deals primarily with iconicity at the lexical level. Meier
152 Sherman Wilcox and Phyllis Wilcox
(1981; 1991) and Klima & Bellugi (1979) argue that iconicity plays an
insignificant role in the acquisition of ASL by deaf children. In Meier's study,
two models of iconicity are proposed. One assumes that deaf children are
attuned to verbs that happen to be enactments, or mimes, of an action, the
other that deaf children are attuned to verbs that map the spatial relations of
the actors. It turns out that deaf children follow neither model; deaf children in
Meier's study showed no tendency to use iconic verb forms earlier than
arbitrary verb forms.
It is interesting to note that Meier's second iconic model was tested using
the so-called directional, or person-agreement forms. In ASL, person-agree
ment in these verb forms is indicated by incorporating spatial loci (which have
typically been assigned to a previously mentioned nominal) into the verb.
Other verb inflections in ASL, such as aspect, involve changes in the tempo
ral/kinematic structure of the verb's movement. Thus, even if we accept
Meier's conclusions about the influence of iconicity vs. arbitrariness in lan
guage acquisition, it is worth noting that these verb forms do, nevertheless,
incorporate iconicity: "Prototypically and phylogenetically/ontologically,
topic/participant deixis tends to be spatial, while propositional deixis tends to
be temporal" (Givón, 1984:270). In ASL, this type of deixis is coded by
indexing spatial locations. Coding of information about the temporal structure
of a verb's meaning, however, is accomplished by modifying the verb's
movement (which necessarily involves the sign's temporal structure). There
is no evidence that ASL is becoming more arbitrary in this regard — that is,
ASL is not beginning to code topic/participant deixis in the kinematics of the
verb's movement and aspectual information as spatial locations.
Klima & Bellugi (1979) further claim that iconicity is not relevant in
encoding and remembering processes. Such a claim is at odds with Givón's
(1989: 97) iconic imperative:
All other things being equal, a coded experience is easier to store, retrieve
and communicate if the code is maximally isomorphic to the experience.
Finally, Klima & Bellugi (1979) argue that the grammatical processes of
ASL function to suppress whatever iconicity may exist: "One of the most
striking effects of regular morphological operations on signs is the distortion
of their forms so that iconic aspects of the signs are overridden and sub
merged" (1979: 30). As an example, they describe the morphological process
which may be added to a signed word to indicate intensity. This process
consists of the addition of an initial hold on the sign followed by a sudden
The Gestural Expression of Modality in ASL 153
(a)IDEA (b)SUPPOSE
Figure 13: IDEA and SUPPOSE
release. 15 When applied to the sign SLOW, the resulting sign means 'very
slow.' Klima and Bellugi's point is that the sign VERY-SLOW is made with a
very fast movement: "Thus the form of 'very slow' is incongruent with the
meaning of the basic sign" (30). The grammar of ASL seems to have sub
merged the iconicity of VERY-SLOW.
But if this is true, it is only because iconicity has re-emerged in the new
form, VERY-SLOW has not overridden the iconicity, it has merely shifted it to
the morpheme meaning 'very.' 'Very' can be understood metaphorically in
terms of the build up of pressure (intensity). As pressure increases so does
intensity, until it is suddenly released. In the ASL morpheme 'very' this build
up of intensity is iconically signalled by a build up muscular or gestural
energy. Finally, the metaphoric pressure is released — as is the gestural energy
in the sudden release into a short, rapid movement. It is an iconic process, and
one which, we would suggest, is not unlike a similar process at work in spoken
language when, for example, teenagers say, "That's bad." Typically, 'bad'
will be pronounced with an extended hold on the bilabial, followed by a
sudden release. Of course, the meaning is the same as the ASL example —
'very' (how 'very bad' came to mean 'very good' is another story).
SHOULD is unstressed and reduplicated. The same holds true for CAN and
POSSIBLE.
For FEEL, the relevant feature for the present discussion is reduplication.
When FEEL is produced using a single, upward movement it refers to the
physical sense of feeling. When it is produced with multiple, reduplicated
movements (and the appropriate NMSs discussed above) it acquires the
epistemic meaning described above.
The same phonological relationships are seen between the forms for
BRIGHT/OBVIOUS described above, and for IDEA and SUPPOSE (Figure 13).
SUPPOSE can be used to express hypotheticals ("Suppose you and I drive to
California?") and to mark conditionals:17
(28) SUPPOSE TOMORROW SNOW, I GO-AWAY SKIING.
'If it snows tomorrow, I'll go skiing.'
In order to describe how these forms are iconic, we must return to
Stokoe's semantic phonology and propose the notion of phonological transi
tivity. By phonological transitivity, we mean simply that ASL signs in their
production can be described as phonologically transitive (an agent, a hand,
performs a punctual action on and significantly changes a passive patient, the
other hand) or phonologically intransitive (a single patient acts, non-punctu-
ally). We are intentionally using semantic terms for phonological, or forma-
tional, characteristics of a sign, to bring out the close relation between
meaning and form — the iconic relation.
To further elaborate the concept of phonological transitivity we can
invoke several of the parameters proposed by Hopper and Thompson (1980:
252) for understanding the notion of semantic transitivity. The parameters
listed in Table 1 seem especially pertinent to phonological transitivity.
The Gestural Expression of Modality in ASL 155
Many of the modals discussed in this paper have only one participant
(even when two hands are forming the words, it is not the case that one is
acting on the other). Reduplication affects a sign's formational characteristics
on the kinesis, aspect, and punctuality parameters: reduplicated signs are less
active, nontelic, and have no obvious transitional phase between inception
and completion (i.e., they are cyclic as opposed to punctual). 18 Finally, note
that phonological transitivity is an expression of phonetic or gestural sub
stance (this will be elaborated further below).
The iconicity principle at work in these modals can now be summarized
as follows:
(A) Senses which are more physically or interpersonally grounded, or
semantically less abstract, are expressed with more gestural sub
stance; the more epistemic or semantically abstract senses are
expressed with reduced gestural substance.
Ferreira Brito (1990: 255) comments on the same relationship between
deontic and epistemic modals in Brazilian Cities Sign Language (BCSL).
Deontic modals are expressed with "simple and energetic movements" which,
she suggests, is related to an underlying metaphor of obligation for something
to be done as actions or events in the real world. Epistemics are distinguished
by nonenergetic movement of the hands.
SEEM is more complex. Consider the following: In a discussion of which
of two possible translations would be more appropriate for a particular ASL
utterance, the following conversation took place:
(29) Speaker A: 'Which do you think she said, this [indexi] or that
[indexj]?'
Speaker B: SEEM++ INDEXI
'She probably said this ...'
(The two study the videotape of the utterance under discussion
some more.)
Speaker B: SEEM INDEXJ!
'Yes, this has to be what she said.'
In example (29), the first occurrence of SEEM is slowly reduplicated; the
second is signed once with emphatic stress. The first expresses weak epis
temic probability, the second stronger epistemic certainty. The iconicity
principle in operation here seems to be:
156 Sherman Wilcox and Phyllis Wilcox
4. Conclusion
In turn, ASL research has done its part in furthering this symbiosis. If the
formalist approach merely assumes that it is possible to view some abstract,
idealized entity as the true object of linguistic analysis for spoken languages,
signed languages seem positively to demand this stance. How else, the reason
ing would suggest, could a unified linguistic theory be developed across such
radically different substances as light and sound? Having made such an
assumption, the conclusion from ASL research that language emerges in a
linguistically driven manner (Poizner, Klima and Bellugi 1987: 23) is taken as
support, and the symbiotic cycle is completed.
Because of these conditions, and because ASL is an unwritten language
which was not seriously described and studied until the late 1950s (Stokoe
1960), there have been few studies of grammatization in this language. We
believe we have shown here, however, that when such studies are attempted,
they demonstrate that the same processes at work in other languages
(Fleischman 1982; Heine et al. 1991; Sweetser 1990) — semantic extension
from concrete to abstract senses, reemergence of iconicity between cognitive
and linguistic patterning, the use of physical world vocabulary to talk about
the mental world — are operating in ASL.
NOTES
1 This view is more fully described in Armstrong, Stokoe and Wilcox (in press).
2 We want to gratefully acknowledge the assistance of our consultants, whose expert
intuitions were invaluable: Dennis Cokely, Antoinette Eidson, Betsy McDonald, Jenny
Singleton, and Anna Witter-Merithew. Figures are reproduced from Baker and Cokely
(1980) and Humphries et al. (1980).
3 ASL words will be glossed in English with small capital letters. The plus symbol (+) is
used to indicate reduplication.
4 It is well-attested that ASL is historically related to French Sign Language.
5 Humphries, Padden, & O'Rourke (1980) state that the modal occurs at the end of the
sentence, but their example #1 (p. 98, given above as example (1)) suggests that our
characterization is more accurate. These examples and many others in this paper are
from the exercises in Humphries et al.
6 Actually, there are four related words: MUST, NEED, HAVE-TO, and SHOULD, distinguished
by changes in stress and repetition. They are commonly grouped by repetition (cf.
Humphries et al.): MUST/HAVE-TO (+stress, -repetition/-stress, -repetition) and NEED/
SHOULD (+stress, +repetition/-stress,+repetition). See Section 3 for further discussion.
7 Figure 7c is actually the word BRIGHT; OBVIOUS is made with less stressed, reduced
movement
The Gestural Expression of Modality in ASL 159
8 In example (12), FEEL SOMEONE ... is signed with the dominant hand, YOU is signed
simultaneously with the non-dominant hand.
9 Of course it must be noted that this does not signal a wh-question.Still, it is interesting
that Givón (1989: 156) notes that "the use of WH forms to code epistemic uncertainty is
wide-spread."
10 The use of these expressions to convey the same speech act is not universal; thus, ASL
cannot use the request of information (#3) to indirectly convey a command.
11 Example (24) can have both predictive and volitional readings
12 There is another word using the " S " handshape which includes the notion of denial or
negation: REBEL (Long (1918: 46) glosses this as DISOBEY).
13 Although TOMORROW and YESTERDAY also use the open-A handshape, they are not related
to the negative signs WONT, NOT, and DENY. Rather, this handshape is a remnant of
French Sign Language (FSL) forms. In FSL, counting starts with the extended thumb as
' 1 ' . In ASL, ' 1 ' is represented with the extended index finger. Thus, TOMORROW and
YESTERDAY appear to be complex forms representing 'one day in the future' and 'one day
in the past' respectively. This also reflects the different ways Americans and Europeans
count on their fingers.
14 Mowrey & Pagliuca (1988) discuss the same predicament for autosegmental versus
gestural phonologies in terms of the features [coronal] and [anterior]; Bybee (1992) for
'plural' and [coronal] in her discussion of formal/structural versus functional/substan
tive phonology.
15 Klima & Bellugi (1979:30) describe only the change in movement: "an extremely short,
rapid movement." Also note that by characterizing the process in terms of addition of an
initial hold, we are not implying an analysis based on the Movement-Hold phonology of
Liddell and Johnson (1989).
16 Wilbur & Schick (1987) describe several features associated with stress, including non-
manual behaviors, sharp sign boundaries, higher placement in the signing space, faster
movement, and repetition.
17 Conditionals need not be marked with SUPPOSE; a NMS (raised eyebrows) is enough to
signal the protasis. It is interesting that this NMS is also used to mark topics (cf. Haiman
1978).
18 It may seem that reduplicated signs are more, not less active. Note, however, that as a
sign such as MUST is repeated, especially if it is also unstressed, the result is to change it
from a punctual event to a cyclic state.
19 Although we leave the question unexplored at present, we believe that this is an instance
of the iconic expression of force-dynamics in ASL modals (cf. Talmy 1985).
20 Bybee, Perkins, & Pagliuca (1994) note that a common source for obligation are forms
meaning 'owe'. The relationship between the forms in Figure 14 would suggest that they
too are historically related. We have no evidence that they are (and indeed Boyes-Braem
suggests a different lexical source for MUST, as we noted above). Note also that OWE is
used almost exclusively with a financial connotation.
160 Sherman Wilcox and Phyllis Wilcox
REFERENCES
Armstrong, D.F., W.C. Stokoe & S.E. Wilcox. 1995. Gesture and the Nature of Language.
Cambridge: Cambridge University Press.
Browman, C. & L. Goldstein. 1990. 'Tiers in Articulatory Phonology, with Some Impli
cations for Casual Speech." In J. Kingston & M.E. Beckman (Eds.), Papers in
Laboratory Phonology I: Between the Grammar and Physics of Speech. Cambridge:
Cambridge University Press.
Brown, P. & S. Levinson. 1978. "Universals in language usage: Politeness Phenomena."
In E.N. Goody (Ed.), Questions and Politeness: Strategies in Social Interaction.
Cambridge: Cambridge University Press.
Bybee, J.L. 1985. Morphology: A Study of the Relation Between Meaning and Form.
Amsterdam: John Benjamins.
Bybee, J. 1994. "A View of Phonology from a Cognitive and Functional Perspective."
Cognitive Linguistics, 5-4, 285-305.
Bybee, J., Perkins, W.D. & W. Pagliuca. 1994. The Evolution of Grammar: Tense, Aspect
and Modality in the Languages of the World. Chicago: University of Chicago Press.
Boyes-Braem, P. 1981. Features of the Handshape in American Sign Language. Unpub
lished doctoral dissertation, University of California, Berkeley.
Cokely, D. & C. Baker. 1980. American Sign Language: A Student Text (units 1-9). Silver
Spring, MD: TJ Publishers.
Deane, P.D. 1991. "Syntax and the Brain: Neurological Evidence for the Spatialization of
form Hypothesis." Cognitive Linguistics, 2(4), 361-367.
Deane, P.D. 1993. Grammar in Mind and Brain: Explorations in Cognitive Syntax.
Berlin/New York: Mouton de Gruyter.
Edelman, G.M. 1987. Neural Darwinism: The Theory of Neuronal Group Selection. NY:
Basic Books.
Edelman, G.M. 1989. The Remembered Present: A Biological Theory of Consciousness.
NY: Basic Books.
Ferreira Brito, L. 1990. "Epistemic, Alethic, and Deontic Modalities in a Brazilian Sign
Language." In S.D. Fischer & P. Siple (Eds.), Theoretical Issues in Sign Language
research. Chicago: University of Chicago Press.
Fischer, S.D. & B. Gough. 1978. Verbs in American Sign Language. Sign Language
Studies, 18, 17-48.
Fleischman, S. 1982. The Future in Thought and Language: Diachronic Evidence from
Romance. Cambridge: Cambridge University Press.
Fowler, C.A. 1986. "An Event Approach to the Study of Speech Perception from a Direct-
realist Perspective." Journal of Phonetics, 14, 3-28.
Frishberg, N. 1975. "Arbitrariness and Iconicity: Historical Change in American Sign
Language." Language, 51, 676-710.
Frishberg, N. & B. Gough. 1973. Morphology in American Sign Language. Manuscript,
The Salk Institute for Biological Studies, La Jolla, Calif.
Givón, T. 1984. Syntax: A Functional-Typological Introduction Volume I. Amsterdam:
John Benjamins.
Givón, T. 1989. Mind, Code and Context: Essays in Pragmatics. Hillsdale, NJ: Lawrence
Erlbaum.
The Gestural Expression of Modality in ASL 161
Stokoe, W.C. 1991. "Semantic Phonology." Sign Language Studies, 71, 107-114.
Studdert-Kennedy, M. 1986. "Two Cheers for Direct Realism." Journal of Phonetics, 14,
99-104.
Sweetser, E. 1990. From Etymology to Pragmatics: Metaphorical and Cultural Aspects of
Semantic Structure. Cambridge: Cambridge University Press.
Talmy, L. 1985. "Force Dynamics in Language and Thought." In Papers from the Parases-
sion on Causitives and Agentivity, 1, 293-337. Chicago: Chicago Linguistic Society.
Wilbur, R.B. & B.S. Schick. 1987. "The Effects of Linguistic Stress on ASL Signs."
Language and Speech, 30(4), 301-323.
Wilcox, P. 1993. Metaphorical Mapping in American Sign Language. Unpublished dis
sertation, University of New Mexico.
Wilcox, S. 1990. "The Structure of Signed and Spoken Languages." Sign Language
Studies, 67, 141-151.
II
The Interaction Basis of Modality
The Development of Epistemic
Sentence-ending Modal
Forms and Functions in Korean Children
Soonja Choi
San Diego State University
Modality is the linguistic domain that covers the speaker's attitudes and
opinions about a proposition (Palmer 1986), and it can be divided into at least
two semantic types: agent-oriented modality and epistemic modality (Bybee
1985). Agent-oriented modality, which includes deontic modality, expresses
various conditions on the agent, such as ability and obligation, with regard to
a proposition. For example, in the sentence 'John can (or must) lift this stone',
the auxiliary verbs, can and must, express the ability and obligation respec
tively of the agent toward the predicate. Agent-oriented modality also in
cludes desire and intention of the agent as well as permission. On the other
hand, the central function of epistemic modality is to express the degree of the
speaker's commitment to the truth of a proposition. For example, in 'John
should (or may) be home by 5 o'clock', the speaker expresses certainty (or
possibility) about the truth of a proposition. Palmer (1986) and others (Bybee
1985, Chafe & Nichols 1986) include evidentiality as a type of epistemic
modality, one which specifies the source of information, e.g., hearsay, direct
experience, inference. By specifying how the information is obtained, the
speaker conveys the kind of warrant he/she has about the proposition, and
therefore, the degree to which the proposition is likely to be true. Thus,
broadly defined, epistemic modality refers to 'the status of the speaker's
understanding or knowledge' of a proposition (Palmer 1986:51).
The semantic distinction between agent-oriented and epistemic modality
often parallels a morphological distinction. In an extensive cross-linguistic
study of morphology, Bybee (1985) calls attention to a strong tendency for
epistemic modality to be expressed inflectionally in bound forms (i.e. morpho-
phonologically fused with the verb stem), whereas agent-oriented modality is
166 Soonja Choi
to be acquired by Korean children. Both Kim (1992) and Choi (1991) have
shown that productive uses of SE suffixes begin at around 1;9, i.e., before 2
years.
In an earlier study (Choi 1991), I analyzed the semantic functions of the
first three suffixes, -ta, -e, and -ci, acquired by three Korean children. The
data showed that, between 1;8 and 2;2, the children acquired the suffixes in a
consistent order, and with clear semantic distinctions that relate to old and
newly-acquired knowledge (expressed by -e and -ta respectively) and to
certainty of information (-ci). As a first step toward understanding such an
early acquisition of modality, I examined the frequency of the suffixes in
caregivers' speech to the children. The result showed that the order of acquisi
tion correlated only partially with the input frequency. The study, therefore,
suggested that the acquisition of the modal suffixes in Korean is the result of
an interaction between the child's cognitive development and the language-
specific input.
The present study investigates what roles caregiver-child interactions
and the child's cognitive capacity play in the acquisition of Korean modal
suffixes. The study analyzes discourse interactions between the caregiver and
child, paying attention to both linguistic and extralinguistic contexts in which
the child produces a given suffix. The analysis shows that a given suffix is
consistently used by all three children in specific types of discourse-pragmatic
context. The present study attempts to characterize these types in a systematic
way. This discourse approach to data not only enables us to discover the
modal meanings acquired by the children, but also gives us insight into how
caregiver-child interactions contribute to the child's development of modal
meanings. In fact, it will be argued that the discourse-interactional component
embedded in the modal meaning enhances acquisition, and that this explains
the difference between the early acquisition of modal suffixes in Korean and
the relatively late acquisition of epistemic modality reported for other lan
guages. To investigate the cognitive aspects of the acquisition, the kinds of
verbs the children use with each suffix will be analyzed, as children's lexical
development often gives us insight into their cognitive development (Gopnik
& Meltzoff 1984; Tomasello 1992). The study will show that both the child's
cognitive understanding of the world and the discourse-interactional aspects
of contexts play an important role in children's early acquisition of modal
suffixes. The analysis of the present paper includes five modal suffixes
acquired between 1;8 and 3;0 by the three children.
168 Soonja Choi
tended to the expression of old and new knowledge: -dt is used for informa
tion that is expected from some background knowledge (i.e. old knowledge),
but -mts is used for an unexpected event (i.e. new knowledge). Although
languages are not identical in terms of how different degrees of integration of
knowledge are semantically categorized and how they relate to specific types
of evidentiality, these studies and others suggest that the old and new knowl
edge distinction is an important basis for the evidential/epistemic system in
many languages, particularly Asian languages, e.g., Sherpa (Woodbury
1986), Chinese pidgin Russian (Nichols 1986), Korean (H. Lee 1991).
The meanings related to the status of knowledge in this way are often
interrelated with discourse pragmatics. Palmer (1986) notes that, in several
languages, expressions of status of knowledge depend on what the speaker
thinks the listener knows. For example, in Kogi, ni- is used when the speaker
reminds the listener of information that the latter presumably knows already
(e.g., ni-gu-ku-á T did it just a while ago, as you know'), but na- is used when
the speaker informs the listener of an ostensibly new proposition, (e.g., na-gu-
ngú 'I tell you he did it some time ago') (Palmer 1986: 76). Palmer goes on to
note that, in these cases, it is hard to distinguish between modality and
discourse interactional features, since the latter is an integral part of the
expression of the speaker's knowledge status.
It is not easy to draw a clear distinction semantically between discourse
features and modality. For in discourse we often express opinion, draw
conclusions, etc.; and no doubt, in 'evidential' language [sic] speakers
regularly indicate in their conversations and arguments the evidential basis
for what they are saying. [...] what are clearly systems of epistemic modality
may contain some terms that belong more to discourse (Palmer 1986: 9Iff.)
Korean has two types of modal forms: sentence-ending (SE) modal suffixes
and auxiliary verbs. Table 1 shows a list of modal forms in the two categories
with approximate meanings in English, SE suffixes occur typically at the end
of the sentence, as in (1) with the suffix -e. Korean is a verb final (sov)
language, and SE suffixes occupy the final position among the inflections on
the predicate: verb or adjective1.
(1) Younghi-ka Seoul-ul ttena-ss-e.
Younghi-SUBJ Seoul-OBJ leave-PAST-SE
'Younghi left Seoul.'
On the other hand, auxiliary modal verbs are free morphemes which can
occur after the main verb. An auxiliary verb must be connected to the main
verb by a specific connecting suffix (= connective) on the latter to express a
given modal meaning. For example, the connective -ya followed by the
auxiliary verb tway-ta, i.e. -ya tway-ta (-ta is the citation form), expresses
obligation, whereas the connective -to with tway-ta expresses permission.
(2) Younghi-ka Seoul-ey ka-ya/to tway-e.
Younghi-SUBJ Seoul-to go-coNN AUX-SE.
'Younghi must/can go to Seoul.'
Other examples include the connective -ko and the auxiliary siph-ta express
ing desire, and -swu iss-ta expressing ability. Thus, connectives are an inte
gral part of the whole auxiliary modal system in Korean. A list of combina
tions of frequently used connectives and auxiliary verbs in Korean is shown in
Table 1B.
All of the auxiliary verbs can be used as main verbs except -ko siph-ta,
which occurs only as an auxiliary. The meanings of these verbs in the main
verb position are as follows: molu-ta 'not to know', kath-ta 'to be the same',
ha-ta 'to do', i-ta 'to be (Copula)', po-ta 'to see', iss-ta 'to exist', tway-ta 'to
become'. As shown in Table 1, when these verbs are used as auxiliaries, some
denote epistemic meanings (i.e. kath-ta, molu-ta, i-ta, po-ta), and others
agent-oriented meanings (i.e. iss-ta, tway-ta, ha-ta). However, note that each
form denotes either agent-oriented or epistemic meaning, and there is no overlap
of the two types of modality on the same verb, except the verb hata 'to do'.
The Development of Epistemic Sentence-ending Modal 171
B. Modal auxiliaries
Canonical structure: Main verb-Connective Auxiliary-tense/aspect-SE.
Form Meaning
EPISTEMIC
informal (infinitive)
-ci molla (moluta) possibility (weak)
-kes kathay (katha) possibility (strong)
-tus hay (hata) possibility (strong)
-n/l kkeya (kesita) probability
-na pwa (pota) inference
DEONTIC
-su isse (issta) ability
-to tway (twayta) permission
-ya hay/tway (hata/twayta) obligation
-ko siphe (siphta) desire
mos + Verb negation of ability
-myen an tway (twayta) negation of permission, negation of obligation
-ki silhe (silhta) negation of desire
172 Soonja Choi
o-ass-ta
come-PAST-SE
'(He) has come.'
The use of -ta is appropriate in this context in which John has just seen the
friend and registers the information in his mind for the first time. Comments
made with -ta may not be directed to the listener; they can be noteworthy
remarks to the speaker himself. Once the speaker has registered this new
information, in subsequent mentions, e.g., when the speaker repeats the
information again to his wife, or when he tells this experience later to another
friend, he would use the SE form -e:
o-ass-e
come-PAST-SE
'(He) came.'
In this latter context, the use of -ta, i.e. *o-ass-ta is ungrammatical. The use of
-ta is not restricted to perception of the outside world. It can also occur with an
internal state of mind. For example, adult Koreans often say al-ass-ta (know-
PAST-SE) '(I)'ve got (it)' when they have just come to an understanding of
The Development of Epistemic Sentence-ending Modal 173
something. (It should be noted that -ta is not an aspectual marker denoting
perfective aspect. One can use -ta when describing a state of affairs as well as
an on-going activity if it draws the speaker's attention for the first time, e.g.,
ippu-ta [pretty-SE] '(she's) pretty', or no-n-ta [play-PRES-SE] '(They) are playing').
The suffix -kwun is similar to -ta in that it is also used for newly acquired
information. However, the information marked by -kwun is often an inference
based on what the speaker has just seen, as in example (3).
(3) (The mother, upon arriving home, sees her children's school bags
lying around. She says to herself,)
ai-tul-i tolao-ass-kwun.
child-PL-suBJ return-PAST-SE
'The children have returned (home).'
By using -kwun, the mother indicates that the proposition 'the children re
turned' is inferred from seeing their school bags. As H. Lee (1985, 1993)
argues, both -ta and -kwun are used for knowledge that has not yet been
assimilated into the speaker's knowledge system. The difference between the
two suffixes is that -ta is used with knowledge obtained through direct
experiences, whereas -kwun is often used with knowledge obtained through
inference.
In contrast to -ta and -kwun, -e is used when the information has already
been assimilated into the speaker's knowledge system: the speaker acquired
the information in the past, therefore has known it for some time. As men
tioned above, once newly perceived information is encoded with -ta or -kwun,
in subsequent mentions -e must be used. In H. Lee's survey (1991), -e is the
most frequent suffix (58%) in spontaneous discourse interactions. This is not
surprising, since in conversations participants often contribute to the topic by
offering information that they already have about the topic. Indeed, one can
conclude that -e is the unmarked suffix in conversations.
In discussing -kwun above, we have seen how a particular status of
knowledge is related to an evidentiary source, e.g., inference. There are a few
other suffixes that differentiate evidential sources in Korean. The difference
between -toy and -ney is a case in point. Whereas -toy marks hearsay or
reported speech, -ney signals that the event was directly witnessed by the
speaker.
All of these suffixes are used only in informal conversational interactions
where participants, familiar to one another, freely and spontaneously ex-
174 Soonja Choi
change information about the current topic. Probably due to this characteris
tic, the meanings of many suffixes incorporate the speaker's assumption about
how much the listener knows. That is, the speaker's choice of a specific SE
suffix reflects his or her assumption about status of the listener's knowledge
about the proposition (H. Lee 1991). For example, the suffix -ci (or -cyana) is
used when the speaker commits him/herself to the truth of the proposition; it
therefore expresses certainty (-cyana expresses a stronger commitment than -
ci). At the same time, -ci denotes that the information is also known to the
listener or can be readily inferred by the listener. This is confirmed by the use
of -ci in questions. The suffix -ci? is used in a question when the speaker is
committed to the truth of the proposition and at the same time wants to
confirm that the listener shares this commitment. This is shown in (4).
Considering the context of (4), the meaning conveyed by -ci is that the
speaker is certain that Mary is pretty and also believes that the listener has the
same opinion. The speaker expects agreement from the listener. Compare this
with the suffix -nil in (5). When -nil is used, it conveys the speaker's
assumption that the listener knows more than the speaker about the matter
(since only the latter saw Mary). By using -ni the speaker expresses a desire to
know what the listener knows to be the true.
(4) (Possible context: Both speaker and listener saw Mary and both
liked her. The speaker later asks,)
Mary-ka ippu-ci?
Mary-SUBJ pretty-SE?
Tsn't Mary pretty?'
(5) (Possible context: The speaker heard from someone that Mary was
pretty. The speaker asks the listener, who has seen Mary,)
Mary-ka ippu-ni?
Mary-SUBJ pretty-SE?
Ts Mary pretty?'
Thus, -ci and -ni express different degrees of certainty which also relate to
what the speaker assumes about the listener's knowledge. To this category,
the suffix -ta (Type 2) should be added. This suffix is used when the speaker
has assimilated the information conveyed in the proposition but believes that
the information is new and noteworthy to the listener. In this case, -ta (Type 2)
is typically used with a high pitch (see the acquisition section below). For
example, when the speaker but not the listener has seen Mary, and the speaker
The Development of Epistemic Sentence-ending Modal 175
wants to convey this information that is unknown to the listener, the speaker
may say Mary-ka ippu-ta (-ta with a high pitch).
In summary, I have discussed several examples to show that a number of
SE modal suffixes in Korean express evidentiality and the different status of
knowledge on the part of the speaker and the listener concerning an event or
state. These suffixes are used only in informal interactive contexts (e.g., a
conversation, or personal letter to a friend). They are not used in formal
situations (e.g., a formal speech), nor in contexts where the speaker/writer
does not have a particular listener/reader in mind, e.g., written reports. The
present study will suggest that such discourse-interactional features and non
verbal contextual cues that accompany them facilitate and enhance the acqui
sition of epistemic modal suffixes.
How often do adults use these various SE suffixes? As noted above, one
can always use the unmarked form -e, which simply denotes propositions that
are established in the speaker's mind. However, H. Lee (1991) found in his
informal conversation data that, although -e is used most frequently, it occurs
only in just over half of the utterances (58%). For the remaining utterances
(42%), 21 different SE suffixes are used, with varying degrees of frequency.
Among these, -ci was most frequent (15%) followed by -ta, -kwun, -ney, and -
toy, used at roughly equal frequency of 2%. (-ni was used only 1%). Thus,
although Korean speakers use -e most of the time, they use other SE suffixes
spontaneously to express different degrees of certainty about a proposition,
about the source of the information, and/or to check the status of knowledge
of the listener.
tween must and may in English in 'He must/may be home by 5 o'clock' has
little to do with the speaker's assumption about how much other participants
know about the proposition. Also, in most of the experimental studies on the
acquisition of epistemic modality, tests have been designed to understand
such context-independent reasoning. For example, in Moore et al. (1990),
children (between 3 and 6 years of age) were asked to guess the location of an
object hidden in a box solely on the basis of one sentence cues which varied in
epistemic modal auxiliaries, e.g, "It must be in the red box" or "it might be in
the blue box." Moore et al. found that ability to find the hidden object on the
basis of the modal meaning was shown only in children older than four years.
However, the results of these studies give little indication of how and when
children understand modal forms that incorporate discourse-interactional
meanings.
A growing body of research on communicative competence in children
shows that children learn to be good participants in conversations from a
young age. Several studies (Shatz 1983, 1984; Pellegrini, Brody & Stoneman
1987) have shown that even two-year-olds are capable of giving enough
information and keeping their linguistic contributions truthful and relevant to
the topic of discourse. Bloom, Rocissano & Hood (1976) have also shown that
before children reach two years, they have learned a basic rule of discourse,
that of conversational turns, and that between two and three years, children
increase the amount of information they contribute to the shared topic.
These studies suggest that the modal functions of SE suffixes in Korean
which relate to discourse interactions may be within children's cognitive
grasp and linguistic capacity from an early age. Furthermore, there are several
characteristic morphological features of SE suffixes that may facilitate early
acquisition of SE suffixes in Korean, particularly when we consider Slobin's
operating principles (1973). First, SE suffixes occur at the ends of sentences
(most often with one syllable consisting of a consonant and a vowel), and
therefore are perceptually salient. Second, the SE suffixes constitute an obliga
tory category in that all sentences in discourse interactions must end with a SE
suffix. During interactions with children, the caregiver provides a variety of
SE suffixes appropriate to specific discourse contexts. From the acquisition
perspective, this means that children hear different SE suffixes frequently from
their caregivers. Third, there is a relatively high degree of semantic transpar
ency in that agent-oriented and epistemic meanings are distinguished morpho
logically, and SE suffixes denote epistemic meanings most of the time. (Table
The Development of Epistemic Sentence-ending Modal 177
1 shows that all SE forms in Korean carry the type of epistemic meanings
discussed above, except -llay and -kkey which denote desire and intention
respectively. It should be noted, however, that, as discussed in a number of
studies, constructions that express desire and intention often develop a use for
prediction, which can be considered epistemic (Bybee 1985; Bybee &
Pagliuca 1985; H. Lee 1991). In fact, the two SE suffixes are always preceded
by the future tense marker -/.) Agent-oriented meanings are expressed pre
dominantly by auxiliary verbs, which require children to use two verbs in the
sentence, i.e. a main verb and an auxiliary. Also, the SE suffixes have only
modal meanings; they incorporate neither tense nor aspectual meanings. All
of these morphological characteristics, along with the discourse pragmatic
functions of SE suffixes, may enhance children's acquisition of SE suffixes.
4.1 Data
ever, if the child answers with either a different verb (e.g., hay-ss-e do-PAST-SE
'(I) did') or a different suffix (e.g., mek-ess-ci eat-PAST-SE '(I) ate'), or both
(e.g., hay-ss-ci do-PAST-SE '(I) did') the child's utterance is included in the
data.
As can be seen from Table 2, the order of acquisition of the modal
suffixes is strikingly similar across the three children. First, note that several
SE suffixes are productive in the children's speech before auxiliary verbs
appear. Second, we also see a consistent pattern across the three children in
the order of acquisition among the SE modals: -ta (Type 1), -e , -ci, -toy, and -
ta (Type 2), in the order mentioned. The two agent-oriented SE modals with
future meaning, -llay and -kkey, are both acquired after -ta, -e and -ci have
been acquired.
The general developmental pattern is that the children acquire the suf
fixes -ta (Type 1), -e, and -ci before the suffixes indicating future, and then the
other two epistemic suffixes, -toy, and -ta (Type 2) are acquired along with
some agent-oriented modal auxiliaries. These five SE suffixes are acquired
before 3 years in all three children. In what follows, I analyze the modal
meanings encoded by these suffixes in the children's speech.
In order to identify the meanings of these suffixes in the children's
speech, I have examined the linguistic as well as nonlinguistic contexts in
which the suffixes are used. Concerning the linguistic context, I have ana
lyzed the structure of discourse interactions between the child and the care
giver. This includes an analysis of discourse contingency (i.e. whether or not
the child's utterance maintains the shared topic) and the effect of preceding
utterances on the child's selection of a particular SE suffix. Since the children's
lexicon is rather limited at this early stage, I have also examined the kinds of
verbs to which a particular suffix is attached. As for nonlinguistic contexts, I
have analyzed the events, states, and entities that the children refer to in their
propositions. I have also examined the types of context which provide the
child with particular information, i.e. the source of information.
4.1.1 Acquisition of the suffixes -ta and -e: distinction between unassimilated
and assimilated knowledge.
The first two suffixes to appear in all three children's discourse were -ta (Type
1) and -e. These two suffixes were acquired at around the same time, i.e.
during the first month of study in all three children. HS used the two suffixes
productively from the first session (1;10). PL acquired -ta at 1;8 and -e one
The Development of Epistemic Sentence-ending Modal 179
HS PL TJ
Form Age* Age* Age*
SE suffixes
EPISTEMIC
-ta (Type 1) 1;10 1;8 1;9
-e 1;10 1;9 1;11
-ci/cyana 2;1 l;ll 2;2
-toy 2;5 2;l 2;3
-ta (Type 2) 2;5 2;9 3;0
DEONTIC/EPISTEMIC
-kkey 2;1 2;1 2;7
-llay 2;3 2;0 2;6
Modal auxiliaries
EPISTEMIC
2;8 3;0
-na pwa (inference)
month later (1;9), and TJ acquired -e at 1;9 and -ta three weeks later.
Whatever the order of acquisition of the two suffixes was, the two forms
served distinct functions in the speech of all three children, who used them to
make the same distinctions. The distinction between -ta and -e went through
two phases. During the first phase, -e was restricted to requests. Examples are
shown in (6) and (7).
(6) HS(1;10)
(Mother and HS open the refrigerator in the kitchen)
→ HS: uyu cw-e.
milk give-SE
'Give (me) milk.'
180 Soonja Choi
(7) TJ(1;9)
(TJ says to a friend)
→ TJ: ilwu o-a3.
here come-SE
'Come here.'
In (6), HS asks her mother to give her some milk, and in (7), TJ asks her friend
to come closer. These are appropriate uses of -e in that in the Korean adult
grammar, -e is used for imperative as well as declarative sentences. On the
other hand, the children used -ta in all statement sentences as illustrated in (8)
and (9).
(8) TJ(1;9)
(TJ putting one lego block on top of another)
→ TJ: olla ka-ss-ta.
Up gO-PAST-SE
'(It) went up.'
(9) TJ(1;9)
(TJ looks inside her doll house and sees that it's empty.)
→ TJ: eps-ta.
not-exist-SE
There is nothing.'
As shown in (8) and (9), -ta was used in statements to describe events
(example (8)) as well as states of affairs (example (9)). The contrastive use of
the two suffixes, -ta for statements and -e for requests, was shown by all three
children during the first phase. The distinction between statements and re
quests is parallel to the mood distinction between declarative and imperative:
-ta for the declarative mood, which expresses the speaker's assertion of
propositions, and -e for the imperative mood, which directs the listener to do
something. It seems therefore that at the beginning the two suffixes distin
guish mood in the children's speech. This supports Stephany's finding (1986)
that mood distinctions occur early.
At this early period of language development, verbs attached to -ta were
limited in number and were restricted to those that reflected the child's
cognitive interests in trying to understand the world. That is, the kinds of
events and states of affairs that the children described (all with -ta) were
The Development of Epistemic Sentence-ending Modal 181
(11) PL(1;9)
(Seeing grandfather's glasses)
→ PL: apuci-kke-ta.
grandpa-POSS-SE
'(They're) grandpa's.'
(12) HS(1;10)
(HS puts a lego person in a chair.)
→ HS: tway-ss-ta.
become-PAST-SE
'done'
(13) (TJ1;11)
M: Mickey eti iss-ni?
Mickey where exist-SE?
'Where is Mickey?'
(TJ pointing to the picture of a Mickey Mouse on her doll house)
→ TJ: Mickey yeki iss-ta.
Mickey here exist-SE.
'Mickey is here.'
The events described in (10)-(13) above occur in the children's immedi
ate context, and are newly registered in their mind. In (10), TJ describes the
scene which has drawn her attention, and in (11), PL points out that the
glasses belong to her grandpa. In (12), HS marks the completion of her goal as
the action was just accomplished, and in (13) TJ has just found the picture of
Mickey. These examples illustrate typical contexts in which -ta is used, and
they can be categorized as follows:
(a) describing a scene in a picture;
(b) describing events/states that the child has just observed:
i. a perfective aspect which results in a particular state, e.g.,
nemecy-ess-ta 'fallen down', ollaka-ss-ta, 'went up',
ii. an ongoing event/state e.g., can-ta 'sleeping',
iii. naming a referent, appa-(i)-ta4 'daddy'
(c) commenting on the existence or non-existence of an event/object.
These three types were the dominant types of contexts for -ta from the first
phase (i.e. when all statements ended with the suffix -ta), and continued to be
the dominant types after -e was also used for declaratives. Table 3 shows the
The Development of Epistemic Sentence-ending Modal 183
Table 3. Context analysis of-ta. during the first six months of its contrastive use of with -e.
HS PL TJ
(l;10-2;4) (l;10-2;4) (l;ll-2;5)
%(N) %(N) %(N)
a. picture scene 5(14) 19 (23) 29 (32)
b. present event/state 45(116) 28 (34) 49 (55)
c. (non-)existence 46(119) 17 (20) 18 (29)
Table 4. Context analysis of-e during the first six months of its contrastive use with -ta.
HS PL TJ
(l;10-2;4) (l;10-2;4) (l;ll-2;5)
%(N) %(N) %(N)
a. picture scene 5(8) 19 (22) 27 (28)
b. present event/state 25 (40) 5(6) 12(13)
c. (non-)existence 4(7) 5(6) 7(7)
frequency of -ta used in these three types during the first six months from the
time -ta and -e were both used for declarative sentences. As Table 3 shows,
the children used -ta most frequently in the three types of context (a), (b) and
(c). (Further categories (d)-(f) will be introduced below.) This suggests that
the semantic content of propositions with -ta was something that the children
had just perceived through direct experience. The children seemed to describe
the events as they became aware of them.
Table 4 shows the frequency of -e in different types of contexts. The
distribution of -e in these contexts is clearly different from that of -ta.
Although -e is used in categories (a)-(c), these amount to fewer than 50% of
all the uses of -e. (The contrastive uses of -ta and -e in (a) - (c) will be
discussed later.) The majority of the -e uses occur in categories (d)-(f):
(d) to give information about a past event/state;
(e) to convince the listener of an event/state of affairs, or to talk about an
event/state which was not occurring at the time of speech (e.g., negation,
actions which the child was about to perform, make-believe events while
playing with a doll or a toy), and;
(f) in questions to verify the truth of a proposition (see Table 4).
These types are illustrated in examples (14) - (17).
(14) PL(1;11)
(Grandmother talks to another adult about the hair salon where PL
has recently had her hair cut. PL, hearing all this conversation, says)
→ PL: polami meli ippukey hay-ss-e.
PL hair prettily do-PAST-SE.
'PL had (her) hair done. It was pretty.'
(15) TJ(2;2)
(TJ is in the middle of a book, but doesn't want to read anymore. TJ
closes the book.)
—» TJ: eps-e
not-exist-SE.
'no more.'
(16) TJ(1;11)
(TJ is in another room. M asks TJ to bring a color book.)
M: ppalli kacko o-a. illwuo-a.
quickly take come-SE here come-SE
'Bring (it) quickly. Come here.'
The Development of Epistemic Sentence-ending Modal 185
—» TJ: an ka-a
not go-SE
'(I'm) not going.'
(17) HS(2;0)
(HS gives a piece of apple to the Investigator (Inv). Inv. doesn't
want any more. HS holding the apple,)
—> HS: yoke-nun acci-kke nwu-ka hay(-e)5?
this-TOP uncle-poss who-suBJ do(-SE)?
'Who will do (=eat) uncle's apple?'
(By this question, HS implies that it is she who wants to eat the apple. In other
words, in her mind 'who' refers to HS herself, and she asks the question to get
permission from the investigator.)
In (14), PL states a past event that is well known to her, and in (15) the
child creates a desired state (i.e. because the book is closed, reading is
finished) and describes it to convince the adult of its truth. In both (14) and
(15), the information that the child conveys has been established in her mind
before actually saying it. Examples (16) and (17) show that -e is used also for
negations and questions. In (16), M has asked TJ to come back to the living
room, and TJ negates the proposition. In (17), the child asks about a proposi
tion that is on her mind, i.e. she wants to eat the apple. In both cases, the
proposition negated or questioned has already been established in the child's
knowledge system.
Among the categories (d)-(f), there were, however, individual differ
ences in frequency: HS used -e most frequently in wh-questions, whereas PL
and TJ used it most frequently for past events/states. Nevertheless, the catego
ries (d)-(f) taken together constitute more than 50% of all the -e uses in all
three children's speech. This is different from the uses of -ta which was rarely
produced in these contexts.
Several further analyses support the interpretation that -ta and -e are used
for unassimilated/new and assimilated/old knowledge, respectively. In Table
4, one notices that, although -e is used in categories (d)-(f) more than half of
the time, it is also used in categories (a)-(c) quite often. Specifically, HS used
-e for picture scenes (category (b)) 25% of the time, PL and TJ used it for
present events/states (category (a)) 19% and 27% of the time, respectively.
Thus, both -ta and -e were used for describing present situations. However,
the two suffixes contrasted in a systematic way. One contrast was related to
186 Soonja Choi
the way the two suffixes were used in repetitions of a proposition. When -ta
and -e occurred within one conversational turn, the first mention of the
proposition was marked by -ta, and subsequent repetitions then switched to -e.
This is shown in the following examples:
(18) PL(2;0)
(PL finds the balloon she has been looking for.)
PL: yeki iss-ta. yeki iss-e. yeki iss-e.
here exist-SE. here exist-SE. here exist-SE.
'Here (it) is. Here (it) is. Here (it) is.'
(19) TJ(2;3)
(The picture of an elephant in a book and the elephant design on her
pants are similar.)
TJ: ike ttokkath-ta. ike ttokkath-e.
this same-SE this same-SE
'These are the same. These are the same.'
In (18), PL finds the balloon unexpectedly. She gets excited and describes the
event with -ta first. However, at the second mention of the same proposition,
she switches to -e. A similar explanation can be given to TJ's utterance in
(19). As TJ recognizes that the two pictures are the same, she uses -ta (the
concept of sameness interested TJ greatly at this period). But once the infor
mation has been registered in her mind, she switches to -e. This sequence of -
ta followed by -e occurred a number of times in the speech of all three
children. The reverse order never occurred. Such a fixed order of -ta and -e
within one conversational unit also occurs in adult speech, and apparently the
children learned this order without errors.
Another distinction between -ta and -e had to do with verb types. As
noted above, at the beginning, verbs attached to -ta were limited in number
and restricted to those that reflected the cognitive concepts that the child was
developing at the period of time in question. Most notable verbs were: iss-ta
'exist', ep-ta 'not-exist', tway-ss-ta 'become', kath-ta 'same', ollaka-ss-ta
'went up'. In addition, -ta was also used frequently after a noun to label
entities (e.g., appa-ta '(It's) daddy'). These verbs were used with high fre
quency and co-occurred only with the suffix -ta during the first few months.
Gradually however, propositions about existence, success, change of location,
or names of things involved in the here and now were encoded with -e,
particularly when an event or state was expected by the child (see examples
(15), (18), and (19) above, also (20) below). As the children's cognitive
The Development of Epistemic Sentence-ending Modal 187
Table 5. Frequency of occurrence of-ta. and -e in relation to the discourse topic during
the first six months, (from Choi 1991)
-ta -e
Related* Unrelated** Related Unrelated
HS 9% 91% 65% 35%
PL 19% 81% 57% 43%
TJ 24% 76% 64% 36%
*Related to the immediately preceding utterance, or to the current topic.
**Unrelated to the topic, introduction of a new topic.
(21) TJ(2;0)
(Investigator points to the girl that she colored)
Inv.: i salam ippu-cil
this person pretty-SE?
'Isn't this person pretty?'
TJ: ippu-ci.
pretty-SE
'pretty.'
Inv.: i salam ippu-cil
this person pretty-SE?
'Isn't this person pretty?'
(TJ hears a baby cry upstairs.)
→ TJ: aka wu-n-ta.
baby cry-PRES-SE.
'A baby is crying.'
Inv.: aka wul-e?
baby cry-SE
'Is baby crying?'
In (21), as TJ hears a baby cry, she comments on this, abruptly interrupt
ing the topic of coloring. This non-contingency occurred in 83% (on average)
of the children's -ta uses. In contrast, the propositions with -e showed a much
lower frequency of non-contingency (39% on average). In fact, more than half
of the time, the propositions with -e maintained the current topic. More
specifially, -e was used to respond to or comment on the preceding adult
utterance. PL's utterance in (14) above is a good example of providing more
information on the current topic (i.e. about PL's hair).
In summary, several types of analyis have shown that the children in the
present study used the modal suffixes -ta and -e for distinct functions. -Ta was
used to encode a new and noteworthy proposition as the child became aware
of its meaning. At the beginning, propositions with -ta also reflected cognitive
concepts that the children were developing. At a later stage, -ta encoded new/
unassimilated information that the child experienced directly in the here and
now. In both cases, -ta was often used to introduce a new topic during
interaction. In contrast, -e was used for propositions that were already estab
lished in the child's knowledge system, and its function in discourse was to
contribute more information to the current topic.
The Development of Epistemic Sentence-ending Modal 189
4.1.2 Acquisition of-ci: shared knowledge and certainty of the truth of the
proposition
The third modal suffix to be acquired by the children was -ci. (The children
also acquired -cyana around the same time, and the data in this section include
both -ci and -cyana.) With the use of -ci, the interactional function of modal
suffixes becomes even clearer. Initially, -ci was used in the following con
texts: (a) when reiterating a proposition in the preceding utterances produced
either by the child herself or by the interactant (example 22), or (b) when
redescribing an event or state which had been described several times before
(example 23).
(22) PL(1;11)
(PL pointing to Grandmother's hairpin)
PL: ike hammwuni kke-ya?
this grandma POSS-SE
'Is this grandma's?'
GM: ung, hammwuni kke-ya.
yes, grandma POSS-SE.
'Yes, grandma's.'
—» PL: hammwuni kke-ci?
grandma POSS-SE
'Grandma's?'
GM: ung, hammwuni kke-ci.
yes, grandma POSS-SE
'Yes, grandma's.'
(23) PL(2;2)
(PL remembers seeing a monster on TV.)
PL: kweymwul an mwusew-e?
monster NEG scary-SE
'Monster is not scary?'
GM: kweymwul eps-e.
monster not-exist-SE
'There's no monster.'
→ PL: kweymwul Tibi-ey-na iss-ci?
monster tv-LOC-only exist-SE?
'Monsters exist only on TV?'
190 Soonja Choi
(24) PL(2;6)
(PL and Mother talk about whether neighbor houses have their
lights on or not.)
PL: emma. ce cip-ey-nun pwul-i kke-ss-e
mommy, that house-LOC-TOP light-suBJ turn off-PAST-SE.
'Mommy. Lights are off in that house.'
M: etil
where
'where?'
PL: yeki
here
'here.'
M: ung, ce wuit cip-ey.
yes that above house-LOC
'Yes, that house on top.'
PL: ung, wuit cip-ey.
yes above house-LOC
'Yes, the house on top.'
M: ung, ung.
yes yes
'Yes, yes.'
—» PL: ce alay cip-un pwul khye-cye iss-ci.
that below house-TOP turn-on-PASS be-SE.
'Lights are on in that house below.'
M: kule-kwuna.
SO-SE
'It is so.'
(25) HS(2;9)
(One coin chip is stuck in toy cash register)
Inv.: kelye-ss-cyana
stuck-PAST-SE
'(It's) stuck.'
HS: kelye-ss-e?
Stuck-PAST-SE
'(It's) stuck?'
192 Soonja Choi
Inv.: ung.
yes
'Yes.'
HS: ike kocang na-ss-nunka pwa.
this obstacle arise-PAST-coNN seem
This seems to be broken.'
Inv.: kocang an na-ss-e
obstacle NEG arise-PAST-SE.
'(It) isn't broken.'
HS: kocang na-ss-e.
obstacle arise-PAST-SE
'(It)'s broken.'
Inv.: tasi ha-l-kkey.
again do-FUT-SE
'(I')H do it again.'
(Inv. tries but it still gets stuck)
HS: an tway.
NEG become
'(It) doesn't work.'
Inv.: an tway.
NEG become
'(It) doesn't work.'
→ HS: kocang na-ss-ci.
obstacle arise-PAST-SE
'(It)'s broken.'
In (24), the child maintains the current topic, and tells whether the lights are
on or off on the basis of what she sees, i.e. perceptual evidence. In this
example, PL first confirms with her mother that the lights are on in the upper
apartment. This discussion leads PL to draw the conclusion that the lights in
the lower apartment are off by contrast, and PL marks her certainty of this
knowledge by -ci. In example (25), being broken becomes more and more
certain to HS as the coin keeps getting stuck. Also, in (25), the meaning
kocang na-ta 'be broken' is expressed several times by the child herself
before marking it with the suffix -ci. This use of -ci is analyzed as certainty
developed through discourse (category d in Table 6). At this developmental
stage, certainty constructed through discourse is often accompanied by per-
The Development of Epistemic Sentence-ending Modal 193
(26) HS(2;9)
(HS putting coins one by one into the toy cash register.)
→ HS: yoke ha-ko yoke ha-ya-ci.
this do-then this do-must-SE
'(I) do this, then (I) must do this.'
(27) PL(2;2)
(PL playing doctor.)
→ PL: cinchal ha-myen ei wul-myen nappen ke-ci.
examine do-then ei cry-if bad NOMINAL-SE.
'It is a bad thing to cry ei (sound of crying) when (the
doctor's) examining.'
This represents category (g) in Table 6. The obligation connective and the
nominalizer in Korean denote obligation and normativity, respectively. This
is interesting because the two meanings are closely related in that when one is
obligated to do something or when one normally does something in a given
situation, the predicated event acquires a degree of certainty. In other words,
obligation and normativity often lead to certainty of an event occurring.
Studies of modality in adult language show that there is a close relationship
between obligation and certainty (Bybee 1985). Also, Steele (1975), in a
cross-linguistic study of modality, argues that for a systematic relation be
tween deontic and epistemic modality. In particular, Steele's data show that in
many languages obligation and certainty of information are expressed by the
same morpheme. In Korean, modal forms for obligation and certainty are
distinct; however, the two forms frequently co-occur. The three children
acquired such co-occurrences from an early stage.
As in adult speech, the suffix -ci was frequently used in yes/no questions
asking the caregiver to confirm the truth of a proposition: yes/no questions
comprised 35% of -ci utterances in HS's speech, 25% in PL's speech, and
38% in TJ's speech (these are incorporated in the appropriate pragmatic
categories in Table 6). In addition, -ci was used in wh- questions. Initially, this
is suprising because wh- questions are asked when some particular informa-
194 Soonja Choi
Table 7. Number of-tay utterances in different types of context f rom its acquisition till 3;0.
HS PL TJ
(2;5-3;0) (2;l-3;0) (2;3-3;0)
a. what X would say or feel 3 44 7
b. reported speech 3 10 7
c. during storytelling - 9 9
d. errors with 1st person 1 1 1
Total -tay utterances: 7 64 24
with a doll as a baby, and reports what the baby might feel. Obviously, the
baby (i.e. the doll) said nothing, but PL infers that the baby felt sick and
reports this to her mother, who may not share this information. A similar
explanation can be offered for (29): TJ infers that Ernie might hurt his feet by
jumping many times. In (30), HS reports to the investigator what her sister has
just said, appropriately with the suffix -toy.
(28) PL(2;1)
(PL plays with a doll with her Mother.)
→ PL: aka-ka aphu-tay.
baby-suBJ sick-SE.
T h e baby is sick.'
(29) TJ(2;4)
(TJ looking at a picture of Ernie jumping)
Inv.: jumphu-ha-e?
jump-do-SE?
'(Is he) jumping?'
→ TJ: pal ayaya ha-n-tay.
foot a y a y a (typical sound made when one is hurt) do-PRES-SE
'He says his feet are hurt.'
Inv.: ung.
yes
'Oh, I see.'
(30) HS(2;5)
(HS and her older sister are coloring.)
HS's sister: nayka saykchil hay cwu-kkey.
I-SUBJ coloring do give-SE.
T will color (it) for you.'
(HS immediately reports to Inv.)
→ HS: enni-ka saykchil hay cwu-n-tay.
sister-suBJ coloring do give-PRES-SE.
'(My) sister says that she will color (it) for me.'
As shown in the above examples, all of the utterances with -tay were reports
about what a third party was feeling or had said. That is, the information with
-tay did not come from direct experience but from inference or other indirect
sources (except for one error in each child's sample which was about the child
196 Soonja Choi
(31) TJ(3;0)
(TJ showing the Investigator her new bracelet on the wrist.)
→ TJ: na phalci iss-ta↑
I bracelet exist-SE
'I have a bracelet!'
Inv.: TJ, phalci iss-e?
TJ, bracelet exist-SE
T J has a bracelet?'
TJ: ung.
yes
'Yes.'
(32) TJ(3;0)
(TJ tells the Investigator what she did before the Inv. came.)
→ TJ: akka-nun Cwunwu-lang pizza mek-ess-ta↑.
before-Top Cwunwu (TJ's friend)-with pizza eat-PAST-SE
T ate pizza with Cwunu awhile ago!'
Inv.: kula-e?
SO-SE
Ts that so?'
(33) HS(2;9)
(HS putting her sunglasses in her pocket, tells the Inv.)
—» HS: nay kyowhay ka-lttay yoke kac-ko ka-nunke-ta↑
I church go-when this take-coNN go-NOMIN-SE.
'This is something that I take to church!'
In all three examples above, the information that the child gave to the
investigator was new to the latter. The child marked this assumed knowledge
The Development of Epistemic Sentence-ending Modal 197
gap in the listener's mind with the suffix -ta. It should be noted that the
information with -ta (Type 2) was often something the child wanted to brag
about to the adult, thus it carried a certain affective component.
In summary, both -tay and -ta convey information the child believes the
listener doesn't have. The suffix -tay marks information that comes from an
indirect source: either inference about what another person would say or
would feel or a report of indirect speech. The suffix -ta (Type 2) with a high
pitch expresses something that the child experienced directly and which the
listener might not know. It is interesting to note that the productive uses of the
two suffixes, -tay and -ta (Type 2) all occurred later than -ci. This suggests
that an understanding of differences in knowledge between the child and the
listener develops after the child learns to express shared knowledge.
The data of this study have shown that Korean children acquire and use
several epistemic SE suffixes appropriately at an early stage. Specifically,
between 1 ;9 and 3;0 the three children in the study acquired five SE suffixes to
mark distinct meanings. The meanings were distinguished in terms of the
following features: degrees of integration of a proposition in the child's
knowledge system, source of information, and the child's belief about what
the listener knows. In particular, the children in this study distinguished
between newly perceived information that was not yet assimilated into their
knowledge system and old/assimilated information (-ta (Type 1) vs. -e);
between knowledge shared vs. not-shared with the listener {-ci vs. -tay/-ta),
and between direct and indirect sources of information {-ta (Type 2) vs. -tay).
The data suggest that these oppositions are acquired in a consistent order.
Figure 1 attempts to illustrate this development. First, -ta is acquired to denote
newly perceived information that attracts the child's interest at the time of
speech. Then, -e is acquired to contrast the new knowledge with old/assimi
lated knowledge. The new vs. old knowledge distinction is related to the
child's own knowledge system and does not seem to be related to whether or
not the knowledge is shared with the listener. However, as -ci is acquired, a
new component, the feature of shared knowledge, is added to the SE modal
system. The propositions with -ci in the children's speech predominently
contain information related to the shared topic: 85% of the -ci uses have
discourse contingency. In addition, -ci expresses knowledge shared by the
198 Soonja Choi
Development of | >
Modal meaning: Degree of assimilation of knowledge in the speaker's mind
I >
Listener's status of knowledge about a proposition
Source of information
Figure 1. Development of epistemic functions
child and the listener. The nature of this shared knowledge is based on
discourse or perceptual evidence. This function of -ci leads to marking propo
sition as certain. Later, as -toy is acquired, shared knowledge comes to be
differentiated from non-shared knowledge, i.e. the child informs the listener
of a proposition the latter may not know. Furthermore, -toy denotes informa
tion that comes from an indirect source. Lastly, -ta contrasts with -toy in
marking a proposition as reflecting the child's direct experience. This devel
opmental order suggests that as each new modal form and its corresponding
function are acquired, a new domain of epistemic modality is added to the
existing one(s), as shown in Figure 1.
One explanation for these results has to do with children's general
cognitive development and capacity at this stage. We have seen that the three
children acquired their first SE suffixes to express their own knowledge status,
and subsequently to incorporate the listener's knowledge status. This con
forms to the general view that children's use of language develops from
egocentric to decentered (Piaget 1955). Also, a closer look at the contexts in
which particular propositions were expressed suggests that at the earliest
stage in the children's development of SE modals, propositions were related to
the information in the immediate context. In particular, information with the
suffix -ta (Type 1) was closely tied to what was going on at the time of speech.
Then, the children acquired -e and -ci to convey information about past
events. This parallels the pattern of a child's general cognitive development,
which proceeds from an understanding of present events to an understanding
of events removed from the present (i.e. past or future). Not only across
suffixes, but also with respect to a single suffix, the development of functions
The Development of Epistemic Sentence-ending Modal 199
(24') PL(2;6)
(PL and Mother have been engaged in a pretend play for awhile. PL
plays a student who came to study in Boston.)
M: kongpwuha-le o-si-ess-eyo.
study-coNN come-HON-PAST-SE.
'(You) came (here) to study.'
PL: ney.
yes
'Yes'
(PL sees a light in a neighboring house.)
→ PL: emma, ce, ceki yep cip-ey pwul khye-cye-iss-ta.
mommy there there side house-LOC turn on-PASS-be-SE
'Mommy, the light is on in that neighbor's house.'
M: ung. ce cip pwul khye-iss-kwuna.
yes. that house light turn on-be-SE.
'Yes, the light is on in that house.'
—» PL: emma. ce cip-ey-nun pwul-i kke-ss-e.
mommy, that house-LOC-TOP Iight-SUBJ turn off-PAST-SE.
'Mommy. Lights are off in that house.'
M: etil
where
'where?'
PL: yeki
here
'here.'
M: ung, ce wuit cip-ey.
yes that above house-LOC
'Yes, that house on top.'
PL: ung, wuit cip-ey.
yes above house-LOC
'Yes, the house on top.'
M: ung, ung.
yes yes
'Yes, yes.'
—» PL: ce alay cip-un pwul khye-cye iss-ci.
that below house-TOP turn-on-PASS be-SE.
'Lights are on in that house below.'
The Development of Epistemic Sentence-ending Modal 201
M: kule-kwuna.
SO-SE
'It is so.'
In this example, as PL sees the light, she gets excited and describes what
she sees, abruptly changing the topic from the pretend play to lights in the
neighbors' houses. PL encodes this newly perceived information with -ta.
Once the topic is established and the information registered in her mind, the
child describes other houses with the suffix -e. As the exchange of informa
tion on lights being on or off progresses, PL changes to -ci to encode shared
information and its certainty. The three suffixes thus mark various types of
knowledge status that develop within a particular discourse interaction.
Early sensitivity to the discourse functions of SE suffixes in Korean
correlates well with the findings on the acquisition of discourse-pragmatic
phenomena for English-speaking children: children use language in request
ing information and answering questions from the one-word period (Dore
1974; Shatz 1983). Several studies on discourse contingency between care
giver and child have shown that children imitate caregivers' utterances from
early on. One function of imitation is maintaining a shared topic. Bloom et al.
(1976) shows that imitation is crucial to the development of the discourse skill
of contributing new information to a shared topic. Also, Pellegrini et al.
(1987) demonstrate that two-year-old children show signs of observing the
cooperative principles, e.g., giving informative and truthful information to
construct shared knowledge relevant to the topic of discourse. The present
study suggests that such an ability to follow the progression of discourse
toward more and more shared knowledge between the speaker and the listener
is instrumental to the early acquisition of SE modal suffixes. This would also
explain the late acquisition of pure epistemic modal forms whose meanings
are relatively context-independent (e.g., English epistemic modal auxiliaries).
Essential to children's sensitivity to discourse interactions is the affective
component that exists between the caregiver and the child. First, all of the SE
suffixes are used in informal conversations between participants who are
intimate with one another. For example, by repeating shared information
using -ci, and by checking and making sure that it is shared with the caregiver,
the child builds a bond between herself and the caregiver. Also, the suffix -ta
(Type 2) is used when the child wants to brag about something to the adult.
Such an affective component may enhance the acquisition of abstract con
cepts in ways similar to what Akatsuka and Clancy (1991) describe in relation
202 Soonja Choi
ABBREVIATIONS
NOTES
1. In colloquial speech, sov order is not always observed. In consequence, SE suffixes may
occur in the middle of the sentence, as in the following example.
kho ca-n-ta thokki-ka.
well sleep-Present-SE rabbit-suBJ
'The rabbit is sleeping soundly.'
2. I am very grateful to Pat Clancy and Youngjoo Kim for allowing me to analyze HS'S and
PL's data respectively.
3. -e and -(y)a are allomorphs. -e occurs after high and mid-high vowels and after all
consonants. -(y)a occurs following low and mid-low vowels.
4. In this case, the full form is appa-i-ta (daddy-cop-SE), however, the copula verb i 'be' is
often deleted after a vowel.
5. -e is deleted after a vowel.
The Development of Epistemic Sentence-ending Modal 203
REFERENCES
Akatsuka, Noriko. 1985. "Conditionals and the Epistemic Scale". Language 61.625-639.
Akatsuka, Noriko & Patricia Clancy. 1991. "Conditionally and Deontic Modality in
Japanese and Korean". Paper presented at The Second Southern California Conference
on Japanese/Korean Linguistics. Santa Barbara, California.
Aksu-Koç, Ayhan. 1988. The Acquisition of Aspect and Modality: The Case of Past
Reference in Turkish. Cambridge, England: Cambridge University Press.
Aksu-Koç, Ayhan & Dan Slobin. 1986. "A Psychological Account of the Development
and Use of Evidential in Turkish". Chafe & Nichols. 1986. 159-167.
Bloom, Lois, L. Rocissano & L. Hood. 1976. "Adult-Child Discourse: Developmental
Interaction between Information Processing and Linguistic Knowledge". Cognitive
Psychology 8.521-552
Brown, Roger. 1973. A First Language. Cambridge, Mass.: Harvard University Press.
Bybee, Joan. 1985. Morphology: A Study of the Relation between Meaning and Form.
Amsterdam: John Benjamins.
Bybee, Joan. & William Pagliuca. 1985. "Cross-Linguistic Comparison and the Develop
ment of Grammatical Meaning". Historical Semantics and Historical Word Formation
ed. by J. Fisiak, 59-83. The Hague: Mouton.
Byrnes, James P. & Michelle A. Duff. 1989. "Young Children's Comprehension of Modal
Expressions" Cognitive Development 4.369-387.
Chafe, Wallace & Johanna Nichols "eds." 1986. Evidentiality: The Linguistic Coding of
Epistemology. New Jersey: Ablex.
Choi, Soonja. 1991. "Early Acquisition of Epistemic Meanings in Korean: A Study of
Sentence-Ending Suffixes in the Spontaneous Speech of Three Children". First Lan
guage 11.93-119■.
DeLancey, Scott. 1986. "Evidentiality and Volitionality in Tibetan". Chafe & Nichols.
1986. 203-213.
Dore, John. 1974. "A Pragmatic Description of Early Language Development". Journal of
Psycholinguistic Research 3.343-350.
Gopnik, Alison & Soonja Choi. 1990. "Do Linguistic Differences Lead to Cognitive
Differences? A Cross-Linguistic Study of Semantic and Cognitive Development".
First Language 10.199-215.
Gopnik, Alison & Andrew Meltzoff. 1984. "Semantic and Cognitive Development in 15
to 21 Months-Old Children". Journal of Child Language 11.594-513.
Hirst, William & Joyce Weil. 1982. "Acquisition of Epistemic and Deontic Meaning of
Modals". Journal of Child Language 9.659-666.
Kim, Young-joo. 1992. "The Acquisition of Korean". Unpublished manuscript.
Lee, Hyo Sang. 1985. "Consciously Known but Unassimilated Information: A Pragmatic
Analysis of the Epistemic Modal Suffix "-kun" in Korean". Paper presented at The
First Pacific Linguistics Conference, University of Oregon.
Lee, Hyo Sang. 1991. "Tense, Aspect, and Modality: a Discourse-Pragmatic Analysis of
Verbal Affixes in Korean from a Typological Perspective." Doctoral dissertation,
University of California at Los Angeles.
204 Soonja Choi
Lee, Hyo Sang. 1993. "Cognitive Constraints on Expressing Newly Perceived Informa
tion: With Reference to Epistemic Modal Suffixes in Korean. Cognitive Linguistics
4.135-167.
Lee, Keedong. 1986. "Pragmatic Function of Sentence Enders". Inmunkwahak (Humani
ties) 56.41-59.
Moore, Chris, Kiran Pure & David Furrow. 1990. "Children's Understanding of the
Modal Expression of Speaker Certainty and Uncertainty and Its Relation to the
Development of a Representational Theory of Mind". Child Development 61.722-730.
Nichols, Johanna. 1986. "The Bottom Line: Chinese-Pidgin Russian". Chafe & Nichols
1986.239-257.
Palmer, F. R. 1986. Mood and Modality. Cambridge, England: Cambridge University
Press.
Pellegrini, Anthony, Gene Brody & Zolinda Stoneman. 1987. "Children's Conversational
Competence with Their Parents". Discourse Processes 10.93-106.
Piaget, Jean. 1955. The Language and Thought of the Child. New York: Meridian Books.
Shatz, Marilyn. 1983. "Communication." Handbook of Child Psychology, Vol 3, ed. by
John Flavell & Ellen Markman, 841-890.
Shatz, Marilyn. 1984. "Answering Appropriately: A Developmental Perspective on Con
versational Knowledge". Discourse Development ed. by Stan Kuczaj. New York:
Springer-Verlag.
Slobin, Dan I. 1973. "Cognitive Prerequisites for the Acquisition of Grammar". Studies of
Child Language Development ed. by Charles A. Ferguson & Dan I. Slobin, 175-208.
New York: Holt, Rinehart & Winston.
Slobin, Dan I. 1985. "Crosslinguistic Evidence for the Language Making Capacity". The
Crosslinguistic Study of Language Acquisition, Volume 2: Theoretical Issues, ed. by
Dan I. Slobin, 1157-1256. Hillsdale, NH: Lawrence Erlbaum.
Steele, Susan. 1975. "Is it Possible?" Working Papers on Language Universals. Stanford
University. 18:35-58.
Stephany, Ursula. 1986. "Modality". Language Acquisition (2nd Edition), ed. by Paul
Fletcher & Michael Garman. Cambridge, Mass.: Cambridge University Press.
Tomasello, Michael. 1992. First Verbs. Cambridge, England: Cambridge University
Press.
Traugott, Elizabeth. 1989. "On the Rise of Epistemic Meanings in English: An Example
of Subjectification in Semantic Change". Language 65.31-55.
Wells, Gordon. 1979. "Learning and Using the Auxiliary Verb in English". Language
Development, ed. by Victor Lee. New York: John, Wiley and Sons.
Wells, Gordon. 1985. Language Development in the Preschool Years. Cambridge, Eng
land: Cambridge University Press.
Woodbury, Anthony. 1986. "Interactions of Tense and Evidentiality: A Study of Sherpa
and English". Chafe & Nichols. 1986. 188-202.
Zoh, Myeong-Han. 1981. Hankwuk Atong-uy Ene Whoyktuk Yenkwu [A Study of Lan
guage Acquisision in Korean Children] Seoul: Seoul National University Press.
The Interactional Basis of the Mandarin
Modal néng 'CAN'*
Jiansheng Guo
University of California, Berkeley
1. Introduction
However, these researchers have not gone far enough in specifying the
nature of the force dynamics of modals. They elaborate on the the forces that
are exerted, e.g. physical, social, or rational enablement. But they do not
clarify what kinds of counter forces are exerted. Sweetser (1982) says nothing
about the nature of the opposing force, while Talmy (1988) says only that the
opposing force is backgrounded. Yet, the very essence of force dynamics is
the tension between the two forces. I will argue that the opposing force is as
important as the initial force. However, the opposing force is more likely to be
found in discourse interactions than in semantic content. The tendency to
consider only the semantic content of the modals may be one reason why the
opposing force is hard to specify or regarded merely as backgrounded. I
suggest that only by considering discourse function as well as semantic
content can we fully understand the force dynamics of modals. Using the
example of children's use of Mandarin néng, I will argue in this paper that the
force dynamics of modals is motivated primarily by interpersonal tension in
interactional contexts.
The paper is organized in three parts. First, it will give a distributional
description of the meanings of néng across three different age groups, 3, 5,
and 7 year olds. The order of development of the different meanings of néng
will be shown to be:
2. Data
Age 3 5 7
Frequenty 92 74 90
3 72% 19% 9% 92
5 45% 12% 43% 74
7 52% 11% 26% 12% 90
the social meaning in the 5 year olds' uses shows a much higher frequency,
making up 43% of total uses. This indicates that by age 5, néng has acquired a
new meaning of social permission. The 7 year olds' uses of néng are compara
ble to those of the 5 year olds in the physical and ambiguous categories, but
social meaning is used significantly less than among 5 year olds. But the 7
year olds seem to be developing a new meaning, the epistemic-like meaning.
In the next section, I will discuss in detail the discourse functions and
semantic content of each category, and argue for a functionalist explanation of
the semantic changes of the modal in child development. I will focus on how
néng is used in interaction, and on the discourse functions this morpheme is
intended to serve. For this purpose, the néng utterances within each meaning
category will be further categorized according to discourse functions.
speaker interpreted it that way. More often, however, the challenged assump
tion is not explicitly verbalized. But the speaker believes that the addressee
holds the assumption. When children say that they can do X with the focus on
ability, they believe that their ability is at issue. In general, these beliefs do fit
with adult expectations of children's abilities. Most cultures regard children
as weak, not fully competent, fragile, and in need of adult protection; and this
view of children inheres in the belief structure underlying the néng utterances
produced by the Mandarin-speaking children in this study. 10 Néng utterances
provide a strategy for signaling contradiction of the assumed adult beliefs,
thereby constituting an explicit situation of force dynamics. The presupposed
addressee's assumption constitutes the force that will presumably continue if
it encounters no obstacle; the néng utterance constitutes the counter-force that
challenges and seeks to block that assumption. I gloss this discourse function
as 'Do not assume X', thereby emphasizing that it is highly dynamic and
transitive at the interpersonal level.
While I am convinced that the discourse function of néng is to challenge
the addressee's assumptions, I am not claiming that this is also its semantic
meaning (in the sense discussed earlier). In (6), a clear sense of physical
ability is referred to by néng, which serves as the focus of the interpersonal
force dynamics. Following Lyons' definition of semantic meaning (his de
scriptive meaning) as referring to an event or state of affairs in the external
world, I take the semantic meaning of néng in the category of boasting to be
physical ability.
Although the discourse function and the semantic meaning of néng are
separate components, the two are closely interrelated. The obstacle, in the
semantic meaning, does not merely exist in the objective world, but has a
deeper social and psychological origin in the presupposed addressee's expec
tations, which the child speaker seeks to challenge. It is here that the abilita-
tive meaning and the discourse function of challenge find their connection.
In summary, the semantic meaning of néng in boasting utterances is that
the doer has the ability to overcome an obstacle. In contrast, its discourse
function is to counteract the addressee's assumption concerning this ability.
At the semantic level, force dynamics operates between ability and a diffi
culty in the physical domain. At the pragmatic level, it operates between the
addressee's expectation and the speaker's challenge to that expectation in the
interpersonal domain. The meanings at the two levels are intrinsically con
nected. The physical obstacle has social and psychological origins in the
The Interactional Basis of the Mandarin Modal néng 'CAN' 215
3.1.3. Problem reporting. The third subcategory under challenge to the ad
dressee's assumptions is problem reporting. Here, the child reports to the
adult addressee on a problem in the current activity. This may be a negative
statement, as in (8):
(8) C1: (tries to make a lego figure stand; it falls; looks at Guo and
shows lego figure)
zhè yàngzi bù néng zhàn zhe
this manner not can stand PROG
'(It) can't stand like this.'
or a question about the characteristics of the object, as in (9):
(9) C2: (doesn't know how to open cotton bundle, holds it up to Guo)
zhè néng dă kāi ma ?
this can make open Q
'Can this (be) opened?'
These utterances are not made merely to report a problem. Their goal is to
solicit the addressee's help. And help will be forthcoming only after a change
216 Jiansheng Guo
in the addressee's knowledge about the toys. Thus, utterances in this subtype
have a double discourse function: (1) to change the addressee's assumptions
about the object, and (2) to get help.
At least two pieces of evidence point to this double function. First, the
children produce these utterances in a matter-of-fact manner while showing
the objects to the addressee, as if to say 'Look, this toy is not working as you
expect. So help me get it to work.' Then, they simply wait for help. Second,
the sorts of problems reported in these utterances always relate to a normally
expected function of the toys in question. Since the toys are provided by the
experimenter, the children naturally hold the latter responsible for the failure
of these normal functions to occur. For the same reason, they assume that the
experimenter expects the toys to function normally. Thus, a challenge to that
assumption is a prerequisite for soliciting help.
I emphasize that these uses are not plain statements or questions; children
use a different construction when they encounter difficulties with their own
possessions. E.g., if they have difficulty untying their shoe laces, they will
produce utterances like (10):
(10) wo jië bù käi zhèi xiédài.
I untie not open this shoe-lace
'I can't untie this shoe lace.'
In (10), the subject is the agent (the speaker), and a Verb+Resultative
-complement construction is used instead of néng. This construction attributes
the failure entirely to the inability of the agent.
Uses of this type always occur where the adult forbids one child to do
something, and then another child makes a néng utterance to justify the adult's
prohibition. In this category, the néng utterance is always negative and always
refers to an undesirable state (e.g. loss of an existing ability) that the prohib
ited action might engender. The semantic meaning of the utterance is 'inabil
ity of object Z to perform function X'. The interpersonal discourse function of
the utterance is again two-layered. On the one hand, it challenges the address
ee's assumption that the action has no negative effect on the object. On the
other, it functions as part of a prohibition, a challenge to the addressee's
action.
Table 4. Breakdown of discourse functions of the physical meaning category of néng by age
Challenge to Justification for Representational
Age Assumptions Prohibitions Uses N
3 52% 8% 10% 92
5 42% 3% - 74
7 38% - 14% 90
Note: N is the total occurrences of néng, including social and epistemic meanings.
Therefore, the percentages in each age group do not add up to 100%.
the force which the néng utterance is intended to counter at the interpersonal
level. In other words, néng is used to overcome an assumed doubt (on the part
of the addressee) about physical abilities and characteristics.
Essential to an understanding of the modal néng is an understanding of
both its referential meaning and its interpersonal functions. In these néng
utterances, the addressee, although unmentioned, is always implicitly present
in the discourse interaction. To emphasize this presence at the discourse
(interpersonal) level, I gloss the discourse function of this category as 'Don't
assume X', which emphasizes that néng is highly social and transitive.
Maintaining the traditional distinction between semantic and pragmatic
meanings, we can say that in its physical sense, néng refers semantically to
physical ability, but pragmatically it indexes interpersonal challenge in an
active and intentional manner.
Unlike the physical meaning uses, the social meaning uses no longer convey
'social meaning' pragmatically, but express it directly. The physical ability
meaning is no longer relevant. When a speaker tells an addressee that the
latter cannot do X, it is obvious from the situation that the addressee has the
ability to do X, and that this ability is either actualized or about to be
actualized. The only meaning néng has here is social permission. Social
meaning uses of néng are rare among 3 year olds, but very frequent among 5
year olds. The uses may be divided into four groups: (1) prohibitions; (2)
permissions; (3) reportive uses; and (4) self regulations.
The Interactional Basis of the Mandarin Modal néng 'CAN' 219
4.1. Prohibitions
Prohibition is the major social meaning use, especially among the 5 year olds.
A typical example is shown in (13):
(13) C1: m bu néng dă rén.
you not can beat people
'You can't beat people.'
ba rén tóu da huài le,bătāde tóu kăn xiàlái, gëi rénjiā, duî ma?
'(if) her head is hurt, (then) his head will be chopped off and
given to her. Right?'
The form of the utterance differs here from that of physical meanings. In
social meanings, the subject is the second person and the sentence is negative.
Utterances in this category are of a specific form: 'You cannot VP.' In
contrast to the physical uses, for social uses, semantic meaning and discourse
function are identical: preventing an action from taking place, which is
glossed as 'Don't do X.' This identity between semantic meaning and dis
course function may explain why uses of this category are referred to as
discourse-oriented uses (Palmer, 1990). Since the 3 year olds seldom use néng
with this social meaning while the 5 year olds do so quite frequently, we
should regard this use category as a new development in the meaning of néng
(see Table 2 for a breakdown).
4.2. Permissions
Seeking permissions is a marginal use in the current corpus; only a few néng
utterances from the 7 year olds can be thus categorized.11 An example of this
use is shown in (14):
(14) C2: (wants to get water from restroom for toy tea set. asks Guo)
néng jië diâr shuǐ qù ma ?
can get little water go Q
'Can (I) go get some water?'
GU: mm, bù néng jiē diănr shut däi huǐr nöng shï le , jiù gāi.
'mm, (you) cannot go get water. You'll get (the floor) wet.'
220 Jiansheng Guo
This category, too, is only used by the 7 year olds. Structurally, it is similar to
the prohibition category, but differs in that the (explicit or implicit) subject of
the néng sentence is the first-person speaker. The prohibition is self-directed.
An example of this category is given in (17):
(17) C3: (gives C2 a toy, then changes her mind, shyly withdraws toy
from C2)
bù, bù néng gel nǐ zhèige.
no, not can give you this.
'No, (I) can't/shouldn't give you this.'
In this example, the child speaker has internalized the force dynamics
(i.e. the two opposing forces come from the same person), which elsewhere is
socially distributed. In other words, the social opposition that elsewhere exists
between two people has here become a psychological opposition in which the
self is split, representing two opposing forces simultaneously.
In this category, néng is used to ask for permissions to act. The sentence
structure is again in the OBJ+néng+V+Q form. The focus is on the object,
while the agent is backgrounded. Although from the context it appears that
néng is basically concerned with social permission, the structure of the
sentence closely associates the predicated meaning with the object, as if
The Interactional Basis of the Mandarin Modal néng 'CAN' 223
permission were part of the intrinsic characteristics of the object, similar to its
physical characteristics. An example of this category is shown in (19):
(19) C1: (helps Guo put lego into the box during clean-up, picks up a
lego car, asks)
zhè néng fàng ma ?
this can put Q
'Can this (be) put in?'
GU: á? néng.
'What? Yes, it can.'
This category is essentially the same as the permissions category, with a
clear social meaning. The difference is that here néng is not completely
dissociated from its source in the physical world.
This category has the same discourse function as the prohibition category
discussed in 4.1., but the referential meaning of néng is ambiguous. A typical
example of this category is given in (21):
224 Jiansheng Guo
ent on the rhetorical question form. In the context given in (22), C2 goes on to
answer C3's challenge by asserting néng ya 'It can (be the case), of course', as
shown in (24):
(24) C3: (turns around to look for it, to C2)
bù néng ba ?
not can Q (=tentative)
'(That) can't be, (can it)?'
C2: (matter of fact manner)
néng ya !
can PRTCL (=of course)
'Of course (it) can.'
Thus, although the epistemic meaning is elusive, it does not necessarily
depend on a particular sentence form. What it depends on at this stage of
development is considerable contextual framing, i.e. challenges and counter
challenges in argumentation.
To conclude the discussion of the data, all the meanings and functions of
the Mandarin modal néng are shown in Figure 1, And Table 5 gives a break
down of the frequency of each subcategory across all three age groups.
3 52% 8% 10% 3% 5% 3% 7% 6% - 4% - - 92
5 42% 3% - 3% - 3% 7% 39% - 4% - - 74
7 38% - 14% 1% - 2% 8% 13% 2% 2% 7% 12% 90
Boasting
3.1 Challenge to addressee's assumption New discoveries
Physical meaning 3.2 Justification for prohibition Problem reporting
3.3 Reportive uses
7. Discussion
7.1. The first aim of this paper is to argue that the Mandarin modal néng not
only has a set of referential meanings — ability, permission, and possibility —,
but it also serves important interpersonal discourse functions. In the physical
abilitative meaning, néng challenges what the speaker believes to be the
addressee's assumptions about the speaker's abilities. In the social permission
meaning, néng challenges the addressee's actions. And in the epistemic-like
meaning, néng challenges the addressee's immediately preceding assertion.
This interpretation of a common interpersonal function for the modals (i.e.
challenge) differs from Perkins' formulation of the core semantic meaning of
the English modals in terms of possibility and necessity as applied to the
physical, social, and rational domains (Perkins 1983). Perkins focuses on the
common features that can be extracted from the static semantic meaning of the
English modals. But what I am arguing here is that a common denominator for
the different semantic meanings of néng, — i.e. dynamic, deontic, and epis-
temic — arises from the interpersonal discourse function néng serves, namely
that of challenge. Each of néng's semantic meanings serves primarily to issue
a challenge rather than to make a statement about abilities, permissions, and
possibilities. Discourse analysis of néng prompts us to pose the broader
228 Jiansheng Guo
question of what modals are and what they do in language. Why is there a
grammatical category 'modal auxiliaries' that expresses possibility and ne
cessity?
In order to answer this question, let us look at two of the many different
ways of representing modal meanings. In English, physical ability can be
expressed either by the modal auxiliary can or by the adjective able, as in be
able to. Similarly, social permission can be expressed by can or be permitted
to. In each case the two options are referentially equivalent. But they differ in
grammatical status. Modal auxiliaries belong to a closed grammatical class,
while adjectives and verbs belong to open lexical classes. Along the con
tinuum between lexical words and grammatical morphemes, modal auxilia
ries are more grammaticalized than adjectives and verbs. This grammatical
difference has significant consequences with regard to the meanings ex
pressed. With lexical forms such as able or permitted, the speaker presents a
fact without any personal involvement. We interpret the utterance as 'I'm
stating X to you'. But when modal auxiliaries are used, the resulting utter
ances are colored by speaker involvement in the form of opinion, affect, or
personal dynamics. We interpret such utterances as 'I'm challenging/object
ing to/arguing with you by stating X to you.' This division of labor fits
Traugott's (1982) formulation of trends in grammaticalization. When a lexi
cal form is grammaticalized, its meaning may also evolve from propositional
to expressive/interpersonal. Similarly, when discussing the Tense, Aspect,
and Mood system, Givón (1984:269) points out that "as discourse-pragmatic
features, they play a crucial role ... in indicating their time/truth/certainty/
probability modalities vis-a-vis the speaker-hearercontract"(emphasis mine).
Child language research has shown that modals are employed by children to
convey their different social stances in terms of interpersonal commitment in
play with other children (Gerhardt 1985,1990 for English; Shepherd, 1981 for
Antiguan Creole). All this points to a special function for modals in express
ing interpersonal meanings.
The claim that modals indicate the speaker's subjective evaluation of the
modalized proposition is hardly novel (e.g. Lyons 1977; Halliday 1973).
However, previous approaches have failed to recognize the crucial interper
sonal function of speaker involvement. Concern has traditionally been with
the relationship between speaker and proposition, rather than with that be
tween speaker and addressee, with the proposition serving as a means of
actualizing the latter relationship. Lyons (1977), for example, glosses the
The Interactional Basis of the Mandarin Modal néng 'CAN' 229
deontic modality meaning as 'let it be so', and the dynamic modality meaning
as 'it is so'. Although the constitutive power of language that he attributes to
deontic modality is social in nature, he fails to recognize that his gloss
represents only part of the social interaction. Speakers are not simply playing
with the creative power of language. Rather, they are participating in a social
exchange, and the creative power of language is used to regulate the interper
sonal relationship between the speaker and the addressee. In this sense,
Palmer's (1990) characterization of deontic modality as 'discourse oriented'
comes closest to the position argued in this paper. But his analysis still focuses
on a single participant's involvement in the discourse. What's more, his
analysis is limited to the deontic modals, which are generally regarded as
having performative force. This paper argues, in contrast, that 'discourse
orientation' is the fundamental property for modals in all the three domains —
dynamic, deontic, and epistemic. Language has developed the grammatical
category of modal auxiliaries to serve the function of regulating interpersonal
relations in social interaction.
7.2. The second goal of this paper is to argue not only that our understanding
of the discourse functions of modals is indispensable to our understanding of
their meanings, but also that their discourse functions form an important
source of semantic change. Semantic bleaching and abstraction, strengthening
of inferential meanings, and metaphorical transfer have been recognized as
the basic processes of semantic change involved in grammaticalization (e.g.
Bybee 1988; Traugott 1988; Sweetser 1988). However, given their methodo
logical limitations, historical studies have seldom investigated the role played
by interpersonal discourse functions in semantic change. Studies of develop
mental change in child language can give us insightful revelations in this
respect. In particular, the development of the social permission meaning in the
5 year olds' uses of néng, and of the epistemic-like meaning in the 7 year olds'
uses of this modal, reflects a process of semanticization of the interpersonal-
discourse function that is associated with earlier uses of néng. The discourse
function of néng gradually saturates its semantic content, resulting in a new
semantic meaning.
The change from the 3 year olds' abilitative meaning to the 5 year olds'
social permission meaning is revealing. For the 3 year olds, the physical
-meaning uses of néng all share a discourse function, roughly glossed as
'Don't assume X' (see Section 3). 13 In addition, the justification for prohibi-
230 Jiansheng Guo
I can X
Physical Meaning
It can X (you) Don't assume X
(Typical of 3 year olds' uses)
It cannot X
Process of semanticization
of discourse function
tions category has the dual function of 'Don't assume X' and 'Don't do X'.
For the 5 year olds, the newly developed prohibition meaning has 'Don't do X'
as its semantic content and discourse function. What is the connection? I
suggest that the interpersonal function of challenge has been semanticized,
i.e., what began as a contextual meaning frequently associated with the form
has become conventionalized and incorporated into the semantic content of
that form.14 This process of change is illustrated in Figure 2.
By this process, néng's discourse function of challenge, pragmatically
interpreted in the 3 year olds' uses, becomes its semantic content in the 5 year
olds' uses. In this semantic change, a challenge to the addressee's assumption
becomes a challenge to the addressee's action. Two crucial components of the
challenge function are involved in the semanticization process. 'Don't' as in
'Don't assume X' contains both a social regulative meaning and a negative
meaning. In the 3 year olds' physical-meaning uses of néng, the social
regulative meaning is absent from the semantic content. Through semantici
zation, it ultimately becomes part of the semantic content of néng, and with it,
the negative meaning comes to the surface.
Admittedly, longitudinal data are needed to reveal the step-by-step proc
ess of semanticization of a discourse-interactional function. But for the time
being, this analysis seems more satisfactory than other possible explanations
for this semantic change in children's use of néng.
7.3. There are at least two alternative explanations for this semantic change.
The first is imitation of adult uses. But imitation cannot explain why children
should wait until age 5 before starting to learn the adult social-permission
The Interactional Basis of the Mandarin Modal néng 'CAN' 231
meaning. The 3 year olds, it seems, hear as many social-permission uses from
adults as do the 5 year olds. 15
7.3.2. The second reason why the simple imitation explanation is unsatisfac
tory comes from the 5 year olds' use of prohibitions. A careful look at their
sentence forms in the social-meaning category reveals that these differ from
adult usage. Adults' prohibitive uses of néng appear consistently in sentences
with no subject, as shown in (25):
(25) GU: bù néng qiäng.
not can grab
'(you) can't grab (it).'
232 Jiansheng Guo
Physical Domain
Physical Intended/
Physical
Enablement: Anticipated
Resistance
Ability Resulting State
Social Domain
Social Intended/
Social
Enablement: Anticipated
Resistance
Permission Resulting State
But as illustrated earlier, many of the 5 year olds' social meaning uses of
néng include a second-person subject, as shown in (26):
(26) Cl: nǐ bù néng dă rén.
you not can hit people
'You can't hit people.'
Simple learning from adult uses cannot account for the addition of an
explicit second-person subject by the 5 year olds.
7.4. The second alternative explanation is that the semantic change results
from a transfer of the basic semantic structure from the physical to the social
domain. According to the model put forth by Sweetser (1982) to explain the
historical change in the English modals, this potential alternative account
would look like Figure 3.
Néng, in its physical abilitative meaning, refers to a physical force which
enables the actor to overcome physical obstacles. In its social meaning, néng
refers to the social force which enables the actor to overcome social obstacles.
In the change from the 3 year olds' abilitative uses of néng to the 5 year olds'
social-meaning uses, the semantic structure of the earlier physical meaning is
The Interactional Basis of the Mandarin Modal néng 'CAN' 233
NOTES
* This paper was inspired by joint research on children's use of the English modal can
carried out by Julie Gerhardt and the author. Many of the ideas originated in discussions
related to that project. Thanks are due to Julie Gerhardt for detailed comments on the
earlier versions of the paper, to Mary Erbaugh for comments on theoretical issues and
insights into the Chinese language and culture, and to Suzanne Fleischman for her
thorough and painstaking editing which has clarified and sharpened many crucial ideas
and made this paper readable. I would also like to thank Dan Slobin, Eve Sweetser, and
Leonard Talmy for reading an earlier version of the paper and helping me to clarify and
sharpen the concepts and arguments. Susan Ervin-Tripp, Amy Kyratzis, and several of
my fellow graduate students also gave helpful feedback, for which I am grateful. Of
course I assume full responsibility for errors of fact or interpretation.
The data used in this study were originally collected for a research project on
children's use of modals. International travel for data collection was generously sup
ported by the Institute of East Asian Studies, UC Berkeley. Purchase of tapes was funded
by a graduate student research grant by the Institute of Human Development, UC
Berkeley. The research was supported by the Chancellor's Dissertation Year Fellow
ship, UC Berkeley, Ma Hsiang Fang Research Fellowship, and NSF grant No. NSF-BNS-
8919569 to Susan Ervin-Tripp for research on pragmatic factors in syntax development
at the Institute of Cognitive Studies, UC Berkeley, which also provided computer,
copying, and videotape laboratory facilities.
1. The ideas of interpersonal challenge and contested ability were first proposed by
Gerhardt (personal communication) in her analysis of children's use of English can. Her
proposal was that the child-speaker contests what he or she assumes to be the address
ee's belief that the child is unable to carry out the action predicated in the utterance. Thus
can is used to challenge and fix a state of doubt about the ability in question. For
example, when a child says I can put the poncho on, there is an expectation that putting
on the poncho is a difficult task, and thus it is questionable that the child has the ability to
do it. Evidence supporting this claim is also found in the use of the Spanish modal poder.
Silva-Corvalán (this volume) reports that while both saber and poder may refer to
ability, saber simply makes a neutral statement, while poder implies the existence of
difficulty or an obstacle in accomplishing the predicated action. Thus the use of saber in
contexts like I can swim the butterfly stroke for hours is ungrammatical.
236 Jiansheng Guo
2. However, Robin Lakoff (1972) offers extensive and interesting discussions on the
pragmatics of modals. Unfortunately, Lakoff draws a sharp distinction between the
pragmatics and the semantics of modal meanings, and fails to recognize their interplay.
Bybee (this volume) illustrates the role contextual inferences play in diachronic changes
of modal meanings. She proposes that the 'present' reading of the past forms of the
English modals has resulted from the conventionalization of pragmatic inferences.
3. See Sweetser (1990:56-57) for a brief survey of traditional treatments of modality.
4. For more discussion on the grammatical and semantic status of néng, see Li & Thompson
(1981) and Tsang (1981).
5. However, Sweetser (1982) notes that epistemic uses of can in English are also limited,
with the affirmative form being rarely used. See also Palmer (1990) and Coates (1983).
6. For discussion on root possibility, see Coates (1983:93-99). I use the term epistemic as
defined by Sweetser (1990), i.e. as reflecting our understanding of the world of reason
ing in terms of premises and conclusions.
7. However, there is a dialectal difference between Beijing Mandarin and Taiwan Manda
rin, in which kěyǐ can be used in the negative form.
8. The V-not-V question in Mandarin is similar to the English yes-no question. For more
discussion, see Li & Thompson (1981:535-45).
9. PRTCL = Sentence-final Particles; PERF= Perfect Marker; PROG = Progressive Marker; REL
= Relative Clause Marker; Q = Question Marker.
10. This idea of children's conception of adults' expectations of their abilities as manifested
in child-adult discourse comes from Julie Gerhardt's analysis of children's use of
English can (personal communication).
11. The marginality of the permission use of néng in the data is unrelated to the existence of
a competing form kěyǐ (see Section 1), also a low-frequency form in the corpus.
12. This meaning category represents a relatively small proportion of the uses of néng for all
age groups (see Table 2).
13. With the exception of the reportive uses, which make up only a marginal proportion.
14. Hopper (1982:16) uses the term semanticizing in discussing Russian aspectual func
tions; Traugott and König (1991) use the term strengthening of informativeness in a
similar sense.
15. No studies have been done on the amount of input of modal auxiliaries children get in
Mandarin. My estimate that 3 year olds and 5 year olds get essentially the same input is
based on the frequency of modal usage by the adult experimenter, i.e. the author, in the
current data corpus. Although this tells nothing about what the children hear at home or
in school, it does suggest that adult use of modals in the different meanings may not vary
according to the age of the addressee.
The Interactional Basis of the Mandarin Modal néng 'CAN' 237
REFERENCES
Bybee, Joan. 1988. "Semantic Substance vs. Contrast in the Development of Grammatical
Meaning". Proceedings of the Berkeley Linguistics Society Meeting 14: 247-64.
Bybee, Joan. This volume. "The Semantic Development of Past Tense Modals in English."
Coates, Jennifer. 1983. The Semantics of the Modal Auxiliaries. London: Croom Helm.
Gerhardt, Julie. 1985. "On the Use of Will and Gonna: Toward a Description of Activity-
types for Child Language". Discourse Processes 8: 143-75.
Gerhardt, Julie. 1990. "The Relation of Language to Context in Children's Speech: The
Role of hafta Statements in Structuring 3-year-old's Discourse". IPrA Papers in
Pragmatics 4:1/2: 1-57.
Givón, T. 1984. Syntax: A Functional Typological Introduction, Vol 1. Amsterdam: John
Benjamins.
Halliday, M.A.K. 1973. Explorations in the Functions of Language. London: Edward
Arnold.
Hopper, Paul. 1982. "Introduction". Tense-Aspect: Between Semantics and Pragmatics.
Amsterdam: John Benjamins.
Lakoff, Robin. 1972. "The Pragmatics of Modality". Papers from the Eighth Regional
Meeting of the Chicago Linguistics Society 8: 229-46.
Li, Charles & Sandra Thompson. 1981. Mandarin Chinese: A Functional Reference
Grammar. Berkeley, CA: University of California Press.
Lyons, John. 1977. Semantics, 2 vol. Cambridge: Cambridge University Press.
Lü, Shuxiang. 1980. Xiàndài hànyǔ bäbäi cí (Eight Hundred Words in Modern Manda
rin). Beijing: Shangwu yinshuguan (The Commercial Press).
Ogden, Charles & Ivor Richards. 1923. The Meaning of Meaning: A Study of the Influence
of Language upon Thought and of the Science of Symbolism. London: Routledge &
Kegan Paul.
Palmer, Frank. 1990. Modality and the English Modals, 2nd ed. London: Longman.
Perkins, Michael. 1983. Modal Expressions in English. London: Frances Pinter.
Shepherd, Susan. 1981. "Modals in Antiguan Creole, Child Language Acquisition, and
History." Unpublished doctoral dissertation, Stanford University.
Silva-Corvalán, Carmen. This volume. "Contextual Conditions for the Interpretation of
'poder' and 'deber' in Spanish".
Sweetser, Eve. 1982. "Root and Epistemic Modals: Causality in Two Worlds". Proceedings
of the Berkeley Linguistics Society Meeting 8: 484-507.
Sweetser, Eve. 1988. "Grammaticalization and Semantic Bleaching". Proceedings of the
Berkeley Linguistics Society Meeting 14: 389-405.
Sweetser, Eve. 1990. From Etymology to Pragmatics: Metaphorical and Cultural Aspects
of Semantic Structure. Cambridge: Cambridge University Press.
Talmy, Leonard. 1988. "Force Dynamics in Language and Cognition". Cognitive Science
12: 49-100.
Traugott, Elizabeth. 1982. "From Propositional to Textual and Expressive Meanings:
Some Semantic-Pragmatic Aspects of Grammaticalization". Perspectives on Historical
Linguistics, ed. by Winfred Lehmann & Yakov Malkiel. Amsterdam: John Benjamins.
238 Jiansheng Guo
John Myhill
Laura A. Smith
University of Michigan
0. Introduction.
We do not as yet know why some obligation markers develop in one direction
and others develop in another. This may be simply arbitrary, but a more
interesting possibility is that what we now call obligation subsumes a number
of subfunctions, each associated with a distinct historical development. At
present, however, we have no way of telling whether this is the case, because
existing descriptions of the functions of obligation markers are insufficiently
detailed.
The main purpose of the present paper is to begin to develop a framework
for describing the discourse and interactive functions of obligation marking.
In the short run, this will serve a useful descriptive purpose by providing a
categorization of obligation constructions in different languages. In the long
run, this categorization system will allow for more powerful cross-linguistic
generalizations about universal form-function relationships.
We will consider and contrast the expression of obligation in four lan
guages: English, Chinese, Hopi, and Biblical Hebrew. These were chosen
because they are genetically unrelated, are available in text form, and are
familiar to us. The main questions we want to address are: In what contexts do
speakers of a given language choose to use an obligation marker? How do
these contexts differ across languages? In a language with more than one
obligation morpheme at the speaker's disposal, what factors motivate the
choice of one or another of these morphemes? In English, for example, when
do we prefer have to to gotta, or vice versa? Do speakers of Chinese use the
same criteria to distinguish between, say, dei and yao? In addressing these
questions, we will be working toward a typology of obligation contexts across
languages.
Interestingly, the languages we will investigate differ not only in how
they use obligation markers but also in how frequently they use them: English
and Chinese have a number of commonly-occurring obligation markers,
while Hopi has only one productive (and not particularly common) lexical
marker of obligation. Biblical Hebrew, on the other hand, has none at all (we
will suggest that word order serves a related function in this language). Both
Hopi and Biblical Hebrew have a large number of clauses in which, from the
English point of view, there is an 'implied' obligation, without there being an
explicit marker.
Our approach in this study will be to analyze and categorize naturally
occurring tokens associated with obligation. We will not give a dictionary-
style 'definition' of obligation, because we do not believe that such a defini-
Functions of Obligation Expressions 241
1. Preliminary considerations.
A speaker can say You have to leave now, realizing that the hearer may in
fact not leave. Such is the case with the great majority of naturally-occurring
tokens of 'strong' obligation markers such as have to. If we change the
criterion to mean that the speaker is acting as though the action will definitely
take place, then we have to find independent evidence that the speaker is
acting this way. In the great majority of naturally-occurring cases, there is no
evidence other than the presence of a morpheme which has been designated as
marking 'strong' obligation; the argument for calling these 'strong' obligation
markers is therefore circular.
Coates 1983 makes a distinction between obligation which can be para
phrased as 'it is obligatory/absolutely essential that...' and obligation para-
phrasable as 'it is important that...', the first type being 'stronger'. However,
as Coates notes (pg. 34), the parameters for distinguishing between these are
'indicative rather than definitive,' and 'between these two extremes there is
considerable fuzziness.' For example, she suggests that (1) is paraphrasable
with 'obligatory' and (2) is paraphrasable with 'important,' but it seems to us
that the reverse is just as plausible:
(1) They were told by the Chairman, Mr. Jos. D. Miller, "You must
have respect for other people's property. "
(2) If you commit murder, Charlotte, you must be punished.
In attempting to apply Palmer's and Coates' tests for 'strong' and 'weak'
obligation to naturally occurring data, we found too many cases for which
there was no evidence of the strength of the obligation other than use of a
'strong' vs. 'weak' obligation marker. We therefore have not attempted to
apply independent tests for the 'strength' of the obligation.
Another parameter used to distinguish between different types of obliga
tion function is referred to variously as 'discourse orientation' (Palmer 1974),
'subjectivity' (Lyons 1977), and 'speaker involvement' (Coates 1983).
Coates (p. 33) considers the important parameter distinguishing must and
have to to be whether the "speaker is interested in getting (the) subject to
perform the action." This distinction is consistent with our own intuitions
about some of the English and Chinese data, but we were hard-pressed to code
for it objectively. Often, we simply could not determine the degree to which
the speaker was interested in having the action performed. Indeed, it is not
clear why a speaker would mention an obligation to perform an action unless
s/he had some interest in having that action carried out.
Functions of Obligation Expressions 243
1.2. Grammaticalization.
another list of tokens of English have to, divided up into its different func
tions, we have no way of telling whether the differences in frequency of these
functions are artifacts of the respective texts. For example, we found that in
our English texts gotta is used for biological needs (e.g. I gotta take a leak) but
in our Hopi texts nawus is never used for this function; however, there was no
context of this type in the Hopi texts, and we therefore cannot conclude that
nawus lacks this function.
In order to limit the effect of this problem, i.e. that the distribution of
morpheme functions is an artifact of the texts used, and to facilitate cross-
linguistic comparison, we will make use of English translation data in our
studies of Chinese, Hopi, and Biblical Hebrew. Translations are not an ideal
source of data, but they are objective, as we have used someone else's
translations rather than doing it ourselves. Translations sometimes retain
more of the structure of the original than would be colloquial in the target
language. Thus, our claims about obligation functions in English will be based
on original English sources; the translation data serve only as a supplement
for purposes of comparison, and it is important not to overestimate the
similarities between expressions of obligation in two languages on the basis of
translation. However, the tendency to retain the structure of the original in
translation makes cases in which the structure of the original is not retained
all the more striking as evidence for functional differences between the
languages.
Biblical Hebrew presents a special problem for the study of obligation
marking. In our database for this language, there is no commonly-occurring
lexical marker of obligation, and so it is not immediately obvious which data
to include.3 As stated above, we will argue that something resembling 'obliga
tion' function in this language is marked by word order alternation: the
language is normally verb-initial, and non-verb-initial word order has as one
of its functions something similar to 'obligation' marking. As this type of
'obligation' marking (if it can be called that) is grammatical rather than
lexical, we would expect it to be associated not only with the more specific
lexical types of obligation function considered in other languages but also
with the more grammaticalized functions which obligation markers com
monly develop. Therefore, in comparing Biblical Hebrew and English, we
will consider not only more basic obligation functions such as the obligation
meaning of must and should, but also more grammatical functions, such as
inference and those associated with shall.
Functions of Obligation Expressions 247
2. Empirical studies.
In this section, we will discuss the results of empirical studies we have done of
obligation marking in English, Chinese, Hopi, and Biblical Hebrew. We have
used original texts from each language (supplemented by English translations
for the last three languages); these will be briefly discussed in the relevant
sections. As will be seen from the following discussion, the situation regard
ing usage of the various obligation markers is extremely complicated; to keep
confusion to a minimum, we will limit the data we analyze to one genre,
dialogue in written texts; future work will of course have to investigate other
genres.
2.1. English.
The English data were collected from three plays by the modern American
playwright Lanford Wilson. The plays are The Hot L Baltimore (1973), The
mound builders (1976), and Burn this (1987). Since coding of data was often
highly context dependent, brief plot summaries are useful: The Hot L Balti
more is the story of downtrodden but often witty hotel residents whose
decaying home is up for demolition. In The mound builders, archaeologists
uncover a rare Indian burial site whose treasures are lost forever when the
landowner's son, no longer able to sell the land, avenges his lost profits. Burn
this portrays a group of New York artists who insightfully and often cynically
reflect on their lives after the accidental death of a close friend. These plays
have the advantage of using very colloquial language, which is especially
important for gathering tokens of (have) gotta. We chose plays by a single
author to avoid admitting author preference (rather than function) as a factor
in morpheme choice; this will serve to minimize the number of complicating
variables in this pilot study. For practical reasons, we narrowed down English
obligation markers to five: have to, gotta, should, oughtta, and supposed to.4
A total of 153 occurrences were found in the three plays.
For reasons described above, traditional distinctions such as strong vs.
weak obligation are not useful to the study of naturally occurring data because
they cannot be objectively applied; therefore, we have developed alternative
means of categorization. After extensive analysis of the data, the criterion
which most clearly suggested itself was whether or not someone is negatively
affected by the action expressed in the verb phrase, and, if so, which person is
248 John Myhill & Laura A. Smith
(6) Why did he go out? Why didn 't someone hear him... WHY DID HE
HAVE TO HEAR NOISES IN THE NIGHT? (caps in original)
(Wilson 1976:146)
We can see that this is not an obligation usage because the subject has no
control over the action, hear. Rather, have to expresses the speaker's negative
evaluation of a grievous situation (her husband's disappearance). We have
classified a number of different types of negative effect in Table 1 below (a-g),
and will discuss each in turn, after which we will examine tokens with no clear
negative effect (h-k). We note here that coding tokens is a difficult task with
Functions of Obligation Expressions 249
which have a number of had to tokens. Hence, there is some evidence that had
to occurs in functions not normally associated with have to.
When there is another auxiliary preceding, only have to and not gotta
appears in our data (*He might gotta leave, *He's gonna gotta leave, *He'll
gotta leave, *Does he gotta leave?).5 However, it is generally grammatically
possible to substitute gotta alone for AUX+ have to (e.g. You're gonna have
to leave vs. You gotta leave). On the basis of our data, it appears that in this
situation the usual functional distinction between have to and gotta is main
tained; in other words, the AUX+ have to tokens pattern like the AUXless
have to tokens, not like the gotta tokens. For this reason, we include the
AUX+ have to tokens with the other have to tokens in Table 1.
Turning now to discussion of the categories in Table 1, the 'Negative
effect on subject' category includes those usages illustrated in examples (7)-
(9) below, where the action expressed in the verb phrase is emotionally or
physically unpleasant to the subject.6 Table 1 shows that this function favors
have to over gotta (18 vs. 4).
(7) This glowering older brother had to go get my clothes...
(Wilson 1987:22)
(8) They're tearing down the whole building, so we all have to move.
(Wilson 1973:48)
(9) / gotta get Horse's things.
(Wilson 1973:46)
We have evaluated 'negative effect' on the strength of various cues either
in the utterance or in the broader context. In (7), for example, glowering sig
nals the brother's view of the action as unpleasant (as perceived by the
speaker, of course). As for (8), earlier comments (e.g. the speaker's refusal to
help distribute eviction notices) reveal that the speaker is upset about the
impending demolition. Lastly, in (9), a mother is resigned to moving her son's
belongings out of the hotel after unsuccessfully imploring the manager to let
him stay.
The second category listed in Table 1 is 'Mitigated inconvenience to
listener.' All tokens of this type have 1st person subjects, and so the speaker is
in effect excusing doing something against the interests of the listener by
referring to an obligation requiring this action, as in (10):
(10) I've got to talk money with some St. Louis real-estate men this
evening. This isn't going to be much of a vacation for you, Jean.
(Wilson 1976:13)
Functions of Obligation Expressions 251
itly apologizes for the negative effect the action has on the listener. There is an
interesting illustration of this contrast in two excerpts from the narrative of a
single speaker. In (15), she uses gotta to portray herself trying to weasel out of
an invitation, while in (16), she uses have to to describe the unpleasant
consequences of failing to do this.
(15) His mother's on the bed with a washcloth on her forehead. I'm
trying to tell them how I've got to get a bus back to civilization...
(Wilson 1987:20)
(16) So then it's midnight and the last bus has left at ten, which they
know, I'm sure, damn them, and I hadn 't checked, like an idiot. So I
have to spend the night in Robbie's little nephew's room in the attic.
(Wilson 1987:21)
Note that (15) is humorous precisely because it mixes the offensive
phrase back to civilization with the courtesy of justifying a departure. Have to
(historic present) in (16), on the other hand, is consistent with the subject/
speaker's obviously negative view of the ordeal.
The third functional category in Table 1 is 'Negative effect on listener,
speaker unsympathetic'. In the English data, there is only one subtype for this
category, with 2nd person (or 1st person inclusive, which includes the lis
tener) subjects. For this subtype, the listener is to perform the action and be
negatively affected by it, and the speaker is unsympathetic or even hostile;
this unsympathetic attitude distinguishes these tokens from tokens of 'Nega
tive effect on subject' with 2nd person subjects (which are in any case very
rare in the English database). This type is exemplified in (17)-(18):
(17) It's a damn shame you 're going to have to find yourself some other
field of operation. (Wilson 1976:128)
(18) We gotta go when I need it, damnit. (Wilson 1976:61)
In (17), the speaker owns some property which he intends to sell to a
developer, while the listener is an anthropologist committed to protecting an
excavation site on the property. In (18), the speaker is trying to persuade the
listener to do as he requests, and the listener is resisting.
This type has 6 tokens of have to and three of gotta. It appears that an
important factor distinguishing have to from gotta here is whether the speaker
regards this unpleasant obligation as being imposed by factors beyond the
immediate control of the interactants. When there is a legal or quasi-legal
Functions of Obligation Expressions 253
basis for the obligation, have to is used 5 of 6 times (this is related to the
'Procedures/routines' function of have to, which we will discuss below) 7 .
Thus in (17) the speaker believes that negotiations have already been made to
sell the land, and is under the mistaken impression that the anthropologist has
no legal recourse to prevent the sale. On the other hand, when the basis for
imposing the obligation is purely personal, as in (18), gotta is used two of
three times.
As we have noted, the only subtype of 'Negative effect on listener'
represented in the English data is the one with 2nd person subjects. Obligation
markers can also be used in this category with 1st person subjects (If you don 't
keep quiet, I'm gonna have to break your nose) and 3rd person subjects (He's
gonna have to teach you a lesson), but there are no tokens of this in the
English database (we will see in Section 2.2 that there are such tokens in the
Chinese database).
The fourth functional category is 'Negative effect on speaker'; this type
is defined as having a non-1st person subject, which distinguishes it from
'Negative effect on subject' with 1st person subject. There are no tokens of
this category in the English database, although such uses are possible, e.g. in
Why did he have to do that? the obligation marker can indicate the speaker's
disapproval (of course, it can also indicate that the subject could be expected
to view the obligated action negatively). We will see in Section 2.2 that
obligation markers are used in the Chinese database with this function.
The fifth functional category, 'Reprimands', also implies a negative
effect on someone. 8 There are 12 tokens of this type, exemplified in (19)-(20):
(19) Hey. Paul. You're supposed to be doing this with me.
(Wilson 1973:136)
(20) You ought to be ashamed of yourself robbing Mr. Morse.
(Wilson 1973:102)
This type is most commonly represented in our data with supposed to (6
tokens) or oughtta (3 tokens). Supposed to implies a tacit understanding
between the subject and a second party (e.g., compare You're supposed to be
meeting him tomorrow with You ought to be meeting him tomorrow). In (19),
the speaker is looking through hotel records to help Paul find his missing
grandfather; the speaker assumes an understanding between herself and Paul
that he will help in response to her gesture, but he has not done so. Oughtta, on
the other hand, conveys the speaker's personal and often detached viewpoint,
254 John Myhill & Laura A. Smith
(21) I'm supposed to be home with Frank 'cause he's on morphine and
God knows what condition he might be in. (Wilson 1973:12)
(22) He ought to sleep with it (the window) open anyway; you can't talk
to him about it. (Wilson 1973:34)
Moving now to 'Advisable action' with 2nd person subjects, the most
common markers used were gotta (7 tokens), e.g. (30), and should (8 tokens),
e.g. (31):
(30) You gotta get up to your girl (Wilson 1987:20)
(31) There are some cost accountants and some professional architects
down at Memphis you should talk to. (Wilson 1976:129)
The difference here is probably at least partially due to the relatively
colloquial nature of gotta. We also suspect that something such as 'speaker
involvement' favors the use of gotta; however, this parameter is extremely
difficult to measure, especially with few tokens.
There is one example of 'Advisable action, generic subject':
(32) It's the night of nights! People should be up. (Wilson 1976:79)
Our next category, 'Procedures/routines', consists of obligated actions
which have no clear negative effect on anyone. As Table 1 shows, have to and
gotta are used for this function, with have to favored (7 vs. 3). 13 In this
category, actions are often habitual, as in (33)-(34)14:
(33) She took her stuff— / have to take my own, though. That's the way
we do it. (Wilson 1973:116)
(34) They had to hang around the house and tend the fields — sacrifice
to the gods of harvest and whatnot. (Wilson 1976:46)
There are only three instances of gotta in this category. It is not clear why
gotta is sometimes used here, and we would need a larger database to
demonstrate anything conclusive.
The last of our categories, 'Biological/physical need', is straightforward.
In the present tense, it is only expressed with gotta', in the past, it is expressed
twice with had to, presumably because gotta lacks a past tense form. Exam
ples are (35)-(36):
(35) / had to piss so bad — about Fifteenth Street and Eighth or Ninth
Avenue... (Wilson 1987:62)
(36) I gotta have another beer. (Wilson 1976:120)
Perhaps one factor favoring gotta in these contexts is informality: we are
more inclined to express physical needs around people we know well and with
whom we are more likely to use the more colloquial gotta.
Functions of Obligation Expressions 257
2.2. Chinese.
The Chinese data were gathered from dialogue in two plays by Cao Yu,
Beijingren ("Peking Man," 1954) and Leiyu ("Thunderstorm," 1961)15. The
first portrays a once-prosperous family in decline, the second a pair of ill-fated
258 John Myhill & Laura A. Smith
lovers from different social classes who turn out to be long-separated siblings.
There were several reasons for selecting these plays. First, they are by the
same author and hence keep author-preference and dialect variables con
stant.16 Second, the plays are recent enough to contain dialogue fairly typical
of modern colloquial Chinese; Cao Yu's works are recognized as representa
tive of the new genre huaju (spoken drama), which contrasts sharply with the
lyrical, highly formulaic style of traditional Chinese drama. Third, Beijingren
is available in a reasonably idiomatic English translation and is long enough
to yield a sufficient number of English tokens. This is important for Chinese-
English comparison (at the end of this section).
We examined three Chinese obligation markers, the preverbal auxiliaries
dei, yao, and gai.17 We have gathered only their preverbal and (where there
was an overt subject) post-subject occurrences, excluding tokens with nomi
nal or sentential complements (e.g., Zhe dei ta guan 'this dei he handle'='This
has to be handled by him').
All of these morphemes also have epistemic functions, which are ex
cluded here. The morphemes are illustrated in the following examples:18
(37) Ni qian le wo de, ni dei huan (Cao 1954:369)
you owe ASP me PRT you pay-back
'You have to pay back what you owe me.' 19
(38) Yihou shenme dou yao gaosu ma! (Cao 1954:395)
from-now-on something all tell Mother
'From now on you must tell your mother (=speaker) everything.'
(39) Nimen ye gai gei nimen fuqin songxing nal
you(pl) also to your(pl) father bid-farewell PRT
(Cao 1954:335)
'You two should also say goodbye to your father.'
The status of dei and gai as obligation markers is supported by descrip
tions in several grammars (e.g., Liu 1983:116, Chao 1968:741-42, Li and
Thompson 1981:182-83). Yao, however, is problematic because it can also
express desire/intention and prediction/future meanings, as exemplified in
(40)-(41):
(40) Ke ni daying le wo. Wo yao fang, wo yao fang!
but you promise ASP me I fly I fly
(Cao 1954:296)
'But you promised! I want to fly it (the kite), I want to!'
Functions of Obligation Expressions 259
described in section 2.1), and the other (9 tokens) comprised of the different
subtypes of obligation function without negative effect (i.e. types h-k).
The remaining 106 yao tokens are typically either assertions of intended
action (with 1st or 3rd person subjects), unsolicited promises (with 1st person
subjects), or detached inquiries as to the listener's intentions (with 2nd person
subjects); we set these tokens aside. The 68 yao tokens included in our
database do not invariably translate into English with obligation morphemes,
and in some cases such a translation is not even plausible. However, we do not
regard this as a problem, because it is our ultimate goal to develop characteri
zations of types of obligation functions which are language-universal and not
dependent upon whether one language or another happens to represent them
with an obligation marker.
We should note that, by adopting this strategy, we are proposing not a
solution, but an approach. Exactly what constitutes an obligation usage of yao
is still an open question, partly because the morpheme conflates what from an
English viewpoint are two distinct meanings, obligation and intention.
Our database of Chinese obligation markers thus consists of 29 tokens of
dei, 68 tokens of yao, and 26 tokens of gai. Dei and gai tokens were gathered
from both plays, yao tokens from only one. Table 2 compares the raw
frequencies of the three morphemes by showing their distribution in the one
play from which yao tokens were gathered:
Table 2. Obligation dei, yao and gai in 'Beijingren'
Dei 10 11%
Yao 68 73%
Gai 15 16%
Total 93 100%
a. Negative effect on 7 14 0 21
subject
b. Mitigated inconvenience to 5 4 0 9
listener (1st pers. subj.)
c. Negative effect on
listener, unsympathetic
1st pers. subj. 0 8 0 8
2nd pers. subj. 10 0 0 10
3rd pers. subj. 0 7 0 7
d. Negative effect on speaker 1 24 0 25
e. Reprimand (2nd pers. subj.) 0 2 3 5
f. Counterfactual (non-2nd pers. subj.) 0 0 7 7
g. Help not needed 0 0 0 0
h. Facetious suggestion 0 0 0 0
i. Advisable action
1st person subject 0 2 8 10
2nd pers. subj. 6 4 6 16
Generic subject 0 2 1 3
j . Procedures/routines 0 1 0 1
k. Biological/physical need 0 0 0 0
1. Uncodable 0 0 1 1
Total 29 68 26 123
To express 'Negative effect on subject', both dei and yao are used, as in (43)-(44):
(43) ...chi renjia de qianliang, jiu dei shou
eat somebody PRT money-grain then be-subject-to
renjia de guan. (Cao 1961:7)
somebody PRT discipline
'...if somebody's paying your wages you've got to take orders from
them.'
(44) Keshi tarnen shuo zhe zhang xianzai yao fu...
but they said this account now pay
(Cao 1954:277)
'But they said the bill had to be paid right now.'
The distinction between dei and yao in this category appears to be
related to the nature of the obligation. Most of the dei tokens (6/7) deal with
262 John Myhill & Laura A. Smith
either the intentional meaning (with will or gonna and no obligation marker),
or the 'Mitigated inconvenience' meaning, where the obligation marker 'ab
solves' the speaker of responsibility (e.g. I'm sorry I have to do this). In
Chinese, on the other hand, yao conflates the intention and obligation mean
ings and is commonly used when the listener is negatively affected, with or
without mitigation on the part of the speaker.
Yau is the preferred morpheme in our next functional category, 'Negative
effect on speaker', occurring in 96% (24/25) of the cases. This category
includes only cases where the subject is non-1st person, as in (50) below,
which distinguishes it from 'Negative effect on subject' with 1st person
subjects:
For reasons which are not clear, English does not use an obligation marker
when there is a question word other than why. For example, What do you have
to do? does not occur when the obligation marker indicates the speaker's
disapproval of the action (parallel to the speaker-affected interpretation of
Why do you have to do that?); rather, the obligation marker depicts the action
as having a negative effect on the subject. Although obligation markers are
sometimes possible with 'Negative effect on speaker' in English, this usage is
obviously less common than with yao. The functions discussed thus far have
been represented in the data by dei or yao, but not gai, and in this respect differ
from the remaining functions, which generally favor gai. We summarize the
functions of dei and yao discussed thus far (categories a-d) in Table 4 (the
parenthetical English examples capture the gist of the Chinese contexts).
The contrast between dei and yao is clear from Tables 5-6, which show
(on the x-axis) the subject type, and (on the y-axis) the negatively affected
party, for the functional categories discussed so far (a-d). Most of the dei
clauses in our data (16/23, or 70%) have the listener negatively affected.
These are exclusively 1st person subjects (mitigated discourtesies) or 2nd
person subjects (hostile commands). With yao, on the other hand, listener-
affected clauses comprise only 35% (20/57) of the tokens, while speaker-
affected clauses comprise 58% (33/57). Furthermore, with yao, the subjects of
these clauses are often 3rd person.
266 John Myhill & Laura A. Smith
We will now turn to Chinese usages in which the action of the verb
suggests no clear negative effect on anyone. The first two such categories
('Help not needed' and 'Facetious suggestion') are not represented in our data
by the Chinese morphemes studied here.
We will next discuss the 'Advisable action' category. With 1st person
subjects, gai is usually used here (8/10 times), as in (56):
(56) Wo shuo ye gai wenwen Su biaomei de yijian ba.
I say also ask Su cousin PRT idea PRT
(Cao 1954:407)
'...I think that we should talk to Cousin Su about it first.'
With 2nd person subjects, on the other hand, 'advisable actions' are
expressed using all three obligation markers (6 dei, 4 yao, 6 gai). Dei is used
when the speaker cannot be assumed to have authority over the listener;
indeed, in 4 of the 6 dei clauses here, there is independent evidence that the
speaker is showing deference. In (57), deference is indicated by the polite
form of the 2nd person pronoun; the speaker is a servant, consoling a bereaved
houseguest.
(57) Zhe shi meifazi de shi, —keshi nin dei
this is no-way PRT matter but you (honorific)
kuku... (Cao 1961:133)
have-a-cry
'...there's nothing you can do now — now come on, you must have
a cry.'
In contrast to dei, all four yao clauses in this category are uttered by an
authority figure (e.g. parent, elder), as in (58), where a woman is admonishing
her pregnant niece:
268 John Myhill & Laura A. Smith
(59) Lin zou le, ye gai liu dian jinian. (Cao 1954:310)
about-to leave ASP also keep a-bit remembrance
'Now that he's leaving, you should have something to remember
each other by.'
There are also three tokens of 'Advisable action' with generic subjects,
two using yao, as in (60), and one using gai:22
(60) wo shuo yige ren yao you dian liangxin. (Cao 1954:330)
I say a person have a-little conscience
T say a person should have a little conscience.'
Gai is typically translated as 'should', the 'weak' obligation marker in
English, and in our data it shares several features with that morpheme. Over
half of the gai tokens (15/26) occur in the 'Advisable action' category, so that
its use with this function is much more common than dei (6/29) or yao (8/68).
Similarly, most of the tokens of should (16/22) occur in this category, but very
few tokens of have to (1/42) or gotta (7/40) do. As in English should clauses,
many of the gai tokens in this category (6/15) occur with a hedge, e.g. with the
particle ba meaning 'Don't you think so?' 2 3 or with something translating as
'I think'. Such hedges (see (60) above) never occur with dei or yao in these
categories.
The last two functional categories, 'Procedures/routines' and 'Biologi
cal/physical need' are only minimally represented in our Chinese data, and
hence will not be discussed here. Returning now to categories e-k in Table 3,
we can summarize the 'No negative effect' functions of dei, yao, and gai as in
Table 7 (again, the English examples are intended to convey the gist of the
categories).
Functions of Obligation Expressions 269
Table 8. Chinese-English
comparison
havel
(Beijingren+English other no
translation)
must had to should marker marker TL
dei 0 1 3 2 4 10
yao 2 22 5 2 41 72
gai 1 0 11 1 2 15
other oblig 2 1 13 X X 16
no marker 1 20 8 X X 29
TL 6 44 40 5 47 142
X=no data gathered
another third (12/41) use the Present Progressive form to express unilateral
action already underway (e.g., ...and he's ranting and raving at me). We only
included tokens of yao corresponding to one or another of our categories of
obligation function, generally having a negative effect on someone;26 this
negative effect frequently makes it at least possible to translate yao tokens
into English with obligation markers, but the data in table 8 show that this is
relatively uncommon.
Of the English morphemes, have to/had to marks the contexts least likely
to take an obligation expression in Chinese. 20 of the 29 'non-obligation'
contexts in Chinese take have to in the English, and 14 of these 20 fall into two
categories: they refer either to past events, and translate as had to or did have
to (6 tokens), or to habitual actions or states (8 tokens), usually with a time
adverbial like meitian 'every day'. In our Beijingren data, Chinese mor
phemes which generally share the functions of have to — dei and yao — occur
almost exclusively in either future or present contexts: dei never translates
with any past tense English form, while yao does so only 10% of the time (7/
72 times). It appears that past obligations in Chinese are rarely expressed with
either dei or yao. As for habitual actions or states, English seems more likely
than Chinese to explicitly mark a negative evaluation by using an obligation
marker. In (61), for example, have to makes it clear that the speaker does not
perform her routine willingly, while the Chinese translation uses no obligation
marker:
(61) ...lian wo bu shi ye xia chufang bangzhe
even I not be also go-down-to kitchen help
Zhang Shun... (Cao 1954:295)
'Even I have to help Zhang Shun (=servant) in the kitchen.'
Functions of Obligation Expressions 271
2.3. Hopi.
Hopi has only one word which we can consider to be a productive marker of
obligation, nawus.21 Basic word order in Hopi is SOV, and nawus normally
comes between the subject and the object or complement PPs. Unlike the
obligation markers in English and Chinese, which have syntactic and/or
phonological properties of auxiliaries, nawus is a free, fully lexical form,
which has the syntactic distribution of an adverb. Although it is the only
productive way of marking obligation in Hopi, it is relatively uncommon;
must, have to, should, and be to each occur more frequently in the English
translation than nawus occurs in the Hopi.
Our Hopi data are taken from Malotki and Lomatuway'ma (1987a and
1987b), and Geertz and Lomatuway'ma (1987), collections of Hopi texts with
English translations. The texts consist of personal narratives as well as tradi
tional stories about the god Maasaw and his interactions with Hopis and
various anthropomorphized animals. As in the other languages, we include
only data from conversational (as opposed to narrative) parts of the texts.
Nawus occurs 27 times in the conversations in these texts, always with an
obligation function (even in two cases where it is not translated into English
with a typical obligation form).
For the purposes of delimiting our database and comparing Hopi and
English, it is necessary to determine when a type of obligation meaning can be
272 John Myhill & Laura A. Smith
inferred from the text even if it is not marked with nawus. The strategy
employed in Section 2.2 in selecting tokens of yao to include in the data base,
namely, including tokens which correspond to an obligation category de
scribed in Section 2.1, is not feasible in Hopi. As we will see, the categories
are not really appropriate for the Hopi data, and additionally we would have to
apply them to every single clause in the data, with nawus or without, which
would be prohibitively time-consuming. Under these circumstances, the only
realistic course of action is to use the English translations as our criteria for
inclusion. Thus we include in our database for this section all sentences which
have nawus in the Hopi and/or a word expressing obligation in the English
translation. As mentioned in Section 2.2, this strategy has the disadvantage of
excluding tokens where some kind of obligation meaning appears to be
present but an obligation marker is not used in the English translation;
however, the alternative approaches have stronger disadvantages. The transla
tion relationships in the database are shown in Table 9.28
The first observation which we can make from Table 9 is that nawus
appears to be associated specifically with 'strong' obligation, as it is only
translated with the 'strong' obligation forms must, have to, and no choice, but
never with the 'weak' markers should, be to, supposed to, or better. For
reasons noted above, it is problematic to develop independent criteria to test
the 'strong/weak' distinction. However, it is worth mentioning that in all but
one of the 27 cases, the obligation is fulfilled; in the one exception, a child
disobeys his parents' command, but it is clear that the parents intended a
command rather than a recommendation. This pattern is consistent enough to
support the view that nawus is specifically associated with 'strong' obligation.
Functions of Obligation Expressions 273
Thus far, we have discussed markers in English, Chinese, and Hopi with a
variety of functions which can be considered to represent types of obligation.
Our analysis of Biblical Hebrew differs in that we will not attempt to catego-
278 John Myhill & Laura A. Smith
rize different types of obligation functions but rather to demonstrate that these
and related functions can be expressed through purely grammatical rather
than lexical means.
As argued in Section 1.2, obligation markers can develop functions
which are more grammatical, such as inference and future meaning, and we
would expect these more grammatical functions to be related to the more
lexical obligation functions in a non-arbitrary way. There will then be a
network of related functions consisting of different types of obligation func
tion and also certain types of more grammatical functions like inference and
future. We will refer to this network of related functions as 'obligation' (with
scare quotes). Note that 'obligation' does not include all types of inference/
future functions, but only those related to obligation and likely to develop
from an erstwhile obligation marker.
This view of 'obligation' as representing a network of related functions
(ranging from strongly lexical to highly grammaticalized) is supported in an
interesting fashion by data from Biblical Hebrew. This language has no
lexical marker of obligation similar to English have to, Chinese dei, or Hopi
nawus. However, something resembling 'obligation' meaning is expressed
through word order alternation. Biblical Hebrew generally has verb-initial
order (69% of the time in our database), and the use of non-verb-initial order
has as one of its functions the expression of something resembling 'obliga
tion'.
Before demonstrating this, we should note that we are not claiming that
'obligation' meaning is expressed through non-verb-initial order in Biblical
Hebrew. It is more accurate to say that the use of non-verb-initial order
expresses an abstract function which, in certain contexts, resembles 'obliga
tion', and which, from the English point of view, is interpreted as 'obligation'.
For this reason, sentences using non-verb-initial order in these contexts are
consistently rendered in languages such as English with 'obligation' markers.
We have gathered data from all the conversations in the first 30 books of
Genesis and an English translation (Plaut 1981). Our database includes all
clauses with functions which might be considered future, subjunctive, opta
tive, imperative, and/or modal, a total of 450 Hebrew clauses and their
translations. We then coded the data according to the Hebrew word order
pattern and the English translation.
Because Hebrew lacks a lexical obligation morpheme, our strategy for
analyzing must be different from that used for the other languages in this
Functions of Obligation Expressions 279
Table 11. Hebrew word order and English 'obligation ' markers
Non-verb-initial Verb-initial Non-verb-initial%
Shall 46 27 63%
Should 5 0 100%
Beto 4 1 80%
Must 2 1 67%
Total 'obligation' 57 29 66%
Other 81 283 22%
Chi-square=63.42 p<.001
translation simply uses an imperative (e.g. 'Go from your land...'). It appears
that the translators are following a stylistic convention of using shall in God's
speech about someone else's future actions (excluding imperatives); this
occurs 89% of the time. The most likely explanation for this is that shall has
acquired an archaicizing function independent of its' obligation' function and
so is particularly suitable for conveying God's prognostications and pro
nouncements about the fate of humans.32
Of the remaining 7 tokens (out of 29) of verb-initial constructions excep
tionally translated with English 'obligation' markers, all follow the equivalent
of if or when clauses in Hebrew, e.g. (69):
(69) v?im-lo? to?veh ha?ishah lalexet akhareixa
and-if-not willing the-woman to-go after-you
vniqatah mishvu'ati. (Genesis 24:8)
you-will-be-free from-my-oath
'And if the woman is not willing to go with you, you will be free
from my oath.'
Such apodosis clauses are invariably verb-initial in our Biblical Hebrew
database (in 25 tokens), regardless of whether they translate with 'obligation'
words. We have shown, then, that one function of non-verb-initial order in
Biblical Hebrew is related to 'obligation'. Since this is not the only function of
this order (it also marks contrast), we are not claiming a categorical correla
tion between this order and translation with 'obligation' markers, but 'obliga
tion' is a major function of this order. The use of word-order alternation to
mark 'obligation' function presumably compensates to some extent for the
fact that the language does not have a lexical obligation marker.
It is not at this stage entirely clear why Biblical Hebrew should mark
'obligation' function with word order. Cross-linguistic research will be neces
sary to determine whether the relationship between non-verb-initial order and
'obligation' is widespread and based on general cognitive principles or simply
restricted to Biblical Hebrew and more or less coincidental. We might tenta
tively hypothesize that this relationship may be related to the notion of
continuity.33
Givón (1977) and others have argued that in narrative discourse, the
unmarked order of presentation is 'iconic sequence' (the order of clauses
mirrors the order in which the events reported in these clauses took place);
more specifically, temporally sequenced clauses are continuous and unse-
282 John Myhill & Laura A. Smith
3. Conclusion.
The main purpose of this study has been to develop a framework for the cross-
linguistic description and comparison of obligation functions in natural usage.
We have discussed and exemplified types of obligation functions in four
languages and correlated these with obligation markers in particular lan
guages. We have gone to some length to describe and exemplify the functions
identified in our database so that linguists working in other languages might
test them against their data. The functions generally correlate with particular
markers in the languages we have investigated, and when a given function
284 John Myhill & Laura A. Smith
occurs commonly with more than one marker we have attempted to divide it
into subfunctions which might account for the distribution of markers. Given
the relatively small size of our database, we cannot expect to have entirely
determined the factors governing the choice of forms, nor do we expect our
list of obligation functions to be exhaustive.
It is important to note here what we have not attempted to do. We have
not attempted to define exactly what obligation means in general. After
analyzing a large number of expressions of obligation in the languages
investigated here, it is clear to us that obligation expressions do not simply
report objectively necessary actions. Rather, they are devices used by speak
ers to evaluate and justify actions, so that the only way to analyze their
function is in terms of usage rather than 'meaning'. We have of course used an
intuitive notion of obligation to delimit our database, but this has been done
purely for convenience; neither our functional categories nor the morphemes
we have chosen to analyze are intended to exhaustively represent or define
obligation.
We have also not attempted to organize the categories of obligation
function into 'core' and 'peripheral' types, along the lines of Coates 1983. It
would of course be possible to choose one or another function as 'core'
obligation and rank the others according to the number of shared semantic
features. However, there appears to be no objective basis for choosing one
function over another as 'core', nor any obvious motivation for organizing the
data in this fashion.
It is probably the case that some of obligation functions are more lexical
while others are more grammatical. Research such as Bybee 1985 has demon
strated that some functions are consistently associated with more lexical
representation while other functions are consistently associated with more
grammatical representation, and it is clear that the future and inference
functions which obligation markers can develop are more grammatical than
the obligation functions we discuss here. However, among the obligation
functions we identify here, it is not clear which are more lexical and which
more grammatical. To determine this, it would be necessary to gather data
from a wide variety of languages and to look for cases in which some of the
functions are expressed through more lexical forms and others through more
grammatical forms. Alternatively, we might study in detail the diachronic
development of a single marker in a single language to see how this marker
has progressed from one function to another. If there is a change in terms of
Functions of Obligation Expressions 285
The present paper, then, has sought to show that obligation function can
be productively analyzed with quantitative methods. Largely due to historical
accident, quantitative discourse-oriented research has focused primarily on
word order and voice alternations, and it is important that linguists working in
this field expand the range of phenomena under investigation. Quantitative
methods produce results which cannot be obtained from introspection alone,
and they force the researcher to think in terms of objectively defineable
criteria. We do not deny the value of introspective research; however, we see
few if any linguistic phenomena which cannot benefit from quantitative as
well as introspective investigation.
NOTES
* We thank Joan Bybee and Suzanne Fleischman for their extensiveand helpful comments
on an earlier draft of this paper.
1 Both Palmer (1979:91) and Coates (1983:33) admit that it is often difficult to categorize
tokens according to the criterion of speaker involvement.
2 Shall is very rare in the original English texts we used and in the English translations of
the Hopi and Chinese texts. It is, on the other hand, quite common in the translation we
used of the Biblical Hebrew text. However, this is not a problem, because in Biblical
Hebrew we are analyzing 'obligation' function (including the more grammaticalized
functions which obligation markers develop) rather than obligation function, so there is
no need to distinguish the obligative shall from its future-oriented uses.
3 Lambdin (1971:129) notes that the expression 'al + NP + / + infinitive is used to express
obligation or responsibility, but there were no tokens of this construction in our data
base.
4 Except where otherwise indicated, gotta will include have got to, got to, and gotta, and
oughtta will include ought to . Must occurs only once in Wilson's dialogue, need to only
four times. Hence, we have not included them in this study.
5 Does (subj) gotta? is acceptable to some speakers, but it does not occur in our data. We
use the asterisk to indicate non-occurrence in our data base, not to indicate that we or the
playwright would reject a form as ungrammatical (although in some cases we would).
6 In one token, the subject is actually avoiding something emotionally or physically
unpleasant (No, you do that 'cause you got to, 'cause you'd be embarrassed not to
(Wilson 1976:117)).
7 This speaker-external source of authority is probably related to the low 'speaker involve
ment' or low 'subjectivity' associated with have to (see Section 1.1).
8 Conceptually, of course, reprimands are a type of counterfactual, e.g., You shouldn't
have done that is counterfactual because the auxiliary is negated but the addressee must
have done the action. We distinguished these categories in our coding by including as
reprimands only counterfactual clauses with 2nd person subjects.
Functions of Obligation Expressions 287
9 Two of the supposed to tokens may also be partly motivated by negation (...you aren't
supposed to be here rather than you ought not be here) and/or past tense; we get neither
* ought not nor *didn 't ought in our data.
10 Our oughtta reprimands are somewhat borderline obligation usages, since their verbs are
Stative (be ashamed, be used to). You cannot oblige someone to 'perform' a state; on the
other hand, you can oblige someone to perform an action that avoids or effects a state,
and on this reasoning, we have included such tokens.
11 This discussion of counterfactuals is not intended to be exhaustive; a much larger
database is needed for a thorough and systematic analysis of natural usage.
12 The association of supposed to with arrangements by unmentioned agents is likely due to
its origin as a passive construction (we thank Joan Bybee for suggesting this).
13 One have to token is negated with don't, which may explain the speaker's not using gotta.
14 Coates (1983:54) observes that, in its 'root' or obligation meaning, have got to is never
habitual while have to can be. She cites the distinction between / have to get up at 7 a.m.
every day and *I've got to get up at 7 a.m. every day.
15 Beijingren was written in 1940 and slightly revised by the author in 1954; in the text, we
cite it as Cao 1954. Leiyu was written in 1934 and revised twice by the author before its
1961 printing, and we cite it as Cao 1961.
16 This is important because dei is commonly recognized as a Beijing colloquialism. As a
Tianjin native (see Rand et al 1980), Cao Yu is not a representative speaker of the
Beijing dialect. It would be interesting to compare the dei vs. yao distinction in Cao's
work to that of a Beijing native, where some of the yao functions described here would
presumably be filled by dei.
17 All of these can also be used as main verbs, often translating as 'take, require', 'want',
and 'owe', respectively. Under gai we have included 7 tokens of yinggai, the latter often
described as a more formal variant. Though there are no doubt some differences in their
functions, we will not attempt to distinguish them here.
18 Our abbreviations are: ASP=aspect, PRT=particle, FUT=future, CL=classifier.
19 Unless otherwise indicated, translations of Beijingren examples are from Lo 1986 and
translations of Leiyu examples are from Wang and Barnes 1954.
20 The two exceptions refer to obligations to father-in-law and family, respectively. But
even these obligations are to some extent also specific to the circumstances: The
speaker's husband, who usually assumed part of this responsibility, has left.
Another feature distinguishing dei and yao tokens in the 'Negative effect on
subject' category is word order. In our data, most transitive verbs with yao (10/12, or
83%) have preverbal patients (see (44)). This is significant given that none of the four
transitive dei constructions in this category have preverbal patients, and yao construc
tions with preverbal patients using are rare (less than 10%) in our other functional
categories and in the language overall (Myhill and Xing 1992). It is not clear why yao
utterances which are 'Negative effect on subject' should have this unusual word order
pattern, but we did notice that patient-verb order is always used in the 8 instances where
the action is not marked as inherently negative. It may be that the patient-verb order is
used here to emphasize that the subject is obligated to do the action and to discourage the
intentional reading.
288 John Myhill & Laura A. Smith
21 This is our translation; Wang and Barnes 1958 omit this sentence in their translation.
22 This translation is our own and directly reflects the structure of the Chinese. The
translation in Lo 1986 is much looser: 'Shouldn't we listen to our consciences?'
23 See Li and Thompson 1981:307 for a description of the function of this particle.
24 Unfortunately, the translators appear to have had a stylistic bias against (have) got to;
they used it only once, in contrast with Wilson, who uses it frequently.
25 We have excluded 'Don't have to' clauses from this table, since these are not expressed
with any of the Chinese morphemes studied here.
26 Except for the four other tokens translated into English with an obligation morpheme.
27 We have decided to analyze only nawus after extensive study of Hopi texts and Albert
and Shaul's 1985 Hopi-English lexicon. We investigated several other words as possible
obligation markers. Albert and Shaul translate paapu as 'admonition (modal), should do
it (reminder)', but in the texts we consulted, the English translations of sentences with
paapu almost never have a word with obligation meaning. Albert and Shaul list 'should
do' as one of the functions of as, but this is an extremely common discourse particle with
a wide variety of functions, and only a fraction of the tokens using it have any obligation
meaning. They refer to naapas as a 'modal of disapprobation' ('you shouldn't do that');
naapas is used with this function in our database, but there are only three tokens of it.
Sonqa sometimes appears to function as a marker of obligation. However, it really
means 'definitely' (it is composed of son and qa, both negative markers). It translates
considerably more often as some non-modal marker of certainty than as an obligation
marker. It is of course an interesting question why certain markers which are not
basically obligation markers occur consistently (though infrequently) in sentences trans
lating into English with explicit obligation; this is relevant to the present study, but
beyond its scope.
28 We include here only the obligation meanings of these English words. This contrasts
with the translation data from Biblical Hebrew, where we include the usages of the
English words associated with obligation, whether or not the particular token appears to
represent obligation. The motivation for this distinction, as discussed in Section 1.2, is
that in Hopi we are dealing with a lexical, literal expression of obligation (nawus), which
we want to compare with literal obligation in English, while in Biblical Hebrew, as we
will see, we are dealing with a grammatical expression of obligation (signaled by a word
order alternation). We therefore want to include both concrete (literal) expressions of
obligation in English and the more abstract (grammatical) meanings related to obligation
which words associated with the expression of obligation later develop.
There are two tokens of nawus which we have marked as being translated with 'no
obligation word'. One is translated as 'It will take you several days to reach your goal'
(Malotki and Lomatuway'ma 1987a: 116), more literally 'You will have to arrive at your
goal in several days'. The other is translated as 'It's Maasaw only who rests during the
day and goes around at night' (Malotki and Lomatuway'ma 1987b:89); this is a repri
mand to someone who is sleeping too late, and it is more literally translated as 'It should
be only Maasaw who...' Both of these clearly convey some type of obligation meaning.
However, the fact that they are represented with an obligation form in Hopi but not
English shows that it is not totally accurate to say that nawus marks a subset of the
functions marked by English obligation markers.
Functions of Obligation Expressions 289
29 It is of course the case that mothers are generally in a social position which sanctions
giving commands to their children; thus mother-to-child commands are generally rela
tively unmarked. However, in the Hopi texts even this type of relationship calls for
nawus when the speaker is trying to orient the listener towards a new goal in order to deal
with a crisis which has arisen.
30 The data here also include subjects possessed by humans (e.g. 'His soul shall be cut off
from his people'). They exclude sentences with negatives and question words (which are
obligatorily clause-initial) and subordinate clauses (which have subordinators in initial
position).
31 Implausible imperative contexts are:
(a) The listener can't control the action because the verb doesn't take a volitional subject
(e.g. 'You shall be the father of many nations').
(b) The listener can't control the action because it is addressed to him and his descend-
ents (e.g. 'You shall keep my covenant').
(c) The listener can't control the action because it is addressed to other people not
present (e.g. 'You shall come into the ark, you and your sons and your wife and your
sons' wives with you.').
(d) The listener can control the action, but it is contingent on another action which the
listener cannot control (e.g. 'Your wife Sarah shall bear you a son, and you shall call
his name Isaac').
32 It appears that a number of the non-imperative constructions with (a) God speaking;
(b) a 2nd or 3rd person subject; (c) non-verb-initial order in Hebrew; and (d) shall in the
English translation are cases in which the Hebrew text portrays God as speaking in a
more cooperative, human fashion (which is not uncommon), but the English translation
nevertheless represents God in a transcendent manner. Examples are 'you shall be buried
at a ripe old age' (Genesis 15:15), where God makes a personal statement about
Abraham's future, and 'your descendants shall seize the gates of their foes' (Genesis
22:17), where God is rewarding Abraham for his willingness to sacrifice his son Isaac,
and so there is an element of cooperation. This may be compared with the more typical
case, where God simply makes a promise independent of any particular human action,
and non-verb-initial order is normal (e.g. 'Kings shall come from her' (Genesis 17:16)).
Scholars such as Heschel (1975) and Rosenberg and Bloom (1990) have argued, for
completely different reasons, that there is no evidence for a purely transcendent concep
tion of God in the Old Testament but that interpreters of the text influenced by Greek
thinking have overemphasized the transcendent aspect in their translations and interpre
tations; this would account for tokens where shall is used to translate verb-initial clauses.
33 Another possible explanation is that the verbal form typically used in non-verb-initial
constructions with 'obligation' function (the Imperfect) is historically derived from a
modal marker of obligation plus a verb which subsequently fuss together. The Imperfect
is characterized by personal prefixes agreeing with the subject — possibly derived from
fusion of a pronoun + obligation marker which eventually became prefixed to the verb.
This account is purelyspeculative, and we do not know of any historical evidence for it.
Additionally, any such account would be complicated by the fact that the Imperfect is
largely syncretic with two other forms, the Va-consecutive and the Jussive, which are
invariably (for the Va-consecutive) or almost invariably (for the Jussive) verb-initial.
290 John Myhill & Laura A. Smith
34 It is tempting to speculate that, since Hopi has the most restricted overt marking of
obligation of the languages investigated, the functions associated with nawus are the
most lexical, while other functions are more grammatical. This may turn out to be true.
However, even if nawus does represent a more lexical type of obligation function, it does
not necessarily represent the only such function; obligation markers are themselves
diachronically derived from words with other meanings, and presumably the function of
a given obligation marker will depend upon the meaning of its source.
DATA SOURCES
Cao, Yu. 1962 (1934, rev 1961). Leiyu ('Thunderstorm'). In: Caoyu Xuanji (Selected
Plays by Caoyu). Beijing: People's Literary Press.
Cao, Yu. 1962 (1940, rev 1954). Beijingren ('Peking Man'). In: Caoyu Xuanji (Selected
Plays by Caoyu). Beijing: People's Literary Press.
Geertz, Armin W. and Michael Lomatuyway'ma. 1987. Children of cottonwood: Piety
and ceremonialism in Hopi Indian puppetry. American Tribal Religions, vol. 12.
Lincoln: University of Nebraska Press.
Lo, Leslie Nai-Kwai (et al), transl. 1986. Cao Yu Peking Man. New York: Columbia
University Press.
Malotki, Ekkehart, and Michael Lomatuway'ma. 1987a. Stories of Maasaw, a Hopi god.
American Tribal Religions, vol. 10. Lincoln: University of Nebraska Press.
Malotki, Ekkehart, and Michael Lomatuway'ma. 1987b. Maasaw: Profile of a Hopi god.
American Tribal Religions, vol. 11. Lincoln: University of Nebraska Press.
Plaut, W. Gunther. 1981. The Torah: A modern commentary. New York: Union of
American Hebrew Congregations.
Wang, Tso Liang and A. C. Barnes, transl. 1958. Thunderstorm (Translation based on
minor 1954 revisions). Beijing: Foreign Language Press.
Wilson, Lanford. 1973. The Hot L Baltimore. New York: Hill and Wang.
Wilson, Lanford. 1976. The Mound Builders. New York: Hill and Wang.
Wilson, Lanford. 1987. Burn This. New York: Hill and Wang.
REFERENTIES
Albert, Roy, and David Leedom Shaul. 1985. A Concise Hopi and English Lexicon.
Amsterdam: John Benjamins.
Brown, Penelope, and Stephen C. Levinson. 1987. Politeness: Some Universals in Lan
guage Usage. Cambridge: Cambridge University Press.
Bybee, Joan. 1985. Morphology: A Study of the Relation between Meaning and Form.
Amsterdam & Philadelphia: John Benjamins.
Bybee, Joan, and William Pagliuca. 1987. "The Evolution of Future Meaning." Papers
from the 7th International Conference on Historical Linguistics, ed. by A.G. Ramat et
al., 109-122. Amsterdam & Philadelphia: John Benjamins.
Functions of Obligation Expressions 291
Chao, Yuen Ren. 1968. A Grammar of Spoken Chinese. Berkeley: University of Califor
nia Press.
Coates, Jennifer. 1983. The Semantics of the Modal Auxiliaries. London: Croom Helm.
Cooreman, Ann. 1988. "Ergativity in Dyirbal Discourse." Linguistics 26.717-746.
Cooreman, Ann, Barbara Fox, and T. Givón. 1984. "The Discourse Definition of
Ergativity." Studies in Language 6.343-374.
Fox, Andrew. 1975. "Topic continuity in Biblical Hebrew narrative." Givón 1983. 215-54.
Givón, T. 1975. "Serial Verbs and Syntactic Change: Niger-Congo." Word Order and
Word Order Change ed. by Charles N. Li, 47-112. Austin: University of Texas Press.
Givón, T. 1977. "The Drift from VSO to SVO in Biblical Hebrew: The Pragmatics of
Tense-Aspect." Mechanisms of Syntactic Change ed. by C.N. Li, 181-254. Austin:
University of Texas Press.
Givón, T. (ed.). 1983. Topic Continuity in Discourse. Amsterdam & Philadelphia: John
Benjamins.
Heschel, Abraham J. 1975. The Prophets, Vol. II. New York: Harper and Row.
Herring, Susan. 1990. "Topic and Focus Positions as a Consequence of Word Order Type:
What is Universal?" Paper presented at the International Pragmatics Conference.
Hopper, Paul. 1979. "Aspect and Foregrounding in Discourse." Discourse and Syntax
{Syntax and semantics 12), 213-241. New York: Academic Press.
Lambdin, Thomas O. 1971. Introduction to Biblical Hebrew. New York: Charles
Scribner.
Li, Charles N. and Sandra A. Thompson. 1981. Mandarin Chinese: A Functional Refer
ence Grammar. Berkeley: University of California Press.
Liu, Yuehua. 1983. Shiyong Xiandai Hanyu Yufa (Chinese Grammar). Beijing: Foreign
Language Teaching and Research Press.
Lyons, John. 1977. Semantics. Cambridge: Cambridge University Press.
Myhill, John. 1984. "A Study of Aspect, Word Order, and Voice" University of Pennsyl
vania Ph.D. disseration.
Myhill, John. 1986. "The Two VS Constructions in Rumanian." Linguistics 24.331-350.
Myhill, John. 1992. "Word Order and Temporal Sequencing." The Pragmatics of Word
Order Flexibility ed. by Doris Payne, 265-278 Amsterdam & Philadelphia: John
Benjamins.
Myhill, John. 1993. A Verb Position in Future Clauses in Biblical Hebrew. Language
Variation and Change 4.289-309.
Myhill, John, and Zhiqun Xing. 1992. "Towards an Objective Definition of Contrast."
University of Michigan manuscript.
Palmer, Frank. 1974. The English Verb. London: Longman.
Palmer, Frank. 1979. Modality and the English Modals. London: Longman.
Palmer, Frank. 1986. Mood and Modality. Cambridge: Cambridge University Press.
Payne, Doris. 1987. "Information Structuring in Papago Narrative Discourse." Language
63.783-804.
Rand, Christopher C. and Joseph S. M. Lau, transl. 1980. The Wilderness (Yuan-Yeh), by
Cao Yu. Bloomington: Indiana University Press.
Rosenberg, David, and Harold Bloom. 1990. The Book of J. New York: Grove
Weidenfeld.
292 John Myhill & Laura A. Smith
Sun, Chao-Fen, and T. Givón. 1985. "On the So-Called SOV Word Order in Mandarin
Chinese." Language 61.2.329-51.
Thompson, Chad. 1989. Voice and Obviation in Athabaskan and other Languages.
University of Oregon Ph.D. dissertation.
Ultan, Russell. 1978. "The Nature of Future Tenses." Universals of Human Language,
vol. 3, ed. by J. Greenberg. 83-123, Stanford: Stanford University Press.
Apprehensional Epistemics
Frantisek Lichtenberk
University of Auckland
1. Introduction1
Epistemic modality, one type of which is the focus of this study, is character
ised by subjectivity, by pragmatic force. It has to do with "belief, knowledge,
truth, etc. in relation to proposition" (Palmer 1986:96), and with "the degree
of commitment by the speaker to what he says" (Palmer 1986:51).
Many languages exhibit a historical relation between epistemic modality
on the one hand and other types of modality on the other. Bybee and Pagliuca
(1985:63) refer to the latter group collectively as 'agent-oriented modalities',
including the notions of ability, obligation, desire, and intention. The results
of their extensive study suggest a unidirectional process of development of
agent-oriented modalities into epistemic modalities (Bybee and Pagliuca
1985:66). Developments in the opposite direction do not occur. To say that
agent-oriented modalities develop into epistemic modalities rather than vice
versa is, of course, not to say that epistemic modalities can only develop from
agent-oriented modalities. It remains an empirical matter to determine what
other sources of epistemic modalities there are; the type of epistemic modality
to be discussed here does not (typically) develop from agent-oriented modali
ties.
This study is concerned with a type of modality that is both epistemic and
attitudinal: it has to do with the speaker's degree of certainty about the factual
status of a proposition and also with his or her attitude concerning the
desirability of the situation encoded in the clause. This is a case not of
polysemy but of mixed modality (von Wright 1951): both types of modality
294 Frantisek Lichtenberk
are present simultaneously. For reasons that will become apparent later, I will
refer to this mixed modality as 'apprehensional-epistemic'.
The development of apprehensional-epistemic modality is of relevance
to the study of semantic/functional change, specifically to the issue of the
directionality of change in terms of degrees of subjectivity and pragmatic
strength. Traugott (1989 and elsewhere (see Section 5 for references)) has
suggested that semantic change is characterized by an increase rather than a
decrease in subjectivity, by pragmatic strengthening rather than weakening.
This is true of semantic change in lexemes and also of the early stages of
grammaticalization. Although the data discussed here by and large support
Traugott's hypothesis, they at the same time suggest a refinement.
I will start by discussing in some detail data from To'aba'ita, an Austro-
nesian language spoken in the Solomon Islands. Although the information on
the present-day uses of the element in question is reasonably detailed, there is
no direct historical evidence concerning the development of the modality. To
overcome this problem, I will first consider some indirect, structural and
historico-comparative evidence, and, following that, data from several other
languages where similar independent developments have taken place. In the
case of two of those languages, there is some direct historical evidence
bearing on the issue. From the various kinds of evidence available, a general
pattern in the development of the modality emerges even though there are
some differences of detail among the languages.
Compare:
I will refer to the use of ada (and the LEST elements in other languages)
with clauses embedded under predicates of fearing as the 'fear' function. In
the fear function also, the embedded clause introduced by ada must have a
Sequential subject/tense marker, regardless of the temporal status of its propo
sition vis-à-vis the temporal status of the proposition of the higher clause. In
(9) the situation encoded in the ada clause necessarily precedes the time of
fearing:
(9) Nau ku ma 'u 'asia na'a ada to'a na'i ki
I I:FACT be.afraid very LEST people this PL
keka lae mai keka thaungi kulu.
they:SEQ go hither they:SEQ kill US(INCL)
'I am scared the people may have come to kill us.'
Compare the parallel example (8) above, where the situation encoded in the
ada clause is subsequent to the time of fearing; in both cases the subject/tense
marker in the ada clause is sequential.
Even though ada clauses require the Sequential/subject tense markers,
regardless of the temporal status of the situation encoded in the clause, there is
an important distinction between present-time and past-time situations on the
one hand and future-time situations on the other (including future in the past).
The distinction has to do with the possibility of precautions against a possible
undesirable state of affairs, whether in the sense of averting it altogether, or in
the sense of somehow alleviating its (anticipated) effects. One cannot take
precautions against past and present situations, but one can take at least some
precautions against possible future situations. In some cases, one may be able
to avert a potential future situation; one may, for example, hide in order not to
be seen:
when it encodes negative purpose, the purpose clause is negative. See exam
ples (15) and (16), respectively:
(15) Fale-a ta-si fanga 'ana wela na'i
give-it some-CLASS food to child this
fasi(-a) ka bona.
PURp(-it) he:sEQ be quiet
'Give some food to the child so that he is quiet.'
(Fasi, which ultimately derives from a transitive verb 'leave, forsake' (see
further below), optionally indexes the purpose clause by the object marker -a.)
(16) Nau ku agwa 'i buira fau fasi-a wane 'eri
I I:FACT hide at behind rock PURP-it man that
'e a'i si riki nau.
it:FACT NEG.VB he:NEG see me
'I hid behind a rock so that the man might not see me.'
Although both fasi and ada can mark negative purpose (compare 16 and
12), ada is preferred in this function. However, unlike ada, fasi does not have
the in-case function. The following examples demonstrate the similarities as
well as the differences between ada and fasi. Examples (17) and (18) demon
strate the avertive, negative-purpose uses:
(17) Ngali-a kaleko 'aa'ako fasi 'osi
take -them clothes warm PURP you(SG):NEG
gwagwari 'a-fa rodo.
be cold at-CLASS night
'Take warm clothes so that you are not cold at night.'
(18) Ngali-a kaleko 'aa'ako ada 'oko gwagwari
take -them clothes warm LEST you(sG):SEQ be cold
'a-fa rodo.
at-CLASS night
'Take warm clothes so that you are not cold at night.'
It is even possible for fasi and ada to be used jointly to mark negative
purpose (although this is not common), in which case it is ada, not fasi, that
determines the structure of the clause encoding the situation to be averted (the
clause is grammatically positive). Compare in this regard (19) below with
Apprehensional Epistemics 301
At an early stage, the etymon functioned as a verb meaning 'see, look at,
watch'. Later the verb acquired a 'warning' meaning, 'look out, watch out';
see the South Lau example (25) above. The use of verbs meaning 'see, look at,
watch' with a warning meaning is not unusual crosslinguistically. It is found
in, for example, English (look out, watch out), Basque (begira 'look at',
begira gero! 'be careful!' (gero 'later')), Zulu (beka 'look, look at, watch,
take precautions against, be careful'), and also in present-day To'aba'ita
(compare the uses of riki in (27) and (28) below). Sweetser (1990) points out
a common type of polysemy in Indo-European languages that unites mean
ings having to do with vision and meanings having to do with guarding,
keeping control, and monitoring (although this kind of polysemy can appar
ently arise in either direction).
Ada was used as a verb of warning against possible future undesirable
situations: *ada + clause 'look out, Y might/will happen', 'look out so that Y
may not happen'. The use, in present-day To'aba'ita, of the sequential sub
ject/tense markers in ada clauses embedded under verbs of fearing and in
independent ada sentences regardless of the temporal status of the proposition
encoded in the ada clause is evidence that when ada was used as a verb of
warning it was so used with reference to situations subsequent to the act of
warning. Most likely, ada was used in this function in exhortations and
commands ('Look out,...!'), which in To'aba'ita are normally expressed by a
simple verb, without a subject/tense marker, as in (26) (and in 28 below):
And just as once was the case with ada, riki also has the meaning 'look out,
watch out'. It can thus cooccur with ada:
(28) Riki -a ada 'oko dekwe-a kwade'e kuki 'ena.
see -it LEST you(SG):SEQ break -it empty pot that
'Look out; you might break the empty pot.'/'Mind you don't break
the empty pot.'
When ada still functioned as a verb, it was used to encode a general
precautionary event: 'look out, watch out'; the specific nature of the precau
tionary event depended on the situation. After ceasing to function as a verb,
ada continued as a grammatical element with a warning function signaling
that a precaution was to be taken in view of a possible adverse future situation.
In this function, ada began to encroach on the territory previously occupied by
the negative-purpose marker, present-day fasi. As a result, fasi has to a large
degree ceded negative-purpose marking to ada; today it is used primarily to
signal positive purpose. The details of the historical process whereby ada has
nearly displaced fasi in marking negative purpose may be unrecoverable, but
the fact that fasi and ada may cooccur in present-day To'aba'ita (see 19
above) can be taken as evidence of the process.
The apprehensional-epistemic function of ada is a later development
from the precautioning function, most likely through an intermediate stage of
the fear function. An undesirable future situation is likely to be feared.
Through this metonymy, ada clauses began to be embedded under predicates
of fearing. When one fears a situation, one fears a potential situation: T am
afraid that Y might happen/have happened/be happening'. In a discussion of
LEST clauses subordinated to predicates of fearing in Ngiyambaa (Australian),
Donaldson (1980:286) says that such sentences express '"fear of the possibil
ity of [e.g.] falling', not 'fear of [e.g.] falling' as such" (original emphasis).
That the fear function is a later development, postdating the warning and
precautioning functions is suggested by the use of the sequential subject/tense
markers in ada clauses embedded under verbs of fearing even if the situation
encoded in the ada clause is not posterior to the time of fearing, as in (9)
above. It is conceivable that at an earlier stage ada was used to express
warning only about potential situations against which precautions could be
taken, i.e. situations subsequent to the act of warning, and that by the time ada
developed the fear function, the subject markers had become devoid of the
sequentiality-marking function there.
306 Frantisek Lichtenberk
The final step (thus far at least) in the history of ada is the development of
the apprehensional-epistemic function. At the 'fear' stage, apprehension was
signaled by the predicate dominating the ada clause; the ada clause encoded a
possible situation. Later, through metonymy the notion of apprehension came
to be associated with ada. That is, since ada was used to introduce clauses that
encoded apprehension-causing situations (as signaled by the higher verb), it
acquired the connotation of apprehension. After this had happened, a predi
cate of fearing was no longer necessary. The notions of possibility and
apprehension were both signaled by ada; as an apprehensional-epistemic
complementizer, ada could now introduce independent sentences.
eration as well, the LEST element can be said to have a precautioning function.
Similarly, in the following (made-up) example the LEST element could be seen
as having not just a fear function but also an avertive function: 'Don't let him
go. I fear lest he should go in the water and drown.' Although the distinctions
among the precautioning, fear, and apprehension-epistemic functions are
clear in principle, in some cases the identification of the function of a LEST
element depends on whether or not one takes wider, extrasentential context
into consideration.
Note the absence of nominal (accusative or locative) marking on the verb
'go' in (35). According to Dench (1988:110-111),
the speaker apparently chooses not to imply that the unpleasant situation
will come about as a consequence of the addressee's actions or inaction. The
marking options — accusative, locative, or unmarked — allow a range of
interpretations about the responsibility on the part of the participants for the
situation described by the lest clause.
Aby can be used to express negative purpose, but it cannot be used with
an in-case function. For that reason, (38) is incongruous:
(38) #Vem si deštník aby
you(SG).take:IMP MID:DAT umbrella:ACC PURP:3SG
ti ne -prselo na cestu.
you(sG):DAT NEG-it.rained on road:ACC
#'Take an umbrella so that it does not rain while you're on the
road.'
(38) cannot be used to mean 'Take an umbrella in case it rains while you're on
the road.' 7
While aby is decidedly anomalous in (38), it can be used to introduce
parallel clauses that are embedded under a predicate of fearing or that are
independent; see (39) and (41) below.
Negative clauses introduced by aby may be embedded under predicates
of fearing:
(39) Bojím se aby nám ne -pršelo na
Lam.afraid MID:ACC LEST:3SG US:DAT NEG -it.rained on
cestu.
road:ACC
'I'm afraid it might rain while we're on the road.'
The feared situation need not be subsequent to the time of fearing:
(40) Bojím se aby ne -byl nemocnej.
Lam.afraid MID:ACC LEST:3SG NEG-he.was sick
'I'm afraid he might be sick.'
Finally, aby, introducing a negative clause, may also be used with an
apprehensional-epistemic function to encode a possible undesirable situation,
future or not:
(41) Aby nám ne -prselo na cestu.
LEST:3SG US:DAT NEG-it.rained on road:ACC
'[I fear] it might rain while we're on the road.'
Aby has an avertive, negative-purpose use, but not an in-case use. Never
theless, in the fear and the apprehension-epistemic functions aby can be used
in encoding situations that cannot be averted, either because they are of the
non-avertible type (39, 41) or because they may exist already (40, 42). This
suggests that the apprehensional-epistemic function developed from the fear
function rather than directly from the avertive, negative-purpose use.
Another example of the development of the apprehensional-epistemic
function comes from Classical Greek. The LEST element had the form mé:. An
early function of mé: was that of a negator used in certain contexts. (There
was another negator ou, used elsewhere.) Mé: was used in prohibitions, which
can be thought of as a kind of avertive function: a prohibition is issued in order
to avert an undesirable situation. Unlike the cases considered thus far, in
Greek prohibitions the precautioning situation is not encoded in a clause of its
own; instead, it consists of the speech act of issuing a prohibition:
shown by the fact that the cautious-assertion use of mé: is not restricted to
future situations. Mé: may be used with possible present or past situations (in
which case the verb is in the appropriate indicative rather than the subjunctive):
(49) Aristotle, Ethics X. 1, 3:
mé: pote dè ou kalô:s légetai
LEST at some time but NEG well it.is.said:IND
'but it may be that this is not well said' (Goodwin 1929:93)
The next set of examples comes from English. English has the form lest,
which is usually said to have two functions, which the OED identifies as
follows: First, it is used "as a negative particle of intention or purpose,
introducing a clause expressive of something to be prevented or guarded
against". This is a precautioning function, avertive, as in (50), and in-case, as
in (51):
(50) He was put in a cell with no clothes and shoes lest he injure
himself. (Collins COBUILD English Grammar, p.355)
(51) He remained inside the fortress lest the enemy should arrive in
great numbers.
The following sentence is ambiguous between an avertive and an in-case
interpretation of the LEST clause, depending on whether the medicine is to
prevent the illness or cure it:
(52) When you go to Africa, take this medicine with you lest you
(should) get ill.
The following sentence also is open to both an avertive and an 'in-case'
interpretation; in fact, the two interpretations are not mutually exclusive — an
instance of what Coates (1983) calls 'merger':
(53) Hole, C , English Shrines and Sanctuaries:
And lest future generations should wonder why St. Withburga
herself is not enshrined here, a tablet informs the visitors that the
monks stole the relics. (Scheurweghs 1959:381)
The tablet is there in case the visitors should wonder why St. Withburga is not
enshrined there (absence of the tablet would not necessarily bring about the
visitors' wondering), and its presence does remove the visitors' (further)
wondering.
314 Frantisek Lichtenberk
(58) Sir, — To those of Yugoslav origin who keep urging New Zealand
involvement in the affairs of their homeland, I would like to remind
them of the terrible cost to the world, and New Zealand, the last
time someone acted violently in Sarajevo. Lest we forget.(A letter
to the editor, New Zealand Herald, 11/20/1991.)
In (58) lest has an apprehensional-epistemic function: '[I fear] we might
forget'. It is lest itself that expresses the apprehension about a potential
undesirable event.
Lest is not common in modern English; a recent grammar characterises it
as formal, old-fashioned (Collins COBUILD English Grammar). Given the
relative obsolescence of lest, it is unlikely that its apprehensional-epistemic
use will ever become well established. (The few examples that I have come
across of lest introducing an independent sentence all contain the verb forget.
This usage is most likely ultimately modeled on Rudyard Kipling's use of lest
in his poem Recessional: Lord God of Hosts, be with us yet, Lest we forget —
lest we forget... Judge of the Nations, spare us yet, Lest we forget — lest we
forget.)
The last set of examples comes from Fijian, an Austronesian language,
but only relatively distantly related to To'aba'ita; there is no reason to assume
that the developments in the two languages are related to each other.8 What is
interesting about Fijian is that a further development of the LEST element has
taken place there, beyond the apprehensional-epistemic stage. The examples
below come from Standard Fijian (essentially the Bauan dialect) and from the
Boumaa dialect; the LEST element appears in several different forms, depend
ing on the dialect, on its linguistic environment, and on the source of the data:
dē or de in Standard Fijian and dee in Boumaa.
The LEST element is used with a precautioning function:
(59) Standard Fijian
kua ni driva vakatotolo de -o na qäi
PROH COMP drive fast LEST-you(SG) FUT SEQ
coqa
run into s.t.
'don't drive fast lest you run into something' (Schütz 1985:442)
A LEST clause can be embedded under a predicate of fearing:
316 Frantisek Lichtenberk
With (67), according to Dixon (1988:270), "what the speaker means (and
is understood to mean) is 'I feel that the motion should be changed'":
(67) Boumaa Fijian
au aa rai -ca dee rawa ni veisau -ta 'i
I PAST see-TRANS POLITE can COMP change-TRANS
a moosoni yai
ART motionthis
'I felt that the motion might possibly be changed'
(Dixon 1988:270)
The politeness function of the etymon is also found in Standard Fijian:
(68) Standard Fijian
dē rawa ni tou cakacaka vata
POLITE can COMP we(PAUC,INCL) work together
'perhaps we could work together' (Geraghty. p.c., July, 1993)
According to Geraghty (personal communication), (68) expresses a po
lite suggestion: the speaker desires to work together with the others.
What has happened in Standard and Boumaa Fijian is that the LEST
element is losing the necessarily apprehensional connotation and is develop
ing into a neutral epistemic downtoner; furthermore, on its way to becoming a
neutral epistemic downtoner, the element has come to be used as a politeness
marker. 9 Crosslinguistically, epistemic downtoning is a common feature of
polite speech, a marker of deference (see Givón 1990:822 for some refer
ences). 10
Since epistemic downtoning is one of the features of politeness, how can
we be certain that the Fijian LEST element has acquired a new pragmatic
function, that of a politeness marker? There are two pieces of evidence that
this is the case. First, in polite speech, the element is (normally) followed by
rawa 'can' (Dixon 1988:269, Geraghty, personal communication, July,
1993); this does not appear to be the case with the other uses. Second, and
more importantly, in the polite-speech use there need not be any epistemic
uncertainty: "dee rawa can be used to make a strong point in a polite way"
(Dixon 1988:270; emphasis added): in (67) the speaker is expressing his
belief (and is understood as such) that "the motion should be changed" (ibid).
Similarly, in (68), according to Geraghty, the speaker makes a suggestion in a
polite way. In one function, the element signals epistemic uncertainty without
Apprehensional Epistemics 319
whether these are merely lacunae in the data or whether the apprehensional-
epistemic function may indeed develop directly from the precautioning func
tion.
Crosslinguistically, apprehensional epistemics are used typically with
respect to future situations, but non-future uses are possible as well, at least in
some languages. (This is also true of the fear uses of the LEST elements.) The
typically future use of apprehensional epistemics is to be ascribed to 'persist
ence' (Hopper 1991) from the avertive, negative-purpose use (purpose being
necessarily future-oriented).11
An apprehensional epistemic may lose the connotation of apprehension
and thus become a neutral epistemic downtoner, even though the apprehen
sional connotation may still be the typical case. This kind of development has
taken place in Standard and Boumaa Fijian.
The category of the precautioning function subsumes the avertive and the
in-case functions. At this stage, because of lack of relevant data, it is impossi
ble to tell whether there is a general unidirectional historical process whereby
one of the two functions develops from the other, avertive from in-case, or in
case from avertive. Example (53) above suggests that the development of one
function from the other proceeds through a state where both meanings may be
present or available simultaneously in a sentence (see also 52).
As far as the sources of the precautioning function, as a category, are
concerned, little can be said at present. In To'aba'ita, the precautioning
function derives from a verbal warning meaning, 'look out, watch out', itself
from 'see, look at, watch'. English lest derives from a comparative construc
tion, 'by which less', while the Classical Greek apprehensional epistemic can
be traced back to a negator. I do not have relevant information on any of the
other LEST elements discussed above. Wayan Fijian has a LEST element with
avertive, in-case, and fear functions, but apparently not an apprehensional-
epistemic function (Pawley and Sayaba, forthcoming; Pawley, personal com
munication, October, 1992). The Wayan LEST element (unrelated to the
Standard and Boumaa Fijian and the To'aba'ita LEST elements) derives from a
verb meaning 'survey, watch, see s.t., take a close look at s.t., examine or look
carefully at s.t.'; this is reminiscent of the development of the To'aba'ita LEST
element. Central Siberian Eskimo has a Volitive-of-Fear mood, which ex
presses the notions of 'for fear that Y', 'lest Y'. According to de Reuse (1991),
the mood marker (a suffix on verbs) derives from the verb 'think'; there is no
mention of an apprehensional-epistemic function.
Apprehensional Epistemics 321
are subject to T3, on the one hand, and the development of other, non-
epistemic meanings, which are subject to Tl and T2, on the other. The
development of non-epistemic meanings is typically metaphor-based, while
the development of epistemic meanings is based on metonymy, in particular
on the conventionalization of conversational implicatures/invited inferences.
Now, assuming that the scenario concerning the development of appre-
hensional epistemics presented above is essentially correct, it is evident that
— where relevant — the developments in question do in many respects follow
the three tendencies identified by Traugott. The change in the meaning of
To'aba'ita ada from 'see, look at, watch' to 'look out, watch out' exemplifies
both Tl and T2. The change is from a relatively objective meaning to a
relatively subjective meaning, and the lexeme comes to be used to perform a
speech act: a situation is interpreted by the speaker as undesirable, and the
lexeme is used to issue a warning. It is the speaker's own interpretation of a
situation as one calling for a warning (others need not interpret the same
situation in this way).
In the absence of historical evidence, the details of the development
cannot be determined, but it is not implausible to assume that after ada
acquired the warning function, ada clauses began to be preceded by clauses
encoding precautionary situations, and ada eventually became a conjunction
with new, precautioning — avertive and in-case — functions. What we get is
development of a textual element from a speech-act element. 12 When ada
clauses began to appear embedded under predicates of fearing, ada acquired a
new textual function, that of a complementizer. Finally, the development of
the apprehensional-epistemic function from the fear function is an instance of
T3: ada comes to signal both apprehension and less-than-full certainty. 13
Traugott (1989:35) says that "meanings tend to become more subjec
tive". While it is true that changes from less to more subjective meanings can
be observed in the developments of the apprehensional epistemics, there are
also developments that do not fit this pattern. First, although it is true that the
development of apprehensional epistemics involves pragmatic strengthening
(the LEST element coming to signal uncertainty and apprehension), this is not
necessarily the end of the process of semantic/pragmatic change. As we have
seen in the case of Fijian, a LEST element may lose its necessarily apprehen
sional force and come to be used as a neutral epistemic downtoner. When an
Apprehensional Epistemics 323
ABBREVIATIONS
NOTES
1. I am grateful to Reuel Riianoa and Lawrence Foanaota for the To'aba'ita data, to
Athanasius Faifu for the South Lau data, to Paul Geraghty and Albert Schütz for data and
information on Standard Fijian, to Andrew Pawley for information on Wayan Fijian, and
to Frantiska Lichtenbergová for her help with the Czech data. In revising this paper, I
have greatly profited from discussions at the Mood and Modality Symposium and from
detailed comments by Suzanne Fleischman.
2. The other conventions and abbreviations used in glossing the examples are given above.
3. Besides fear, Palmer includes hopes and wishes under volitive modality, "fear [being]
essentially the counterpart of hope (not of wish)" (Palmer 1986:119).
Apprehensional Epistemics 325
REFERENCES
Adamson, Sylvia, Vivien Law, Nigel Vincent & Susan Wright, eds. 1990. Papers from the
5th International Conference on English Historical Linguistics, Cambridge, 6-9 April
1987. Amsterdam/Philadelphia: John Benjamins.
Aronoff, Mark, ed. 1991. Proceedings of the LSA Arizona Institute Workshop on Mor
phology. SUNY Press.
326 Frantisek Lichtenberk
Austin, Peter. 1981. A Grammar of Diyari, South Australia. Cambridge Studies in Lin
guistics 32. Cambridge: Cambridge University Press.
Austin, Peter, ed. 1988. Complex Sentence Constructions in Australian Languages. Ams
terdam/Philadelphia: John Benjamins.
Bybee, Joan L. & William Pagliuca. 1985. "Cross-Linguistic Comparison and the Devel
opment of Grammatical Meaning." Fisiak 1985. 59-83.
Capell, A. 1973. A New Fijian Dictionary (4th ed.). Suva, Fiji: Government Press.
Churchward, C. Maxwell. 1941. A New Fijian Grammar. Suva, Fiji: Government Press.
Coates, Jennifer. 1983. The Semantics of the Modal Auxiliaries. London and Canberra:
Croom Helm.
Collins COBUILD English Grammar. 1990. London and Glasgow: Collins.
Dench, Alan. 1988. "Complex Sentences in Martuthunira." Austin 1988. 97-139.
Dixon, R. M. W. 1977. A Grammar of Yidin. (Cambridge Studies in Linguistics, 19.)
Cambridge: Cambridge University Press.
Dixon, R. M. W. 1988. A Grammar of Boumaa Fijian. Chicago and London: University of
Chicago Press.
Donaldson, Tamsin. 1980. Ngiyambaa: The Language of the Wangaaybuwan. (Cam
bridge Studies in Linguistics, 29.) Cambridge: Cambridge University Press.
Fisiak, Jacek, ed. 1985. Historical Semantics: Historical Word-Formation. Berlin: Mou
ton.
Fox, Charles E. 1974. Lau Dictionary. Pacific Linguistics C-25. Canberra: Australian
National University.
Givón, T. 1990. Syntax: A Functional-Typological Introduction, Vol. II. Amsterdam/
Philadelphia: John Benjamins.
Goodwin, William W. 1929. Syntax of the Moods and Tenses of the Greek Verb. London:
Macmillan and Co.
Haiman, John. 1980. Hua: A Papuan Language of the Eastern Highlands of New Guinea.
(Studies in Language Companion Series 5.) Amsterdam: John Benjamins.
Hall, John R. C. 1950. Beowulf and the Finnesburg Fragment (new ed.). London: George
Allen & Un win.
Hopper, Paul J. 1991. "On Some Principles of Grammaticization." Traugott & Heine
1991. 17-35.
Horn, Laurence R. 1989. A Natural History of Negation. Chicago and London: University
of Chicago Press.
Jespersen, Otto. 1933. Essentials of English Grammar. London: George Allen & Unwin.
Keesing, R. M. 1975. Kwaio Dictionary. Pacific Linguistics C-35. Canberra: Australian
National University.
Klaeber, Fr. 1950. Beowulf and the Fight at Finnsburg (3rd ed.). Boston: D. C. Heath.
Lehmann Winfred P. and Yakov Malkiel, eds. 1982. Perspectives on Historical Linguis
tics. Amsterdam: John Benjamins.
Lichtenberk, Frantisek. 1991. "On the Gradualness of Grammaticalization." Traugott &
Heine 1991. 37-80.
Mitchell, Bruce. 1985. Old English Syntax (Vol. II). Oxford: Clarendon Press.
Oxford English Dictionary (2nd ed.). 1989. Oxford: Clarendon Press.
Palmer, F. R. 1986. Mood and Modality. Cambridge: Cambridge University Press.
Apprehensional Epistemics 327
Pawley, Andrew. 1982. "A Sketch Grammar of the Nabukelevu Language of Kadavu." Te
Reo 25.35-93.
Pawley, Andrew and Timoci Sayaba. Forthcoming. Wayan Dictionary: Describing the
Wayan Dialect of the Western Fijian Language. (Pacific Linguistics.) Canberra:
Australian National University.
de Reuse, Willem J. 1991. "The Role of Internal Syntax in the Historical Morphology of
Eskimo". Aronoff 1991.
Scheurweghs, G. 1959. Present-Day English Syntax: A Survey of Sentence Patterns.
London: Longmans.
Schütz, Albert J. 1985. The Fijian Language. Honolulu: University of Hawaii Press.
Shearin, Hubert G. 1909. 'The Expression of Purpose in Old English Poetry." Anglia
32:235-252.
Sweetser, Eve. 1990. From Etymology to Pragmatics: Metaphorical and Cultural Aspects
of Semantic Structure. (Cambridge Studies in Linguistics 54.) Cambridge: Cambridge
University Press.
Traugott, Elizabeth Closs. 1982. "From Propositional to Textual and Expressive Meanings;
Some Semantic-Pragmatic Aspects of Grammaticalization." Lehmann & Malkiel
1982.245-271.
Traugott, Elizabeth Closs. 1989. "On the Rise of Epistemic Meanings in English: An
Example of Subjectification in Semantic Change." Language 65.31-55.
Traugott, Elizabeth Closs. 1990. "From Less to More Situated in Language: The Unidirec-
tionality of Semantic Change." Adamson et al. 1990, 497-517.
Traugott, Elizabeth Closs & Bernd Heine, eds. 1991. Approaches to Grammaticalization,
Vol. I: Theoretical and Methodological Issues. Amsterdam/Philadelphia: John
Benjamins.
Traugott, Elizabeth Closs & Ekkehard König. 1991b. 'The Semantics and Pragmatics of
Grammaticalization Revisited." Traugott & Heine 1991. 189-218.
Tryon, D. T. & B. D. Hackman. 1983. Solomon Islands Languages: An Internal Classifi
cation. Pacific Linguistics C-72. Canberra: Australian National University.
von Wright, Georg H. 1951. An Essay in Modal Logic. Amsterdam: North-Holland.
Moods and MetaMessages.
Alienation as a Mood
John Haiman
Macalester College
0. Introduction
1. The sarcastive
One might (for example) call the sarcastive that modality (or, more plausibly,
that collection of related modalities) which expresses the speaker's belief that
the content of the ostensible message is not only false, but ridiculous. Like
many other propositional attitudes, sarcasm is easily recognized, and we have
a folklore about sarcasm and an extensive vocabulary for talking about it. But
there is no spoken language familiar to me in which the sarcastive modality is
marked with a clearly recognized and stable separate morpheme of the sort
that gets written up in sober grammatical descriptions.
There are a small number of very widespread and iconically motivated
suprasegmental markers of disgust (e.g. nasalization), non-commitment (e.g.
monotone), contemptuous mimicry (e.g. falsetto) and alienation (e.g. sing
song) which signal lack of belief, identification, or enthusiasm in one's
message. I have described some of these in greater detail elsewhere (Haiman
1989, 1990,1991).
But there are segmental (and, in the written language, orthographical)
realizations of these attitudes as well. Here are a couple of segmentally
realized sarcastive modality markers which are current in colloquial Ameri
can English (current, at least, as I write these words):
1.1 Like
In the examples of (1), the whole utterance following like (or its predictable
synonym as if) is in the scope of this marker. The hearer is warned that what
follows like is what the speaker may be expected to believe, but does not.
(1) a. Like I care. (= I don't care.)
b. Like there's any difference. (=There's no difference.)
c. Like I haven't heard that one before. (=I've heard that one
before.)
d. As if TV advertising weren 't intrusive enough already. (= TV
advertising is too intrusive already.)
Possible equivalents with invariably sarcastic meaning in French and
Russian are si tu crois que "if you think that" and mozhno podumat' "(it is)
possible to think":
Moods and MetaMessages: Alienation as a Mood 331
1.2 ....not
This utterance deflater signals that the preceding utterance (often a compli
ment) was meant facetiously. Until it is uttered, the hearer may believe that
the prior utterance was sincere.
(3) a. He's a snappy dresser.....not!
b. Guess I'll hit the books in time for that quiz not!
c. Real Life with Jane Pauley... not.
à. Bush will kill that program. Not.
e. That's a fabulous science fair project. Not!
According to Larry Horn (e-mail circular of December 21, 1991), this
prototypically generation X (Wayne's World) locution surfaced not on the TV
show Saturday Night Live about three years ago (as is commonly supposed),
nor even (pace William Safire) ten years before that, in 1978, when Steve
Martin, on the same show, uttered (3e), but at least a couple of generations
before then. Horn credits his correspondent Richard Piepenbrock with un
earthing a Little Nemo cartoon of 1910 in which this locution appears (McCay
1990), and notes, I think quite correctly, that it is highly unlikely to have been
an innovation there. Clearly, this totally motivated construction has come
around often in American speech — without ever becoming firmly estab
lished. A British English equivalent with a longer history is — I don't think,
now almost entirely obsolete (Suzanne Kemmer, pc).
Sarcastive retroactive ..not is clearly an example of what Horn (1985) has
called metalinguistic negation, and it differs from garden-variety epistemic or
deontic negation only in position. It is interesting that the formal relationship
between de dicto negation and the ordinary de re negation is parallel to the
difference that Horn extensively discusses in cases like
(4) a. I'm not happy: in fact, I'm downright miserable.
b. I'm not "happy" : I'm ecstatic.
The latter is metalinguistic. And it is the metalinguistic not which cannot
appear as a derivational prefix:
332 John Halman
1.3 or anything
The clause introducer not that is generally used to signal that the speaker's
interest in what follows is minimal:
(6) a. Not that I value my sanity so highly.
b. Not that I care about the money.
The motivation is clear. A reconstituted "full" paraphrase would be something
like:
(7) a. It's not that I value my sanity so highly. (What terrifies me are
screams of my fellow lunatics, the curses of my keepers, and
the clanging of the chains.) (with apologies to Pushkin)
b. It's not that I care about the money. (What I do care about is
the principle of the thing.)
Like many other things, however, a meaning can be mocked through ostensi
ble exaggeration or emphasis. (It seems to me that this is the essence of all
caricature.) When strengthened by the tag or anything, the belittling constitu
ent not that is apparently strengthened: but in practice, the speaker's ostensi
ble act of trivialization is mocked.
(8) a. Not that I care about the money or anything. (=1 care passion
ately about the money.)
b. Not that it's cold out or anything. (= It's real cold out.)
c. Not that you annoyed me or anything. (=You annoyed the hell
out of me.)
But for the fact I have a teenager in the family, I wouldn't have heard this
marker.
Moods and MetaMessages: Alienation as a Mood 333
1.4 Quote
Quote is the only sarcastive morpheme which has lasted for any continuous
length of time in English. It is also the only one which has grammaticalized
congeners that I have been able to find in other languages.
The quotative is an evidential category in a large number of the world's
languages (cf. Chafe & Nichols, 1986). In at least two of these, its meaning
has been extended to convey irony as it does in English. Thus, Turkish:
(10) Her gün ko§- uyor- mu§
every day run PROG evid 3SG
'He reportedly runs every say'
"can convey not only hearsay, but doubting scorn when predicated of a well-
known exercise hater" (Aksu-Koç & Slobin 1986:162; the authors also cite
Kononov (1956:232) who notes that the evidential in Turkish conveys "an
ironical attitude toward the carrying out of an action").
Similarly, the Albanian admirative, which is 'traditionally
defined as the mood expressing surprise' is 'also used to express irony, doubt,
and reportedness' (Friedman 1986:180):
2. The guiltive
In the following pages, I will discuss a more elusive (though instantly recog
nizable) related propositional attitude which I'll call the guiltive, for which
not even an adequate pretheoretical folk vocabulary exists. Consider the fol
lowing three examples from Philip Roth's Portnoy's Complaint:
Moods and MetaMessages: Alienation as a Mood 335
(14) "Don't ask what kind of day I had with him yesterday. " So I don't.
"Alex", sotto voce still, "when he has a day like that you don't
know what a difference a call from you would make." I nod. "And,
Alex" — and I'm nodding away, you know — it doesn't cost
anything, and it may even get me through— "Next week is his
birthday.
That Mother's Day came and went without a card, plus my
birthday, those things don't bother me." (39)
(15) ...my father carries himself to the kitchen table, his head sunk low,
as though he has just taken a hand grenade in his stomach. Which
he has. Which I know.
"You can wear rags for all I care, you can dress like a peddler,
you can shame and embarrass me all you want, curse me,
Alexander, defy me, hit me, hate me —" (70)
(16) Yes, she will give me the food out of her mouth, that's a proven
fact! And still I will not stay five full minutes by her bedside.
"Run ", says my mother, while Mrs. Re-ver-ed, who in no time at all
has managed to make herself my enemy, and for the rest of my life,
Mrs. Re-ver-ed says, "Soon Mother will be home, soon everything
will be just like ordinary...
"Sure, run, run, th ey all run these days," says the kind and
understanding lady — oh, they are all so kind and understanding, I
want to strangle them! — "walking they never heard of, God
bless them." (75)
rhetorical questions and commands (e.g. "Don't ask me what kind of day I had
with him yesterday"), and a small repertoire of the indices of mimicry,
including most obviously (written, gestured, or pause-marked) quotation
marks. Colloquial English includes as well a handful of segmental signs. All
of these signs more or less iconically convey the framing metamessage This
message is counterfactual' or 'I don't mean this', whence the inference
follows that 'this message is bogus'.
The guiltive speech act is entirely different, in that, in order to properly
trash her interlocutor, the guilter (typically but not always the stereotyped
Jewish mother) has to sound perfectly sincere (and hence cannot be heard to
broadcast the metamessage 'I don't mean this', which must therefore be
covert). This suggests a kinship between the guiltive and polite language, in
which the speaker also suppresses his/her own emotions, is known to be
suppressing them, and still sounds "sincere".
Nevertheless, it is instantly obvious that the bolded passages from
Portnoy's Complaint above are in the guiltive mood, rather than polite: while
the illocutionary force of politeness is (presumably) to avoid aggression, the
illocutionary force of guiltive utterances is clearly a kind of passive-aggres
sive one-upsmanship whose purpose is to make the hearer feel bad. How is
this clear? How do the passages above do this work, and how do they contrast
with the same passages uttered with genuine politeness or with genuine
sincerity, assuming that such a sincerity is even possible?
Some obvious suggestions that Dear Abby might provide for a non-guilt-
producing illocution include the following:
(14') Don't exaggerate your selflessness. ( If you want to be sincere, say:
" Of course it bothered me that you forgot Mother's Day and my
birthday. " If you want to be polite, don't mention the incident at
all.)
(15') Don't exaggerate your pain. (If you want to be sincere, say: "It
really hurts my feelings that you won't dress up for synagogue. " If
you want to be polite, don't show you 're upset at all.)
(16') Don't exaggerate your forbearance. (Sincerity might impel you to
say: "You wretched little beast, running off to play baseball!"
Politeness, once again, would impel you to keep quiet.)
What's common to all of the ostensibly benign messages from Portnoy's
elders is that the speaker's ludicrously exaggerated selflessness, hurt, and
Moods and MetaMessages: Alienation as a Mood 337
generosity, all express the explicit meta-message that "I the utterer of this
message am a saint/martyr".
This metamessage, then, is not so much a commentary on the message
itself, as on the nature of the person who can utter it sincerely. Implicit for
Portnoy to elucidate are the derived metamessage "this message is therefore
objectively bogus", and the accompanying inference that "you, Portnoy, are a
heartless swine".
3. The mass-productive
I have already suggested that emphasis may have contradictory effects: osten
sibly it increases the import of a message, while in practice it undermines it.
The same is true of repetition, the simplest kind of emphasis imaginable.
Paradoxically, replication (both of objects and of signs) validates in some
cases, invalidates in others: see for example Moravcsik's exemplary cross-
linguistic discussion of reduplication (Moravcsik 1978).
3.1.1 The "big lie" repeated is believed. This much is a truism, but it entails
an equally plausible corollary: not even the truth is believed if uttered only
once in the context of a culture of incessant repetition. If, on TV, we saw
George Bush streaking through Washington just once, and the clip were never
repeated, we would soon tend to think we had only imagined it. Incessant
repetition breeds a kind of if-it's-true-Fll-get-a-chance-to-see-it-again-later
philosophy. In other words: replication is a prerequisite for truth.
3.1.2. Replication produces wealth, visibility, and glamour for mass media
stars. It confers "seriousness" on political candidates. The validating power of
the prototypical replicating medium, TV, is so much a given, that the humour
of the Calvin and Hobbes cartoon included here as Figure 1 lies not in the
recognition that "TV validates existence" ( a point for me as a plodding
linguist, but a self-evident cliché for the cartoonist) but in the pomposity of
the kid's diction, and in the irony that Calvin and Hobbes have themselves
achieved the status of "cultural icons" precisely through the replicating me
dium of the comics.
338 John Haiman
Figure 1
History repeats itself: the first time as tragedy, the second (and all subsequent)
times as farce.
Campbell Soup Can laid up in Heaven. Each of the glamorous zombies in the
videos of Robert Palmer is conceptually and perceptually identical with all the
others.
Figure 2
3.3.2 Grammaticalization
The form of an utterance is also affected by these three variables. When both
form and meaning are affected by repetition, grammaticalization has oc
curred. The most familiar semantic aspect of grammaticalization is the ero
sion of meaning. But repetition may flat-out destroy meaning:
Moods and MetaMessages: Alienation as a Mood 341
This brings me finally to the point of this section. If the utterance of a repeated
expression may express the speaker's lack of commitment to it, then whatever
formally marks an utterance as a repetition can count as a sort of mood
marker. Given how often what we say is recycled, it is amazing how undevel
oped are our means for labelling our clichés or standard routines as such,
thereby safely distancing ourselves from them (if that is what we want to do).
For example, it is entirely clear what the quotation marks are signalling in the
342 John Haiman
Figure 3
The Holy Roman Empire, as everyone knows, was neither holy, nor Roman,
nor an Empire. Although many proper names originate as definite descrip
tions, once congealed as proper names, they are no longer required nor
expected to mean what they say. Perhaps this could be viewed as a special
case of grammaticalization. In any case, the device is self-consciously used as
an artful disclaimer of commitment in at least one recent case of false
advertising, Ragu foods vs. the Food and Drug Administration (Advertising
Age, May 28, 1990). The company marketed "Ragu fresh Italian pasta
sauces", and the FDA took exception to the word "fresh", which was inappro
priate for processed tomatoes. After a nine-month suit, the company capitu
lated: they announced they were marketing "Ragu fresh Italian brand pasta
sauces". The term "fresh" is now "part of the product's brand name, and not a
product description".
Arguably, this example is one of a purely legalistic trick (and one that
actually failed, as Larry Horn has informed me, his best efforts as hired gun
344 John Haiman
Conclusion
My major question in this discussion is: why have virtually none of these
devices for signalling lack of speaker's commitment achieved the grammati
cal status of the subjunctive or the interrogative? (The only ones that come
close are quote and stylized intonation, both of which have other, more central
functions.)
I suspect that this has not happened because the indices of sarcasm, like
those of anger and other personal emotions, belong to the same realm: they are
not really signs but symptoms of these emotions. Raising one's voice is
probably a symptom of anger in whatever language that one chooses to speak:
nevertheless, no language has grammaticalized increased amplitude as a sign
of anger. Why? Maybe because it's too universal, too natural, and too self-
evident. Signs don't get codified until they're at least a little bit arbitrary.
And maybe sarcasm has failed as yet to get grammaticalized (this sounds
paradoxical, speaking of sarcasm), because — being a symptom — it's too
sincere. I regard it as an essential defining property of a sign that we can
choose to use it, and thus be able to use it insincerely. We have at our disposal
a number of metalinguistic devices for signalling insincerity of the first order:
"this message is bogus". But perhaps as humans at the present stage of our
evolution we are limited by a purely performance-like constraint (similar to
that on center-embedding) which inhibits us from signalling insincerity of the
Moods and MetaMessages: Alienation as a Mood 345
ACKNOWLEDGEMENTS
I'am very grateful to Larry Horn, Suzanne Fleischman and Michele Emanatian
for their comments and suggestions on this paper.
REFERENCES
Wallace Chafe
University of California, Santa Barbara
0. Introduction
There are many ways in which language sheds light on the nature of the mind,
and conversely ways in which an understanding of mental phenomena can
further our understanding of language. The realis-irrealis distinction illus
trates the usefulness of combining linguistic and cognitive insights in ways
that will contribute to an improved understanding of both. In Chafe (1994) I
explored various paths by which ideas enter consciousness, among them the
direct perception of current states and events, the remembering of previously
experienced states and events, and the imagining of states and events that are
judged not to accord with current objective reality. Of interest here is the
evidence languages provide for a cognitive distinction between what may be
called realis and irrealis ideas, and how recognizing such a distinction can
help us understand what we find in languages.
I will begin with a discussion of the Caddo language, which obligatorily
expresses reality or irreality in all its verbs. Caddo also suggests one of many
diachronic paths by which the distinction may come to be grammaticized, and
helps us understand how, in the course of such a development, there may
develop inconsistencies. I will then turn to the Northern Iroquoian languages,
whose morphology reflects a three-way reality distinction. Finally, I will
mention the fact that English, which does not mark reality overtly, neverthe
less shows its influence indirectly in the semantics of specificity.
350 Wallace Chafe
language and having some kind of connection with unreality. Palmer (1986)
recommends that this term should be avoided in linguistic theory on the
ground that it corresponds to no consistent linguistic content."1 Bybee,
Perkins and Pagliuca (1994) express similar qualms. These reservations are
understandable, given the diverse and often inconsistent ways in which reality
and irreality manifest themselves. As we will see, however, it is fair to
imagine that if linguistics in its present guise had been developed by the
Caddo and not by speakers of the languages of Europe, the reality distinction
would be regarded as having a central importance to language and thought.
Bybee, Perkins, and Pagliuca specifically question the functional basis of the
distinction, they are troubled by its seemingly heterogeneous manifestations
in languages, and they find it unhelpful in understanding diachronic develop
ments. Rather than responding immediately to each of these areas of doubt,
largely repeating points made in Givôn (1994), I will proceed to a description
of the manifestations of reality in Caddo, the Northern Iroquoian languages,
and English, returning in the end to the general issue of whether and how
reality is relevant to an understanding of language and the mind.
1. Caddo
The Caddo people once lived in a large number of towns that were spread over
what is now eastern Texas, northern Louisiana, and southern Arkansas. Euro
pean encroachment into that area, which began with the De Soto expedition in
1541, was devastating, and after a series of tragedies and removals, the few
remaining Caddo settled in southwestern Oklahoma in the nineteenth century.
In spite of four and a half centuries of subservience to other languages, the
Caddo language is still spoken, although only by a rapidly diminishing
number of elderly people. I would hope that one of the lessons of the
following discussion would be the urgent need to learn as much as we can
from such rapidly dying languages during the few years in which they will
still be with us.
Caddo is a prototypically polysynthetic language, and one of the promi
nent features of its verbs is the inclusion of a prefix that refers to one or more
of the participants in an event or state. Such a "pronominal prefix" distin
guishes person, case, and — of most interest here — reality. (Number is marked
352 Wallace Chafe
elsewhere in the verb.) Persons include 1st, 2nd, 3rd, and, in addition, what I
have called a Defocusing person which functions rather like German man or
French on (Chafe 1992). Case includes Agent, Patient, and Beneficiary.
Reality includes Realis and Irrealis. The reality distinction is an obligatory,
clearly marked, and unambiguous feature of every pronominal prefix (with
one minor exception), and thus of every verb. Realis and Irrealis can be
thought of as contrasting semantic components that combine with the four
persons and the three cases in a three-dimensional array that establishes the
semantics of the pronominal prefix system.
The Realis and Irrealis prefixes have the following underlying forms:
Realis Prefixes:
Agent Patient Beneficiary
1st person ci- ku- ku-
2nd person yah?- si- si-
3rd person — — nu-
Defocusing yi- ya- yu-
Irrealis Prefixes:
Agent Patient Beneficiary
1st person t'a-lt'i- ba- ba-
2nd person sah?- sa?a- sa?u-
3rd person sa- sa- ?u-
? ?
Defocusing ?a- ?a?a- a u-
Several points are worth noting. It can be seen that 3rd person partici
pants are not always marked. Specifically, they are missing for Realis Agents
and Patients, which are actually the most frequently occurring of all 3rd
person referents. They are marked, however, when they function as Realis
Beneficiaries or when they are Irrealis, regardless of case. Thus, even though
there is not always an overt 3rd person marker, it is always clear whether a 3rd
person verb is Realis or Irrealis. It can also be seen that the Realis and Irrealis
forms have virtually nothing in common. Realis ci- contrasts with Irrealis fa-
ox t'i-, Realis ku- with Irrealis ba-, and so on. There is thus no reason to think
that the Irrealis forms had their origins in modifications of the Realis forms.
The historical origins of the two sets appear to be distinct.
In addition to the prefixes listed above, there are also various combina
tions of 1st and 2nd person Agents with 1st and 2nd person Patients and
Beneficiaries:
The Realis-Irrealis Distinction 353
Realis Prefixes:
1st person Agent with 2nd person Patient t'a-
lst person Agent with 2nd person Beneficiary t'u-
2nd person Agent with 1st person Patient yahku-
2nd person Agent with 1st person Beneficiary yahku-
Irrealis Prefixes:
1st person Agent with 2nd person Patient fa
lsi person Agent with 2nd person Beneficiary t'a?u-
2nd person Agent with 1st person Patient sahku-
2nd person Agent with 1st person Beneficiary sahku-
Here it can be seen that the combination of a 1st person Agent with a 2nd
person Patient is the only environment in which the Realis-Irrealis distinction
is neutralized. The fact that this single form, t'a-, is also homophonous with
one form of the Irrealis 1st person Agent prefix (see above) appears to be only
a coincidence.
While person and case are properties of the referent expressed by the
pronominal prefix, the domain of reality is not the referent, but rather the state
or event expressed by the entire clause (a clause being typically coextensive
with a verb). One may wonder why the expression of a clause-level meaning
should ride along on a pronominal prefix, but we may never know the
circumstances under which the language developed this way of doing things.
There are a variety of contexts that condition the use of the Irrealis
prefixes. All but one of them involves the simultaneous presence of a
so-called prepronominal prefix: a verb-initial prefix whose function is com
patible with irreality. The fact that these prepronominal prefixes participate
less than fully in the complex phonology of the verb suggests that some if not
all of them were, at an earlier stage of the language, separate particles, and that
they have become attached to verbs more recently.
1.2 Negations.
An obvious site for irreality is negation (but see Mithun in press). Simple
negations in Caddo are expressed with the Negative prepronominal prefix
kúy- 'not':
The Realis-Irrealis Distinction 355
1.3 Prohibitions.
1.4 Obligations.
1.5 Conditions.
Conditions are also Irrealis. There are two prepronominal prefixes that ex
press conditionality. One of them is the Particular Conditional prefix hí- 'if',
indicating the possibility of a particular event:
(12) hít' áybah
hí-t' a-yi=bahw
CONDITIONAL-1ST. AGENT. IRREALIS-See
'if I see it'
The other is the Generic Conditional prefix nas- 'if, whenever', used with
generic events:
(13) nast' áybah
nas-t' a-yi=bahw
GENERIC.CONDITIONAL-1ST. AGENT.IRREALIS-see
'if, whenever I see it'
The Realis-Irrealis Distinction 357
There are several other prepronominal prefixes that condition Irrealis pro
nominal prefixes. One of them is the Simulative prefix dúy- 'as if:
(16) dúyt'áybah
dúy-t'a-yi=bahw
SIMULATIVE-1ST. AGENT.IRREALIS-See
But the exemption of Caddo imperatives and futures from the irrealis
category could have a different explanation. It is likely that ways of express
ing imperatives and futures were established in the language well before the
negative and other particles came to be attached to what were originally
interrogative verbs, as suggested above. Imperatives and futures, that is, may
stem from a more ancient layer of Caddo morphology. Already entrenched in
the language, they would then have failed to participate in the more recent
grammaticization of irreality in the pronominal prefixes. It is difficult to
decide which of these explanations is correct (perhaps both are), but either is
plausible, and either serves to demonstrate how irreality (like other semantic
functions) may come through time to be represented in ways that may appear
synchronically to be inconsistent.
Spoken in New York, Ontario, and Quebec, the languages that constitute the
northern branch of the Iroquoian family include Mohawk, Oneida, Onondaga,
Cayuga, Seneca, and Tuscarora, as well as other languages now extinct. All of
these languages behave in a similar fashion with respect to the matters to be
discussed here, and I will discuss them as a single entity.
When Northern Iroquoian verbs are inflected for the Punctual (perfec
tive) aspect, they obligatorily contain one of a set of three prefixes that
segment a continuum of reality. The occurrence of these prefixes with other
aspects — the Stative (perfect) and Habitual (both imperfective and generic)
— is optional, and is marked with a special added suffix. At one end of this
reality continuum is the Factual prefix (reconstructable in most environments
as *wa?-), expressing the speaker's belief in the reality of the event or state. At
the other end is the Optative prefix (reconstructable in most environments as
*aa-), expressing either obligation or possibility. In some contexts the Opta
tive is best translated 'should', in others 'might'. Intermediate between these
extremes lies the Future prefix (reconstructable as *ę-), used for a state or
event the speaker expects to materialize but is not certain of. Verbs with the
Future prefix are also often used as imperatives, a fact which suggests that the
exemption of both imperatives and futures from irrealis marking in Caddo, as
discussed above, may indeed have a functional basis. In short, whereas the
360 Wallace Chafe
Caddo system divides this dimension simply into Realis and Irrealis, the
Northern Iroquoian system shows a three-way division. The examples below
are taken from the Seneca language:
(22) o?geegę?
wa?-ke-kę-?
FACTUAL-1ST. AGENT-See-PUNCTUAL
'I see/saw it' (direct perception or memory)
(23) ęgeegę?
ę-ke-kę-?
FUTURE-1ST. AGENT-See-PUNCTUAL
'I'll see it' (prediction)
(24) aageegę?
aa-ke-kę-?
OPTATIVE-1ST. AGENT-See-PUNCTUAL
'I should/might see it' (obligation or possibility)
This Northern Iroquoian continuum interacts with yes-no questions,
negations, and other irrealis contexts in complex ways. I will limit the discus
sion here to questions and negations. Yes-no questions are expressed with
both an intonation contour and a phrase-final particle in most of the lan
guages, but in Seneca with intonation alone. Such questions are expressed in
ways that are independent of the reality dimension. That is, they may be
marked as Factual, Future, or Optative, or (with aspects other than the Punc
tual) they may have no reality marking at all.
The difference between Caddo, where all yes-no questions are Irrealis,
and Northern Iroquoian, where such questions may show any degree of reality
or none at all, also invites two possible explanations. One might suppose that
interrogation lies outside the scope of reality, so that, for example:
(25) ęhseegę?
ę-hse-kę-?
FUTURE-2ND.AGENT-see-PUNCTUAL
'will you see it?'
with a yes-no question intonation has a meaning that could be paraphrased "Is
it or is it not the case that you will see it?" whereas the corresponding Caddo
word:
The Realis-Irrealis Distinction 361
(26) sàyybáw?a?
sah?-yi=bakw-?a?
2ND.AGENT.IRREALIS-see-FUTURE
'will you see it?'
might be paraphrased "Will you see it or won't you see it?"
An alternative possibility, not inconsistent with the first, is that Northern
Iroquoian questions are actually requests for confirmation. Thus, (25) might
correspond more closely to English "You'll see it?" with a question intona
tion, a translation that mirrors the Seneca surface form (regarding confirma
tive questions in English see Chafe 1970a:333-337 and, in Onondaga, Chafe
1970b:24). A confirmative question allows any degree of reality in the propo
sition for which confirmation is requested. This explanation can be combined
with that in terms of scope, in the sense that confirmation lies outside the
degree of reality. For present purposes it is enough to point out that the
different treatment of yes-no questions by Caddo and the Northern Iroquoian
languages can have a functional motivation.
Negation presents a different picture. A detailed discussion of negative
forms and their cooccurrences in the Northern Iroquoian languages would
take us too far afield, but we can note that the Negative prefix in these
languages (allowing for a few special exceptions) does not cooccur with any
of the reality prefixes. Negative past meanings are expressed with a Stative
(perfect) form of the verb, which does not require a reality prefix:
(27) da?ageegęęh
te?-wake-kę-ęh
NEGATIVE-1ST.PATIENT-see-STATIVE
'I haven't seen it'
Negative Futures are expressed with the so-called contrastive prefix (which
involves an element of surprise), together with the Optative prefix and a verb
that ends in the same way as an imperative (Chafe 1967: 32):
(28) thaageegęh
th-aa-ke-kç-h
CONTRASTIVE-OPTATIVE-1ST. AGENT-see-IMPERATIVE
'I won't see it'
362 Wallace Chafe
3. English
Imagine, for example, that you and I are walking past an airport bookstore and
I say, "I need to buy a book." With the specific meaning, I might continue "It's
one I read a review of yesterday". With the nonspecific meaning, I might go
on to say "So I'll have something to read on the plane." It appears that in
irrealis contexts English allows and even encourages the interpretation of a
direct object referent as nonspecific. The contexts in (31) are just those that
are overtly marked as Irrealis in Caddo, except for the fact that they include
Future, thus confirming the natural inclusion of future in the Irrealis category.
It is possible that this relation between irreality and nonspecificity of refer
ence is widespread if not universal, thus confirming the widespread or univer
sal relevance of the irrealis category. But that remains to be seen.
4. Conclusion
The realis-irrealis distinction reflects judgments that certain ideas stem from
direct perception, memory, or expectations of what is normal, while others
have their source in imagining. This distinction can be thought of as a covert
semantic pressure that emerges in different languages in different ways. It
emerges in Caddo in a clear and ubiquitous formal distinction, every verb
being marked for either reality or irreality. The Caddo assignment of specific
functions to these two categories is motivated and coherent, with yes-no
questions, negations of various sorts, prohibitions, obligations, conditions,
simulations, and surprises in the irrealis category. Neither futures nor impera
tives are marked as Irrealis, however, either because Caddo draws the line at
this point in the realis-irrealis continuum, and/or because ways of expressing
futures and imperatives were morphologized in the language prior to the
marking of irreality.
The distinction emerges differently in the Northern Iroquoian languages,
which show a three-way rather than a binary contrast with perceived and
remembered reality at one extreme, obligation and possibility at the other, and
between them the future (with an imperative use as well). Unlike Caddo, the
Northern Iroquoian languages do not treat yes-no questions as irrealis, per
haps because the scope of questioning is external to reality, perhaps also
because Iroquoian questions are more confirmative in nature than the more
clearly disjunctive questions of Caddo. The Negative and Factual prefixes are,
as might be expected, incompatible in the Northern Iroquoian languages, and
negative futurity is expressed with a grammaticized combination of surprise
and possibility, along with an imperative ending. The three-way reality dis-
364 Wallace Chafe
tinction is obligatory with perfective verbs, but it is optional and marked with
perfects, imperfectives, and generics.
Reality fails to be overtly marked in English, but its force is nevertheless
felt through the irrealis creation of an environment in which referents may be
interpreted as nonspecific. The contexts that produce this effect line up well
with the Caddo Irrealis contexts, except for the inclusion of future.
I hope to have shown (1) that the realis-irrealis dimension has a consist
ent functional basis in people's judgments concerning the degree to which
their ideas accord with what they believe to be objective reality; (2) that the
apparent inconsistency with which such judgments are manifested in different
languages can be explained sometimes in terms of a gradient rather than a
binary dimension of reality, sometimes in terms of layers of morphologiza-
tion, and sometimes in terms of scope; and (3) that recognition of the pres
sures exerted by the reality dimension is ultimately essential to an
understanding of certain aspects of language change.
We are left with the question of the ontological status of the reality
dimension as it relates to the several moods and modalities. The extent to
which ideas are judged to accord with objective reality leads to a categoriza
tion of specific moods and modalities, in combination with specific tenses and
aspects, into two or three overarching categories. The categories themselves
may surface as realis and irrealis semantic components of morphemes as in
Caddo, as complete morphemes as in the Northern Iroquoian languages, or as
indirect influences on the specific-nonspecific referential distinction as in
English. Realis and irrealis can thus be seen as comparable to broader catego
ries such as tense or aspect, or for that matter mood and modality, within each
of which several values are available. They are most closely related to moods
and modalities in the sense that they reflect a speaker's attitude toward the
idea being expressed, but their effect is to combine the more specific moods
and modalities into larger groupings.
FOOTNOTES
1 Curiously, a search of Palmer's book does not reveal any such recommendation.
2 In the first, and sometimes the second lines the acute accent mark indicates a high
pitched vowel, the grave accent a pitch that falls from high to low. Long consonants and
vowels are shown with doubled letters. Caddo stems usually consist of a lexicalized
combination of two or more historically distinct morphemes, which are joined in the
The Realis-Irrealis Distinction 365
second line with an equals sign. The separate meaning of each of these stem-internal
morphemes is often obscure, and here I have simply given an English gloss for the stem
as a whole.
REFERENCES
Bybee, Joan, Revere Perkins & William Pagliuca. 1994. The Evolution of Grammar:
Tense, Aspect, and Modality in the Languages of the World. Chicago: University of
Chicago Press.
Chafe, Wallace. 1967. Seneca Morphology and Dictionary. (Smithsonian Contributions
to Anthropology, Volume 4.) Washington, DC: Smithsonian Press.
Chafe, Wallace. 1970a. Meaning and the Structure of Language. Chicago: University of
Chicago Press.
Chafe, Wallace. 1970b. A Semantically Based Sketch of Onondaga. Indiana University
Publications in Anthropology and Linguistics, Memoir 25.
Chafe, Wallace. 1992. "Uses of the Defocusing Pronominal Prefixes in Caddo." Anthro
pological Linguistics 32:57-68.
Chafe, Wallace. 1994. Discourse, Consciousness, and Time: The Flow and Displacement
of Conscious Experience in Speaking and Writing. Chicago: University of Chicago
Press.
Givón, Talmy. 1989. Mind, Code and Context: Essays in Pragmatics. Hillsdale, NJ:
Lawrence Erlbaum.
Givón, Talmy. 1994. "Irrealis and the Subjunctive." Studies in Language 18:265-337.
Jackendoff, Ray S. 1972. Semantic Interpretation in Generative Grammar. Cambridge,
MA: The MIT Press.
Mithun, Marianne. "On the Relativity of Irreality." This volume.
Palmer, Frank R. 1986. Mood and Modality. Cambridge: Cambridge University Press.
Roberts, John R. Ms. The Category 'Irrealis' in Papuan Medial Verbs.
On the Relativity of Irreality
Marianne Mithun
University of California, Santa Barbara
0. Introduction
1. Central Porno
The markers identifying Irrealis dependent clauses are -hi and =hla. The first,
-hi, links clauses expressing what are portrayed as components of the same
event.
(2) Irrealis same event linker: FJ
Mé-n=?ti mí= lálil čá-n-ma-hi kay
SO=but that=at b a c k run-IMPFV-COOPERATION-SAME.IRREALIS tOO
'But we could drive back there and
ya mu-l da ?=mí= =li hlá-qač=?le.
1.AGENT that road cop=that=at=with go.PL-up-=coNDITIONAL
go up that road.'
The second Irrealis marker, =hla, links clauses expressing what are portrayed
as different events.
(3) Irrealis different event linker: FP
Ma čalél qów=?-ne=hla
2.AGENT just out=by.gravity-set=DIFF.IRREALIS
'If you just threw it out
't- mu-l bá-č-i?le.
feel that big-INCHOATIVE.PFV=CONDmONAL
I guess it would grow.'
The clauses bearing the linkers are grammatically dependent, since they
cannot stand alone as independent sentences, but they are not necessarily
subordinate semantically. In (2), for example, 'We could go back up there and
370 Marianne Mithun
drive up that road', the Irrealis marker links two semantically and pragmati
cally equivalent constituents of a larger sentence expressing root possibility.
(The free translations were provided by Mrs. Jack, who was present at all
conversations and aware of both the linguistic and extra-linguistic context.)
Irrealis linkers appear in a wide variety of constructions that express events
portrayed as non-actualized.
(4) Irrealis linker in counterfactual construction: KD
Me-n mí-hla mu.l
so say=DIFF.IRREALIS that
'If she'd said that
?a- ?čhá-=ne ?
a.
1 .AGENT Sit=CONDITIONAL 1 .AGENT
I'd stay longer.' [But she said ...]
(5) Irrealis linker in conditional construction: FJ
Wá-q-hi ?e
gO-level-SAME.IRREALIS COP
'If I go
ló'-h-du-w-?khe.
help-IMPFV-IMPFV-PFV=FUTURE
I'll be helping out.'
(6) Irrealis linker in deontic construction: FJ
?
čá-w-htow é y=yo-hi
house-LOC=from away=go-SAME.iRREALis
'He should leave home and
táwhal da-čé = ?le.
work handling-catch=CONDITIONAL
get a job.'
(7) Irrealis linker in future construction: FJ
Té-nta=lil wá-n-hi
town=to go-IMPFV-SAME.IRREALIS
'I'll bring it back
?
a qó=be-w=?khe.
1.AGENT toward=carry-PFV=FUTURE
on my way to town.' ('I'll go to town and bring it back.')
On the Relativity of Irreality 371
There are two pair of Realis dependent markers. They link past or present
events. (There are no past or present tense markers.) One pair, -in and -da
'while', link simultaneous actions or states. The actions or states need not be
simultaneous for their full duration, but simply overlapping at some point.
The first marker, -(i)n, links what are portrayed as components of the same
event.
(10) Simultaneous Realis, same event: EO
?a- Edna-to cá=l yó-h-du-n.
1 .AGENT =PATIENT house=to go-PFV-IMPFV-SAME.SIM-REALIS
T go to Edna's house and
híntil=?el ča-nó-'d-an-ya mú-tu.
Indian=the talk-IMPFV-IMPFV=EVID 3.PATIENT
talk Indian to her.'
The other, -da, links what are portrayed as distinct events.
(11) Simultaneous Realis, different events: SA
?
Mu-l oč qášóy-da
that still alive=DIFF.SIM.REALIS
'While he was still alive,
372 Marianne Mithun
Multiple markers can appear within a sentence. In (15), 'grab' and 'pull
out' are joined by -ba, both of which are in turn linked by -da to the verb 'cut'.
(15) Central Porno linked dependent clauses: SA
thaná=?el da-cé-ba
hand=the handling-seize-SAME.SEQ.REALis
'When they grabbed his hand and
s-thí-c-na-w=da,
drawing-Open-SEMELFACTIVE-AWAY-PFV=DIFF.SEQ.REALIS
pulled it (a knife) out,
Pun mu-l ča-qhá-č-ka-m=ma.
self that Sawing-CUt-RFL-CAUS-PFV=FACTUAL
he cut himself.'
The negative statement about the past in (21) contains a Realis linker, just like
positive statements about past events.
(21) Realis negative past: FP
Ranch =?el qdí yhé-t-ač čhó-w ?
í-n
= t h e g o o d do-M.E-IMPFV.PL not-PFV be-SAME.SIM.REALIS
'Because they didn't keep up the ranchería,
ya-l qo=-l mčá-w dá- ?-či-w.
1 .PL.PATIENT out=to throw.PL-PFV WANT-RFL-IMPFV.PL-PFV
they wanted to throw us out.'
The Central Porno Irrealis/Realis distinction is strikingly similar to those
in a large number of other unrelated languages. In the Papuan language
Amele, for example, described in Roberts (1990), dependent clauses are also
identified by markers that carry an Irrealis/Realis distinction. Counterfactual,
conditional, obligatory, future, hortatory, and imperative constructions appear
with Irrealis markers, essentially the same construction types as in Central
Porno. Past, present, and habitual actions are linked with Realis markers. The
Realis markers further distinguish simultaneous and sequential events. Ques
tions and negatives show the same Irrealis/Realis classification as would their
positive declarative counterparts. It is not difficult to discern a motivation
behind the choice of terms 'Irrealis' and 'Realis' for this distinction. State
ments categorized as Irrealis portray events as still within the realm of thought
alone, while those categorized as Realis are portrayed as actualized, actually
occurring or having occurred.
Categories identified as Trrealis' and 'Realis' are not uniform across all
languages, however. While many languages show patterns strikingly similar
to those in Central Porno and Amele, some show minor differences, and a few
show substantial ones. The variation suggests that either the distinction under
lying the categories is not in fact comparable across languages, or that the
distinction is constant, but its application differs. If the basic distinction
varies, then the common terminology could be misleading. If the distinction is
the same, and differences are confined to its application, the variation might
be explicable in terms of the diachronic developments of grammatical catego
ries in individual languages.
376 Marianne Mithun
2.1 Imperatives
Polite Imperative, containing the basic second person Subject Prefix and an
Irrealis Suffix, and a Basic Imperative, containing a special Second Person
imperative Prefix and no Irrealis Suffix.
(25) Jamul Diegueño Imperatives (Miller 1990:119),
a. nya-m-mápa-pu m-rar-x-s
INDEF-2-NOM.Want-DEM 2-do-IRREALIS-EMPH
'Do whatever you want (polite imperative)'
b. k-naw
2-run
'Run! (basic imperative)'
Roberts (1990:390) notes that Alamblak, an East Sepik language of Papua
New Guinea, shows a similar choice. Polite Imperatives are categorized as
Irrealis, and strong imperatives as Realis. Roberts points out the parallel with
English commands: Would you be seated versus Sit down!. The presence of
both Irrealis and Realis imperatives in these languages shows that both
options are semantically possible.
2.2 Futures
hlá-?w-ač=khe.
walk.PL-around-IMPFV.PL=FUTURE
eating.'
On another occasion, Mrs. Jack related what her mother had told her as a
child.
(27) Central Porno Realis Future: FJ
?
sé- ul ma,
long.time already 2.AGENT
Tn the future,
yém-aq-da,
old-INCHOATIVE.PFV=DIFF.SIM.REALIS
When asked about the use of =da in this context, she replied, 'With =da he'll
get home for sure.' The Irrealis counterpart =hla would suggest some uncer
tainty. The Realis futures cited in (26) and (27) above were used for events the
speaker portrayed as certain to occur: eating at the picnic, getting older, and
dying.
The Central Porno option of categorizing futures as either Irrealis (fre
quent) or Realis (rarer) shows that both are semantically coherent combina
tions. A system in which futures were systematically categorized as Realis
could conceivably arise because the Irrealis form was so highly marked
semantically that speakers used it sparingly with futures. Such a scenario is
certainly not the only way in which such a pattern could develop. Chafe (this
volume) describes a possible path of development of a different sort for
Caddo, in which structural characteristics of the markers shape the system.
All negative statements are categorized as Irrealis in Caddo, since they report
non-actualized events or states.
(38) Caddo Irrealis negative (Chafe this volume)
kút' áybah
kúy-t'a-yibahw
NEG-1 .AGENT.IRREALIS-see
T don't see him.'
- REALIS ( NEGATIVE ( I see him ))'
The scope relations described here for Central Porno and Caddo correctly
predict the categorization of negative commands, or prohibitives in each
language. In Central Porno, prohibitives are systematically categorized as
Irrealis on the basis of their imperative status.
(39) Central Porno scope relations: imperatives
commands:- REALIS ( IMPERATIVE )
prohibitions: NEGATIVE ( -REALIS ( IMPERATIVE )).
(40) Central Porno prohibitive: FJ
Dá-wi ?čhá-č-hi khyá swé-lar?khe thín ?e ma.
road.on stop-SAME.IRREALIS game play not it.is you
'Don't stop and play on the way home!'
NEGATION (- REALIS ( IMPERATIVE ( you stop and play on the way
home )))
In Caddo, prohibitions are systematically categorized as Irrealis as well but on
the basis of their negativity.
(41) Caddo scope relations: Imperatives
commands: + REALIS ( IMPERATIVE )
prohibitions: - REALIS ( NEGATIVE ( IMPERATIVE ))
(42) Caddo Prohibitive: (Chafe this volume)
kaššáy?bah
kaš-sah?-yibahw
NEG-2.AGT.IRREALIS-see
'Don't look at it!'
- REALIS ( NEGATIVE ( IMPERATIVE ( you look at it )
Such scope differences could arise in a number of ways. A simple
hypothesis would be that scope differences might result from differences in
the order in which the distinctions are grammaticized. Their paths of develop
ment can be much more complex, however. Scope relations may even shift
within the history of a single language.
Mesa Grande Diegueño, a Yuman language of southern California
closely related to Jamul, contains an Irrealis suffix -x cognate with the Jamul
Irrealis suffix of the same form (Langdon 1970:158-9). The suffixes appear in
all of the same contexts except for one: in Mesa Grande, negated clauses
obligatorily carry the Irrealis suffix, but in Jamul they do not.
384 Marianne Mithun
3. Conclusion
The comparison of the Irrealis/Realis distinctions described here suggests
some fruitful directions for the investigation of grammaticization patterns. In
Central Porno, as in many other languages, counterfactual, conditional,
deontic, future, hortative, and imperative constructions all contain Irrealis
markers. Some of these types, such as counterfactual and conditional con
structions, are widely categorized as Irrealis cross-linguistically. They are in
fact often cited as the best evidence for the appropriateness of the label
'Irrealis' for a marker. Counterfactual and conditional constructions convey
ideas that are most clearly within the realm of thought (imagination) rather
than actualized reality (perception).
On the Relativity of Irreality 385
reversed. Question formation and negation have narrower scope than reality
categorization. The Irrealis/Realis classification is thus sensitive to the status
of constructions as questions or negatives. A number of diachronic paths
could lead to such differences in semantic scope. One might be a difference in
the order of grammaticization of the individual constructions within a lan
guage. Others might be more complex, such as the Jamul formal reanalysis of
the negative construction.
The comparison of these seemingly disparate systems shows complete
accord in the nature of the basic Irrealis/realis distinction that underlies them.
In all of the languages, events and states classified as nonactualized, those that
remain within the realm of thought and imagination, are overtly distinguished
from those portrayed as actualized, having occurred or currently occurring.
The differences among the systems lie in the application of the distinction to
various contexts, the categorization of individual grammatical constructions
within each language. Some differences may result from special uses of the
distinction for expressive purposes, such as the classification of Imperatives
as Realis to indicate certainty of immediate actualization. Some may stem
from differences in the degree of probability deemed necessary for Irrealis
marking, as in the case of futures. Some may mirror differences in the relative
semantic scope of the Irrealis/Realis distinction and other grammatical dis
tinctions, such as interrogative or negative. Such scope differences could arise
in many ways, only a few of which have been explored here.
If the 'Irrealis/Realis' terminology were not used, the cross-linguistic
convergences in the semantic nature of the distinction, and the contrasts in its
application, might go unnoticed. Of course special care should be taken in
cross-linguistic comparisons of these categories, as with any modal catego
ries, to ensure that the distinctions on which they are based are indeed
comparable. Grammarians describing particular languages can help to prevent
misunderstandings if they provide sufficient evidence that a particular marker
does indeed represent the nonactualization of Irrealis mood in a variety of
contexts, for example, rather than the temporal deixis of future tense. With
such care, this area of modality will give us much to discover about processes
of grammaticization that lead to the convergence and divergence of gram
matical systems.
On the Relativity of Irreality 387
ABBREVIATIONS
NOTE
* Edith Bavin, Joan Bybee, and Suzanne Fleischman provided helpful comments on an
earlier draft of this paper.
REFERENCES
Suzanne Romaine
Merton College, University of Oxford
0. Introduction*
In this paper I will address a case of semantic and syntactic change involving
the grammaticalization of irrealis marking in Tok Pisin, an English-based
pidgin/creole spoken in Papua New Guinea. Modality, and in particular, the
marking of so-called irrealis, has figured prominently in discussions of tense-
mood-aspect (hereafter TMA) systems in pidgins and creoles since
Bickerton's claims about the universality of a creole prototype consisting of a
three member inventory of preverbal particles marking the same semantic
functions. These elements all contrast with the unmarked form of the verb. In
most creoles the irrealis marker is rather transparently derived from verbs or
auxiliaries such as go or sa (<shall) in English-based creoles. In others such as
Tok Pisin, however, a temporal adverbial has been recruited for this function
(compare Portuguese-based creoles lo from logo 'next' with Tok Pisin
baimbai from 'by and by').
Bickerton (1981:80) identifies three stages in a gradual process which
leads to the incorporation of these particles from a clause-external syntactic
position to the status of preverbal auxiliaries. Some examples are given in 1 to
3 to illustrate these proposed stages with reference to the grammaticalization
of baimbai in Tok Pisin.
390 Suzanne Romaine
(1) Baimbai mi go
'By and by I'll go.'
(2) Bai mi go
T U go.'
(3) Mi bai go
'I'll go.'
In the first stage baimbai and other similar elements appear optionally
clause-finally or clause-initially. In the second, they are incorporated into the
auxiliary but cannot combine with any Aux constituents, while in the third
they display combinatorial capacities. Tok Pisin is not quite in step with
Bickerton's stages since in (2) we see a phonologically reduced and un
stressed form of bai still in clause-external position rather than incorporated
within Aux. The third example shows bai in preverbal position, but does not
specify whether there are any restrictions on its co-occurrence with other Aux
constituents. Nevertheless, in many respects Tok Pisin represents an excellent
case for examining this alleged sequence of grammaticalization since all three
stages currently co-exist. In other ways, however, Tok Pisin's development is
atypical, both with respect to what has happened in other pidgins and creoles,
as well as with respect to the grammaticalization paths chosen by other
languages.
This general sequence of grammaticalization of TMA markers is also
taken by many others to be a significant hallmark of creolization. Labov
(1977:29), for example, says:
It is not at all obvious that a pidgin will develop obligatory tense markers
when it becomes a native language. Yet this has happened in case after case.
When pidgins become creoles, the system of optional adverbs gives way to
an obligatory tense marker next to the verb.
More recently, Markey (1982) and Mühlhäusler (1986:156-7) have cited the
use of sentence-external propositional qualifiers as a major typological char
acteristic of pidgins (see also Kay and Sankoff 1974:64). Sankoff and Laberge
(1980) argued on the basis of research done in the late 60s and early 70s that
the Tok Pisin temporal adverb baimbai is becoming a marker of future tense.
Moreover, they link this change to "the passage of Tok Pisin from a second
language lingua franca to the first language of a generation of urban New
Guineans". I will say more about the extent to which such developments are
triggered by creolization and exclusive to creoles in Sections 3., 4.3 and 5.
The Grammaticalization of Irrealis in Tok Pisin 391
In 2.I situate my use of the term 'irrealis' within the context of modality and
explain its relevance to the Tok Pisin case.
Bickerton's claims have been based largely on his analysis of data from
Hawai'i Creole English (HCE), partly in the conviction that "data of any kind
on the antecedent pidgins of any creole but HCE are simply non-existent"
(1981:79). The latter assertion is unwarranted. In this paper I will make use of
new historical data on Pidgin English in the Pacific together with synchronic
Tok Pisin data I collected during 1986 and 1987 in Papua New Guinea.
The spoken data were recorded over a two year period in 1986 and 1987 as
part of a study of rural and urban Tok Pisin in two provinces in Papua New
Guinea (see Romaine 1992 for further details). The two provinces, Madang
and Morobe, lie on the New Guinea Coast. The number of Tok Pisin speakers
has been steadily increasing there, though it is difficult to give precise figures.
In the 1966 and 1971 Censuses, which covered only 10% of the village
population, respondents were asked to say which members of the household
over the age of 10 were able to speak Tok Pisin. In Madang Province there has
been an increase in the number of reported speakers of Tok Pisin between the
years 1966 and 1971 from 62% to 69%, while in Morobe Province it has
increased from 47% to 59%.
regular after-school recording sessions over a period of two years. The data
for these children consist of narratives, free play and video story retellings. In
other areas such as Indagen and Bulolo there are some longitudinal data too
since some óf the same speakers were interviewed in 1986 and 1987.
At the same time as the analysis of the spoken material is being carried out, I
have been compiling a corpus of written texts (both contemporary and histori
cal) for comparison. Table 2 gives brief details of that part of the corpus I will
be referring to here and further information about the texts can be found in
Romaine (1992). Nupela Testamen (the 1966 translation of the New Testa
ment) and the weekly newspaper in pidgin, Wantok, played important roles in
the standardization of Tok Pisin. The other two main sources which were used
to make up the written corpus are Save Na Mekim (Understanding and Doing;
It is not entirely clear from the historical record when baimbai began giving
way to bai. My guess is that this happened sometime in the 1950s and 1960s,
but I will have more to say about that in 3. All three stages are represented in
the data from Bulolo, Unitech (Lae), Erima and Indagen, as can be seen in
examples (4) to (8). The same is true incidentally for the speech of adults.
(4) Baimbai yu kaikai. [BL1F4]. 1
'Then [after you've cooked], you'll eat.'
(5) Mi droim maunten, na bihain bai mi wokim wara wantaim.
[BL1M5].
'I'll draw the mountain, and then I'll do the water that goes with it.'
(6) Em i go insait nau, bai ol kaikai [INDIFl]
'He went inside and then they ate.'
(7) Yu stretim ston insait long hul ia orait ston bai stap nau bai yu
stretim hul ok yu putim disla kaikai ia [BL M5]
'You arrange the stones inside the pit, and once the stones are
there, you arrange the pit and you put the food in.'
(8) Ol man bai bihainim em ol bai kisim em. [IND1F1]
'The men then followed him and they caught him.'
Example (4) is one of ten examples of baimbai I have found so far in the
children's corpus. The speaker is a fourth grade girl in Bulolo, who is a second
language speaker of Tok Pisin from Irian Jaya. She also uses bai in both
394 Suzanne Romaine
clause-initial and preverbal position, as can be seen in (9). The only other
instances of baimbai are from one of the Unitech girls and the children in
Erima.
(9) Nau wanpla longlong man ia draipla kanu blem, nau disla
longlong ia em sa kaikai ol man nau. Longlong man ia kam luk ol
manki na em tok, "Yupla sa waswas long ia, mi kisim yupla go, bai
mi lusim yupla na yu bai waswas long disla ". Na em kisim ol go.
[BL1F4].
'There was a crazy man who had a canoe, and he ate people. This
crazy man saw the boys and said, "You boys wash over here and
I'll catch you, now I'll leave and you'll wash over there." Then he
got them.'
Thus, even today all three stages of this process of grammaticalization
can be represented synchronically in the speech of one speaker. It is evident,
however, that even at the time when Sankoff and Laberge obtained their data,
baimbai was a recessive feature. In a corpus of 395 examples of bai in the
speech of 18 people (9 children and adolescents between the ages of 5 and 17,
and 9 adults between the ages of 25-45) recorded near Lae, they found only
five instances of baimbai. These were phonetically already somewhat re
duced, i.e. [bəm'bai] or [bə'bai] and were found in the speech of three of the
nine adults. There were no instances of baimbai in the speech of the children
they interviewed (see Romaine and Wright (1987) on morphophonological
condensation in children's speech). Nevertheless, it is clear from my data that
the use of baimbai is not entirely exclusive to rural and/or second language
speakers, or adults.
As far as the meaning of baimbai and bai is concerned, it will be obvious
even from the few examples I have given here that bai is a highly redundant
marker which signals more than time relations. It often occurs with other
adverbs indicating relations of time or the sequence of events. For example, in
5, the connectives na bihain 'and then/afterwards' make it clear that the
activity of the second clause takes place after that reported in the first. In this
context bai is clearly redundant, and functions not so much as a tense marker
but merely as a sequence marker in discourse. This use of bai is in line with its
historical origin as a temporal adverb. In some texts it appears almost with
every clause. This can be seen in (10).
The Grammaticalization of Irrealis in Tok Pisin 395
(10) Krokodail tok olsem, "Nau bai mi go long warn na bai mi painim ol
dok na ol dok bai sindaun, bai mi kisim em go long aus bilong em.
Em bai wokabaut bilong mi. ", em tok, krokodail olsem. Krokodail
tok olsem, "Bai mi go long wara na painim dok". Na em go long
wara na em lukim bikpla ston na dok sindaun i stap. [IND1F6].
T h e crocodile said, "I'll go down to the river and look for dogs.
Dogs will be sitting there and I'll get one and take him back to his
house. Then he'll come to mine", the crocodile said. The crocodile
said, "I'll go down to the river and look for the dog." He went to the
river and saw a big stone and the dog was sitting there.'
Generally speaking, there is no obligatory marking of tense in Tok Pisin.
In this extract from a narrative in (10), the context makes it clear that the
events referred to take place in the past. The crocodile is of course making
statements here about what he will do in the future and uses bai to do so. In
this respect bai marks an intention, but it can also be understood in some
respects as a prediction about what will happen at some later point in time. It
is this meaning of prediction which is central to the notion of future according
to Bybee et al. (1991). I will return to this point in 4.3. Once the direct
discourse of the crocodile is finished, we are back into the frame of narrative
time, i.e. past events. That is why we find the unmarked verb form lukim 'he
saw', which is the usual form for reporting past events. The latter is cited by
Bickerton (1981) as a general characteristic of creole tense systems; namely,
the zero form of the verb encodes past tense for verbs of action. As might be
expected from the nature of reported speech, a great many occurrences of bai
are found in conjunction with it. At this stage I want to introduce the notion of
'irrealis' into the discussion and to claim that bai could be regarded either as
an irrealis or as a future marker. The paths leading to the grammaticalization
of futures and modality intersect at certain points because events in the future
are by definition hypothetical and there is some uncertainty about them. In
this sense we might say that the common demoninator of irrealis modality is
futurity or uncertainty (see Givón 1984:318).
Many linguists have noted the confusing and often overlapping ways in
which the terms 'aspect', 'tense', 'mood' and 'modality' have been used. In
their discussion of mood and modality Bybee et al. (1994: Ch 6, section 12)
comment, in particular, on some of the problems surrounding the use of the
term 'irrealis,' especially in discussions of non-Indo-European languages.
396 Suzanne Romaine
tense and unreal or hypothetical events and states. As Steele (1975) and others
have noted, the basic connection between the past tense and irrealis is their
sharing of the characteristic of being distant from present reality. Consider,
for instance, the use of the past in hypotheticals such as I wish I had a new car
(see the discussion in James 1982 and 1991). Verbs in the protases of con
trary-to-fact or unreal conditionals referring to the future are also inflected
with -ed in English, e.g. If you travelled to Hawai'i, you'd like it there.
Not surprisingly, bai is also used in similar kinds of conditionals in Tok
Pisin. This can be seen particularly when bai occurs in construction with
sapos (from 'suppose') in a kind of if/then construction as in (11). It can also
be used on its own, as in (12), where both the antecedent or cause clause and
the result clause are introduced by bai. It is semantically equivalent to an if/
then construction, even though there is no overt marking as such. The exten
sion from temporality to causality in this kind of construction is a natural one
in the sense that what is caused is later in time than the event or state which
precedes it. As I will show in 4., an understanding of the semantics of bai is
crucial to interpreting the significance of the syntactic developments I will
look at next.
I will now look at the syntactic position of bai in the nine areas in more detail.
My analysis revealed that 56% of the occurrences of bai are preverbal, while
44% are clause-initial. The data also show that although bai can either precede
398 Suzanne Romaine
or follow noun phrases, there is a tendency for noun phrase subjects to occur
with preverbal bai (see further in Sankoff and Laberge 1980:207). The third
person singular pronoun em strongly favors the occurrence of preverbal bai.
The shift to preverbal position appears to begin in the third person singular
and then spreads throughout the pronominal paradigm.
My findings on the position of bai in the nine individual areas can be
compared with those of Sankoff and Laberge shown in Table 3. If we are
dealing with on-going change, as they claim, then their data appear to have
intersected the change at a point just before bai shifts to preverbal position
since most of their occurrences of bai are still clause-initial. In my data the
balance has shifted in favor of preverbal positioning of bai. Interestingly, they
found no significant differences between the adults and children with respect
to the ordering of bai.
More specifically, it can be seen that the situation in Bulolo and Rempi
more closely parallels that reported by Sankoff and Laberge (1980), who
worked in Lae. That is, the shift from clause-initial to preverbal positioning of
bai is underway, but clause-initial position is at least marginally still the most
frequent. It is really only in Indagen that the shift is well advanced, and
possibly nearing completion, whereas it is only just beginning in Bulolo and
Rempi. The other areas, both rural and urban, represent an intermediate stage.
In general, apart from Bulolo and Waritsian, the areas in Morobe Province are
ahead of those in Madang. The Taraka children in Lae are most directly
comparable to the urban children interviewed by Sankoff and Laberge nearly
20 years ago and it would appear that change has progressed remarkably
quickly in that time span in this area.
Judging from past sociolinguistic studies of the role of social factors in
linguistic change, we might reasonably expect both sex differentiation and
significant age-grading, if change is taking place. Recall that Sankoff and
Laberge found no differences between children and adults. However, their
data base is too small to show the effect of age grading either between the
adult and younger population, or within these groups. Moreover, they do not
take into account the possibility of regional differences. My study found no
significant gender distinctions in the use of bai.
If it is the case that preverbal bai represents a change in progress, we
would probably find differences between both children and adults as well as
within the younger group of speakers. More specifically, we should find that
adults and older children showed a predominantly clause-initial pattern and
The Grammaticalization of Irrealis in Tok Pisin 399
Table 3. The distribution of bai in nine areas (compared with Sankojf and Laberge;
figures are percent)
% Clause-initial % Preverbal
[Sankoff/Laberge 59 41]
Rempi 58 42
Bulolo 56 44
Kusbau 50 50
Bahor 50 50
Waritsian 47 53
Erima 45 55
Geraoun 39 61
Lae (Taraka) 36 64
(Unitech) 32 68
Indagen 25 75
younger children showed increasing use of preverbal bai. There are some
indications that this is true, though there is not always a regular age-by-age
increment in certain areas. Age grading is particularly apparent for some of
the individual areas more so than others. For instance, there is no significant
age grading in Bahor, Erima or Geraoun. However, the oldest children in
Geraoun use mainly clause-initial bai, while the younger ones all use more
preverbal bai. In Unitech, there is significant age grading between younger
and older speakers, which happens to coincide with sex differentiation be
cause the girls are all younger than the boys. All the Indagen children use
more preverbal bai, though the oldest age group, namely, 16 year-olds use
predominantly clause-initial bai. The Indagen adults who use 44% clause-
initial bai are thus behind the younger generation with respect to this change.
Age-grading is, however, particularly apparent in the two areas where the
shift appears not to be implemented yet, namely Rempi and Bulolo.
Thus, the change appears rather sporadic in its transition and is not
clearly progressing in some areas like Erima, Kusbau and Bahor in a regular
way which is tied solely to increasing age, but there is still more to be said
about the differences between children and adults in my corpus. It is obvious
that whatever changes were taking place in the status of bai were well
underway prior to the existence of a large number of native speakers. If
stabilization and nativization are prerequisites for this kind of grammaticali
zation, then it is surprising that Sankoff and Laberge did not find the change
further along in Lae 20 years ago. In fact, they claim (1980:208-9) that not
only do native speakers appear to be carrying further tendencies which were
400 Suzanne Romaine
Table 4. Incidence of bai among first (i.e. urban) and second (i.e. rural) language
speakers (figures are normalized to distribution per 1000 words)
Normalized N bai N words
children: urban 8.0 2374 295965
rural 7.0 2878 409156
5252 705121
adults rural 11.5 498 43221
already present in the language, but also that fluent speakers show no differ
ence from native speakers with respect to use of bai. Sankoff and Laberge
(1980:205) say that 'bai seems to be redundant and obligatory for both adults
and children', with no observable differences between them. They record
almost equal numbers of bai for both groups (children = 192; adults = 203).
Certainly at the moment in other areas of the grammar and lexis, those
innovations which are associated with town speakers are the ones most likely
to be picked up by others (see Romaine, 1989b and 1993). It may be that the
reason why Indagen and Geraoun show such a high rate of pre verbal usage is
that influence from Lae has spread out first to these areas and they have now
overgeneralized or accelerated the change, though if that is the case, it is
surprising that Bulolo does not follow Lae too. Bulolo is connected to Lae by
a relatively good road while Indagen, though closer to Lae (some 25 miles in
the mountains to the northeast), is connected by air only. Geraoun is con
nected only by bush track to Indagen and surrounding areas.
However, by comparing Table 3 with Table 4, we can see statistically
significant differences between both rural and urban children as well as
between children and adults. Although first language speakers of Tok Pisin
clearly use bai more frequently overall than second language or rural speak
ers, it is the second language speakers in rural areas who lead in the introduc
tion of preverbal bai. While it would appear that children rather than adults
are ahead in introducing preverbal bai as the canonical way of marking
irrealis, it is the rural adults who show the highest rate of bai usage.
To interpret these findings more fully, I turn now to a consideration of the
syntactic developments in comparative and historical perspective, and then I
will look at the semantic aspects of this change. At that stage I will also return
to the question of age grading.
The Grammaticalization of Irrealis in Tok Pisin 401
There are at least three issues which need to be discussed in order to clarify the
sequence of steps identified by Sankoff and Laberge as part of the grammati
calization process. Firstly, do these stages reflect true diachronic ordering?
Secondly, what is the connection between syntactic positioning and phono
logical reduction? Thirdly, is this sequence coincident with creolization? To
take the last question first, I have already indicated that it does not appear to
be, despite the fact that Sankoff and Laberge (1980:195) link this change in
bai to 'the passage of Tok Pisin from a second language lingua franca to the
first language of a generation of urban New Guineans'. In order to illuminate
all three questions, however, I will need to consider further diachronic and
comparative evidence.
Evidence from the formative stages of Tok Pisin's development is very
sketchy. Earlier sources mention baimbai but not bai. This is true even of
Mihalic's (1957) first edition of the Tok Pisin grammar and dictionary.
Murphy (1966:59), however, lists it in his dictionary and it can be found in
print in Tok Pisin newspapers, e.g. Nius bilong Yumi published fortnightly by
the colonial administration, from the 1960s onwards. In the 1971 edition of
his dictionary and grammar, Mihalic (1971:63) lists bai with baimbai and
says that it is derived from English 'by and by', and has the same range of
meanings.
It seems likely that bai is a result of phonological reduction of baimbai,
as Sankoff and Laberge claim, even though the two forms have apparently
co-existed for some time. Further support for this can be found in Solomon
Islands Pijin, a near relative of Tok Pisin. The form bambae and its reduced
variants babae and bae are all currently in use and have been for some time.
Older speakers (36+) tend to be conservative and use mainly bambae and
babae. The younger generation tend to use bae (Jourdan 1985). In Bislama, a
variety of Melanesian pidgin spoken in Vanuatu also closely related to Tok
Pisin, similar variants are in use, though Crowley (1990:262) reports that bae
is now more widespread than bambae in the modern language. Whatever the
precise historical details are, it seems clear that baimbai has become very
marginal in ordinary conversational use in Tok Pisin, and indeed, much more
so than in Bislama and Solomon Islands Pijin.
402 Suzanne Romaine
With regard to the positioning of bai, the historical sources which have
been discussed by other scholars such as Sankoff and Laberge indicate that
clause-initial positioning is the earlier. All of the 11 instances of baimbai cited
by Churchill (1911:37) are clause- or sentence-initial. Four of these occur
before a noun phrase, and the others before pronouns. Mihalic (1957:43) lists
baimbai with a group of adverbs which he claims are found only at the
beginning of a sentence. In his later edition he says only that it is one of a set
of adverbs found usually at the beginning of a sentence.
It is striking that none of the earlier historical sources mentions the
possibility of preverbal baimbai, and Sankoff and Laberge have overlooked it
too. I have, however, collected a number of attestations of it both diachron-
ically and synchronically in speech and writing, as can be seen in examples
(13) through (42). I will now use them to argue that the question of syntactic
positioning of bai and phonological reduction must be seen as separate issues.
Their conflation by Sankoff and Laberge (and Labov) fails to accommodate a
number of competing developments which have made the grammaticalization
process less straightforward than it appears. The existence of these examples
indicates that the incorporation of the full form baimbai within the verb
phrase probably existed as a grammatical option long before phonological
reduction or creolization, or indeed, the existence of a community of fluent
second language speakers.
Examples (13), (14) and (15) are from the earliest textual records from
the jargon stage. The first two attestations for pidgin English in New South
Wales and Queensland predate the labor trade. Crowley (1990:252) notes
there is evidence that the clause-initial future marker also began to appear
occasionally in post-subject position in the early 1880s. It is also attested in
Samoan Plantation Pidgin, as in (16). Examples (17) through (21) are re
corded from the German colonial period.2
(13) me bye and by come back. [New South Wales, 1844, Troy 1985]
'I'll come back'.
(14) Suppose you no kill piccaninnies, that fellow by and by jump up
kipper. [Queensland, 1858, Praed 1885]
'If you don't kill the children, they will grow up into adolescents
and kill you.'
(15) Brother belong-a-me by and by he dead, [reported in Beche-la-mar
by E.L. Lazard 1880 in Noumea, New Caledonia to Schuchardt 1883]
'My brother will die soon.' 3
The Grammaticalization of Irrealis in Tok Pisin 403
'He will bring the word of the redeemer, and he will make all men
straight.'
(37) Long las de bilong dispela graun bambai i olsem tasol.
[Nupela Testamen, 1966. Mt. 13:49].
'And on the last day of the world this is the way it will be.'
(38) Em bambai i kam bek gen wankain.
[Nupela Testamen, 1966. Ap.l:ll.]
'He will come back again.'
(39) Em bambai i save as bilong dispela tok mi autim.
[Nupela Testamen, John 7:17]
'He will know the reason for what I am saying.'
(40) Skai wantaim graun baimbai i pinis.
[Nupela Testamen, Mk 13:31.]
'Heaven and earth will pass away.'
(41) Ol dispela man baimbai i stap long Australia inap olsem wan mun.
Oli baimbai i stap wantaim arapela kain kain taun kaunsel long
Victoria na Tasmania. As bilong limlimbur bilong ol em long
baimbai long lukim Australia taun kaunsel wok olsem wanem.
[Nius bilong Yumi 15 April 1966. Vol. 8 no.6., p.2].
'These men will stay in Australia for a month. They will visit other
kinds of town councils in Victoria and Tasmania. The reason for
their extended stay is so that they will be able to see how Australian
town councils work.'
(42) Dispela buk long Tok Pisin i bambai helpim planti pipol long
Papua Niugini save long nupela na guípela we long mekim gut
sindaun long ples bilong ol. [Preface to Save na Mekim, 1982.
Michael Somare, former Prime Minister]
'This book in Tok Pisin will help plenty of people in PNG to know
about new and good ways of making a good life in their villages.'
My examples are important for a number of reasons. They demonstrate
the option of using preverbal baimbai has apparently been available in the
language for at least a century, if we accept examples like (13) through (21) as
accurate renditions, and it still exists today in the speech and writing of fluent
users. They show that phonological reduction is neither a necessary nor
sufficient precondition for preverbal placement of the marker. Example (43)
may also provide evidence for the possibility of yet another variant, namely, a
clause-initial reduced form of baimbai.
406 Suzanne Romaine
then have set the precedent for the reduced form bai to occur both clause-
initially and preverbally. This also fits in with the finding that it is the third
person pronoun which provides the point of departure for the diffusion of
preverbal bai throughout the pronominal paradigm. If this is true, then phono
logical reduction of baimbai is not the only source for the short form bai.
(44) em bai tupela sindaun. [NDMISC87FA]
T h e two of them will sit down'.
(45) i go pas turnas, em bai skin bilong en kamap hatpela turnas.
[IND86MISCSB]
'If it [airplane] goes very fast, the outside becomes very hot.'
There are also many examples where bai appears both preverbally and
clause-initially with a repetition of the same verb, as in the cases in (46-48).
There are also cases where preverbal and clause-initial uses are juxtaposed
within the same utterance, which suggests that for some speakers the two are
optional variants, possibly with some stylistic or pragmatic significance.
(46) Nau bai kau bai go.
'Now the cow will go.'[UTlF5]
(47) Bai bed ia bai bruk. Bed ia bai bruk.
'The bed will break. [UT1F1]
(48) Ol bai katim ol karot ia bai ol katim ol karot.
'They will cut the carrots.'[UTIF]
Bybee et al. (1991) make two strong claims relating to the development of
futures:
(1) Futures in all languages develop from a small set of lexical sources
and all future morphemes from a given source go through similar stages of
development.
(2) The semantic change is accompanied by formal reduction.
With regard to the first point, they make a more general distinction
between aspectual and non-aspectual futures because they say that these two
groups develop in entirely different ways. Tok Pisin bai is an example of a
non-aspectual future. 7 However, they identify four routes of development,
one of which can be applied to the case of bai in Tok Pisin; namely, cases
where temporal adverbs indicating a time after the moment of speech or a
reference to time come to encode the future. Since the other three routes are
more commonly attested in the sample which Bybee et al. (1991) looked at,
they do not exemplify the fourth one or discuss it in detail. Thus, the extent to
which Tok Pisin matches their claims in all the precise details will have to
await investigation of other languages in which temporal adverbs have devel
oped into tense markers, such as the case of the Kru languages studied by
Márchese (1986).
410 Suzanne Romaine
Bybee et al. (1991:20) identify prediction as the protypical and defining use
for future grams, although more than half of the future grams in their data base
also had other uses. Examples (10) through (12) in Tok Pisin have shown a
similar range of uses other than prediction as components of the meaning of
bai. Bybee et al. (1991) take these as clues to the semantic development of
futures, telling us where the grams come from and how far they have ad
vanced in their development. Other meanings such as imperative are typical
of later stages. The original meaning of a gram also determines the range of
use it will have at later stages.
There is evidence to suggest that in Pacific Jargon English and its
immediate descendents that baimbai already encoded an irrealis distinction. A
small corpus of 48 examples of baimbai collected from an on-going survey of
19th century textual records on Pacific Pidgin English was examined to see
what meanings were expressed. Out of a total of 48, only 8 were pre verbal and
these all encoded the meaning of later, or remote future, or prediction in the
sense of Bybee et al. (1991), as in (13). Of the cases of clause-initial baimbai,
most of these (N = 24) too encoded remote futures, but others encoded results
or later sequences of events, some of which can be understood as warnings
and threats, as they still do in modern Tok Pisin, as in (50).
Table 6. Number of preverbal and clause-initial bai in subordinate vs. main clauses
Bahor Kusbau Rempi Erima Lae Indagen
PV CI PV CI PV CI PV CI PV CI PV CI
main 66 34 55 45 47 53 62 38 63 37 79 21
subord 47 53 40 60 40 60 60 40 66 34 76 24
412 Suzanne Romaine
main clauses. The data in Table 7, which I will discuss shortly, support this
too, in that the most frequent meanings in the preverbal slot in main clauses
are the tense/aspect ones of future, habitual and sequence.
Before looking at Table 7 in more detail, however, we need to look more
closely at the meanings encoded by bai in preverbal and clause-initial posi
tions. The relevant semantic categories appear to be the following: habitual
(aspect), sequence, result, imperative/suggestion and intention/purpose. I noted
earlier that some linguists such as Roberts (1990) categorize imperatives as
irrealis. Habituais also fall within the scope of irrealis for Givón (1984) since
they make reference to generics rather than specifics and are in this sense non-
referential. The merger between, or use of the same marker, for habituais and
irrealis apparently occurs in a number of creoles (see Taylor 1971). Bybee et
al. (1991) observe a connection between futurity and imperatives which arises
from situations in which a speaker makes a prediction about the addressee,
e.g. you're going to wash to hands before dinner.
Bybee et al. (1991:26) also say that developing grams often retain traces
of their earlier lexical meaning. Certainly, bai has retained traces of the earlier
meanings of baimbai. It has other uses too, such as imperative. This supports
their proposal that the use of future for imperative develops out of the
prediction sense and is therefore a late use (Bybee et al. 1991:28). I have
already given some examples of the other modal and aspectual meanings of
bai in (10) to (12). Some additional examples of bai used to encode result/
sequence/habitual in the past are in (55) to (56). Notice that both preverbal
and clause-initial bai can encode the past habitual. Example (57) shows the
use of bai in sequencing (cf. also (5)), while (58) shows its use in suggestions or
imperatives. Example (59) illustrates the intention/prediction meaning of bai.
Table 7. Meanings encoded by preverbal and clause-initial bai in main and subordinate
clauses (F/H/S = future, habitual, sequence; R = result; I/It/S =imperative/
intention/suggestion)
Kusbau Rempi Erima Lae Indagen
PV CI PV CI PV CI PV CI PV CI
M F/H/S 88 12 77 23 89 11 91 9 94 6
A R 33 67 33 67 25 75 25 75 100 0
I I/It/S 18 82 1 99 23 77 27 73 55 45
N
S F/H/S 38 62 47 53 65 35 79 21 84 16
U R 53 45 33 67 32 68 51 49 69 31
B I/It/S 19 81 20 80 34 66 18 82 48 52
The Grammaticalization of Irrealis in Tok Pisin 413
(55) Tasol man em putim kain hat olsem ol men bai respectim ol moa
yet. Em pasin bilong tumbuna. [IND86MISC1]
'However, if men wore this kind of hat, the women respected them
even more. That was the custom of our ancestors'
(56) Ol mipla save ron shutim ol sampla fis ol bai slip antap lo rif
[REM4MAM]
'We habitually catch the fish that sleep on the reef.'
(57) Bihain bai pepa bai i kam insait long ... komuniti
gavman na bai mipela lukim wanem tok bai i kam insait.
[IND87MISC1]
Then the document will come to the community and we will look
to see what it says.'
(58) Desla mangi ia tok: "Bai yumi go long bus". [REM2FBA]
'This boy said: "Let's go to the bush'.
(59) Ating bai mi go long maket nau. [REM1MFM]
T think I will go to the market now.'
Clauses which express imperatives, suggestions, hypothetical results and
intentions/purposes show the greatest clause-initial use of bai, while clauses
with future, habitual and sequence show the strongest tendency toward pre-
verbal bai. Thus, preverbal bai tends to be used where there is a good chance
of realization of the activity of the main verb modified by bai, while clause-
initial bai is preferred for cases where there is a dependency on some condi
tion or state, e.g. in clauses introduced by sapos. Contrast, for instance, the
hypothetical or irrealis condition in (60), where bai is clause-initial, with (12),
where the events have a greater likelihood of occurrence and bai expresses
predictions and intentions.
syntactic environment, namely, long clauses. In example (61) from Bahor bai
is clause-initial in a subordinate clause introduced by long, whereas in (62)
from Indagen both instances of bai are preverbal. In (62) the first use of bai
encodes a prediction or future result, whereas the second one expresses
purpose or intention, as does (61). Both of these meanings now tend to be
expressed increasingly by preverbal bai in Indagen. In the other areas clause-
initial bai is still used. In (63), also from Bahor, we see a case of preverbal bai
used in a long clause because here the meaning is habitual result.
(61) Em tokim pikini bilong en lo [a reduced form of long] tumoro b 'ol
go chekimpig. [BAH5FSK]
'He told his son that tomorrow they would go to check on the pig.'
(62) Sapos sios i karnap bikpela, em bai gim graun long ol bai sanapim
aus lotu. [IND86MISCSN]
'If the church [membership] gets big, it will give land for them to
build a church.'
(63) Olgeta taim em sa singaut long mani long ol man bai helpim em.
[BAH1FBS]
'He is always crying out for money for people to help him'.
The children from Kusbau, Bahor, Rempi, Erima, Lae and the Indagen
adults all generally use the same system. The Indagen children, who are the
most advanced in their use of preverbal bai, behave differently. In cases
where the other children have primarily clause-initial bai, Indagen usage is
split roughly 50/50 between clause-initial and preverbal bai. Thus, there are
no environments where clause-initial bai is favored. Clauses expressing inten
tion are on the point of becoming environments for preverbal bai. This
indicates the existence of a hierarchy of environments for change, as in Figure
1. The change begins in main clauses and is less advanced in subordinate
ones.
Table 8 shows the differences among four areas in more detail with
reference to the age grading dimension. The results show that Indagen clearly
leads the other areas in its use of preverbal bai in marking futures as well as
other modalities such as intentions, suggestions, etc. There is also age-grading
such that younger speakers are introducing more preverbal bai. The difference
between the Indagen younger (Grades 1-4) and older (Grade 5+) speakers is
considerable with respect to the use of preverbal bai in the marking of
suggestions, intentions, etc. The adults and older children (i.e. from Grade 5
The Grammaticalization of Irrealis in Tok Pisin 415
onward) in Indagen have the older system which is more like Lae in that the
meanings of suggestion, imperative and intention are still encoded primarily
by clause-initial bai. It is the youngest children in Grades 1-4 whose usage is
most advanced. 8
I now return to the predictions Bybee et al. make about the development
of futures. They (1991:29) classify future grams into four semantic ages based
on the uses they have in addition to their future use, as shown in Table 9. They
classify grams which develop from temporal adverbs in the same schema as
movement-derived futures. They put all the non-aspectual future grams into
this schema since they illustrate a different path of development from the
aspectual futures. It is also clear from Table 9 that some of the postulated
meanings for FUTAGE 1 and 4 overlap with those of modality, i.e. possibility,
obligation, desire, probability, etc. Table 9 can also be compared with Figures
1, 2 and 3 in Bybee et al. (1994: Ch. 6), which implicate the future in
grammaticalization paths towards various modalities.
416 Suzanne Romaine
Table 9. Meaning components for each FUTAGE (from Bybee et al. 1991:33, Table 2)
FUTAGE 1 FUTAGE 2 FUTAGE 3 FUTAGE 4
obligation intention future probability
desire root poss possibility
ability andative imperative
venitive use in complements
immediate future use in protases
Bybee et al. (1991) also make some predictions about the form and position
ing of grams in relation to the degree of grammaticalization they have under
gone. They (1991:33) note that positioning of affixes is governed by the
source constructions from which the affixes arose diachronically, which in
turn is governed by general typological features of the language. They predict
a form/meaning correlation reflecting degree of grammaticalization. Grams
that are older and have undergone more development tend to be closer to the
stem, more fused and shorter or more reduced in segmental material (Bybee et
al. 1991:33). Bickerton (1981) and others (e.g. Givón 1982) have made much
of the tendency for creoles to encode tense, mood and aspect by means of
preverbal particles. Bickerton's claim is that there is a creole prototype. The
use of particles rather than bound morphology clearly reflects the influence of
typology since pidgins and creoles generally dispense with morphology.
As far as Tok Pisin is concerned, the preverbal slot is not the only one
reserved for grammatical markers. The tense, mood and aspect markers, save,
laik, ken, bin, etc. all occur preverbally, but the completive marker pinis is
almost always postverbal. By comparison, in Torres Strait Creole, for in
stance, all grammatical markers precede the verb.
The Grammaticalization of Irrealis in Tok Pisin All
comparing these scores to those for English grams, bai has the same score as
will for shortness, but both score less than -ed.
By correlating the scores achieved on these three measures of fusion,
dependence and shortness, Bybee et al. (1991:39) predict that forms with
higher FUTAGES will have significantly higher values and they argue for a
reliable trend in their data towards higher measures of formal grammaticaliza-
tion as one moves from FUTAGE 1 through FUTAGE 4. It is somewhat problem
atic to apply these measures to a language with little morphology since the
measures are best suited to cases which display highly developed phonologi
cal and morphological conditioning of allomorphs. Nevertheless, we can see
in Table 10 that preverbal bai has the greatest degree of grammaticalization.
This is consistent with the fact that preverbal bai is attested only in recent
decades.
Bybee et al. (1991:41) conclude that there is a significant correlation
between the semantic properties hypothesized to belong to older grams and
the formal properties which accompany grammaticalization. There are several
biases in their method, which I have no space to comment on here. One,
however, which they recognize and attempt to rectify, is that typology may
interfere with the correlations. This affects Tok Pisin since it is a young
language by comparison with languages like English. Older languages may be
expected to have a greater number of older and therefore more highly fused
grams than languages with shorter time depth. Inflecting languages on the
whole may be expected to show more fusion, but this fusion may not indicate
a greater age of grams.
Having now traced both the syntactic and semantic development from
baimbai to bai in Tok Pisin, I can now evaluate the relevance of the claims
made by Bybee et al. (1991) about the evolution of futures. I have no reason to
doubt their statement that all futures develop from a limited set of lexical
sources and go through similar stages of development. However, in the case
of Tok Pisin, it is clear that formal reduction was not an absolute prerequisite
for the development of future meaning. In fact, Bybee et al. (1991:42) found a
deviation in FUTAGE 4 for both the measures of dependence and shortness.
Upon examining their correlations in Table 3 and 4 more closely, it appears
The Grammaticalization of Irrealis in Tok Pisin 419
issues use mainly baimbai. I am currently doing a diachronic study of bai and
other syntactic features in media Tok Pisin, which I hope will shed light on
this usage.
(64) Bihain oli mekim pinis general survey o lukluk raun oli tingting
bilong mekim ol rot bai olsem bilong bung wantaim long ol ples
balus na bris bilong mekim isis moa wakabaut long olgeta hap long
Territory. (Nius bilong Yumi. October 1962, Vol. 4. No. 12, p. 4).
'After they complete the general survey they are thinking of mak
ing the roads connect with the airports and bridges in order to
facilitate travel within all parts of the Territory.'
Another problem which may be specific to the Tok Pisin data or have
more general relevance for the predictions made by Bybee et al. (1991) once
we have more data from other cases where temporal adverbs have become
future markers is the classification of imperatives under FUTAGE 4. A strict
chronological progression from FUTAGE 1 to 4 would imply that more cases of
preverbal than clause-initial bai should be used with the meaning of impera
tive. As I showed above, however, clauses which express imperatives, sug
gestions, hypothetical results and intentions/purposes show the greatest
clause-initial use of bai.
5. Conclusion
futures. The most common source for futures in pidgins and creoles is to use a
verb of movement, e.g. go in most English-based pidgins and creoles. Notable
exceptions occur among some of the Portuguese and Spanish-based Atlantic
Creoles such as Papiamentu and Cape Verde Creole, which use lo [<Spanish/
Portuguese luegollogo 'later']. In Kristang (Malacca Creole Portuguese),
which has logu, the future marker can occur both within and outside the verb
phrase.
Although I have already demonstrated that creolization is not a trigger for
the grammaticalization of bai as a future or irrealis marker in Tok Pisin, I
return now to this issue in my concluding remarks as it was my point of
departure. I have already argued that creolization per se, in so far as it is
mainly an urban phenomenon, does not appear to be responsible for the shift
of bai to preverbal position.
There are some further relevant parallels when we examine comparative
evidence from other pidgins and creoles, which weaken the link between
creolization and the grammaticalization of preverbal tense particles. The most
interesting and relevant case is that of Solomon Islands Pidgin. There nativi-
zation does not seem to have affected the positioning of bae. Jourdan (1985)
found that rural adults had the highest incidence of preverbal bae; however,
these accounted for only 11% of the occurrences of bae. Thus, preverbal bae
is not by any means as frequent a variant for younger speakers of Solomons
Island Pijin as it is for those of Tok Pisin. Monolingual speakers hardly ever
use preverbal bae. Thus, for both younger and older speakers in all areas,
clause-initial bae is still the most frequent variant. Although Crowley
(1990:496;270) suggests that Bislama in Vanuatu was probably acquiring
native speakers in small numbers for most of its history, he says that the
placement of bae probably parallels more closely the pattern found in Solo
mon Islands Pijin than that in Tok Pisin.
Another relevant case from the Pacific is Hawai'i English Creole. In the
pidgin stage, temporal adverbs occur sentence-externally, either initially or
finally. When the creole developed, neither bambai nor pau (compare Tok
Pisin pinis), the completive marker, underwent any change of meaning, nor
were they incorporated into the auxiliary. A closer comparison of Tok Pisin
with Hawai'i Pidgin/Creole English, however, reveals a difference in out
come which is probably due to the longer time period for stabilization in Tok
Pisin and in the Melanesian Pidgins in general by comparison to creolization
in Hawai'i, which was more abrupt. Although Hawai'i Pidgin English and the
The Grammaticalization of Irrealis in Tok Pisin 423
Melanesian Pidgins have both baimbai and go, in Hawai'i, bambai never
became regularized as a future tense marker.
Further evidence to support the lack of connection between creolization
and grammaticalization of preverbal tense markers is found in Sango. Despite
the fact that there is now a second generation of creole speakers, the future
marker fade (identical to the adverb fade 'right now') always occurs first or
early in the sentence and is never phonologically reduced (Samarin 1975).
Nor has its use become obligatory. An example is given in (65).
(65) fade tongana kete nginza ti mo ake, mo goe vo a-Nivaquine quoi
If small money of you is, you go buy Nivaquine.
'If you should have a little money, you ought to buy some Niva
quine (anti-malarial medicine).'
In conclusion then, I have argued that there is no evidence to indicate that
creolization and/or urbanization is associated with the incorporation of tense
markers into the verb phrase in preverbal position in Tok Pisin. The existence
of preverbal baimbai with future meaning shows that the preverbal slot was
available for tense markers long before phonological reduction or the exist
ence of a community of native speakers. I have also shown that the grammat
icalization process follows in many respects the universal paths of
development for futures predicted by Bybee et al. (1991) at the same time as it
intersects with many of the key routes leading to modality.
FOOTNOTES:
* I would like to thank Joan Bybee and Gillian Sankoff for their helpful comments on an
earlier version of this article. I would also like to thank the directors of the Max-Planck-
Institut-für-Psycholinguistik, WJ.M. Levelt and Wolfgang Klein, for their support and
encouragement, and also Fiona Wright and Rod Mitchell, who assisted in the research at
various stages. I am also very grateful to the Papua New Guinea University of Technol
ogy for providing a research base and to the Provincial Government of Morobe and
Madang Provinces for permission to conduct research. I am also grateful to Susan
Hockey of the Oxford University Computing Service for assistance with the Oxford
Concordance Programme used in the analysis of the data.
1. The coding system is to read as follows: BL1 = Bulolo 1986; INDI = Indagen 1986; UT1
= Unitech 1986, etc. F = female; M = male. The number indicates the grade. Where
initials appear and no grade is given, the speaker is an adult.
2. I am grateful to Philip Baker for providing me with some of the examples from German
New Guinea and Australia.
424 Suzanne Romaine
3. Schuchardt (1883) also notes the occurrence of by and by in Chinese Pidgin English and
quotes the following example, which interestingly contains a preverbal usage: my
by'mby catchee he 'I will get it.' Although Baker (1987:179) has attested this feature for
Chinese Pidgin English and other Pacific Pidgin Englishes he does not note any prever-
bal occurrences. He dismisses one instance recorded by Leland (1876:110) as
unauthentic because the author never visited the Far East and composed all his examples
in London, e.g. he fedders by'mby stlate all-same nother hin 'Its feathers will become
straight like those of any other hen.'
4. Detzner claims to have lived among bush speakers of Tok Pisin for several years. He
continued fighting the war in the remote bush withough surrendering for four years after
Rabaul was firmly in the hands of Britain and Australia. He quotes numerous examples
of Tok Pisin as used by the police force. However, the use of the copula be and several
other unexpected non-standard features suggests that we are dealing with a somewhat
suspect literary form of Tok Pisin. After Detzner was repatriated, he published a
disclaimer of his book, Vier Jarhen unter Kannibalen. He said (1932:307-8) said the
book contained a number of misrepresentations concerning his journeys in New Guinea.
His travels were scientific in part only and were to be taken as primarily a fictional
account (see Biskup 1968).
5. Borchardt was a Catholic missionary on Manus Island. His language is clearly influ
enced by the Rabaul tradition. His examples are fabricated; most other examples have
clause-initial baimbai. His Guidance for learning the Tok Boi was translated from the
German original in 1930.
6. I owe this example to John Z'graggen.
7. Grams that have 'future' as one of their meanings may also more typically develop from
auxiliary constructions which have the meanings of 'desire', 'obligation' or 'movement
towards a goal'. These routes are also illustrated by other ways of expressing future in
Tok Pisin, e.g. laik 'want/desire', go 'to go', and ken 'to be able'. Bybee et al. (1991:55)
classify laik as a FUTAGE 1 gram expressing an agent-oriented modality.
8. There may also be a developmental aspect to the progression of this change from clause-
initial to preverbal bai. The data partly reflect the fact that older children and adults use
more clauses expressing the meanings of intention, etc. and thus, have more environ
ments for clause-initial bai to occur than do the younger speakers. The higher rates of
preverbal bai usage could indicate that the temporal meanings of bai are learned earlier
than the others, at least in some areas. The youngest children in both Lae and Indagen in
fact use fewer modal clauses than the younger children in other areas. However, this
tendency appears at odds with claims made about the priority of aspect over tense in
language acquisition more generally (see Romaine 1988:Chapter 7 for an evaluation of
these claims in relation to creoles).
The Grammaticalization of Irrealis in Tok Pisin 425
REFERENCES
Baker, P. 1987. "The Historical Developments in Chinese Pidgin English and the Nature
of the Relationships between the Various Pidgin Englishes of the Pacific Region".
Journal of Pidgin and Creole Languages 2.149-62.
Bergmann, U., ed. 1982. Save Na Mekim. Lae: Liklik Buk Information Centre.
Bickerton, D. 1981. Roots of Language. Ann Arbor: Karoma.
Biskup, P. 1968. "Hermann Detzner: New Guinea's First Coast Watcher". Journal of the
Papua and New Guinea Society 2.5-21.
Borchardt, K. 1930. Anleitung zur Erlernung des Tok-Boi, mit Wörterbuch. Manus, mimeo.
Bybee, J., Pagliuca, W. and Perkins, R.D. 1991. "Back to the Future". Approaches to
Grammaticalization. ed. by E.C. Traugott and B. Heine. Vol. II, 17-58. Amsterdam:
John Benjamins.
Bybee, J.L., Pagliuca, W. and Perkins, R.D. 1994. The Evolution of Grammar: Tense,
Aspect and Modality in the Languages of the World. Ch. 6: Mood and Modality.
Chicago: University of Chicago.
Churchill, W. 1911. Beach-la-Mar, the Jargon or Trade Speech of the Western Pacific.
Washington,D.C: Carnegie Institution.
Crowley, T. 1990. Beach-La-Mar to Bislama: The Emergence of a National Language in
Vanuatu. Oxford: Oxford University Press.
Detzner, H. 1921. Vier Jahren unter Kannibalen. Berlin: A.Scherl.
Detzner, H. 1932. Zeitschrift der Gesellschaft für Erdkunde 1932.307-8.
Foley, W.A. 1986. The Papuan Languages of New Guinea. Cambridge: Cambridge
University Press.
Givón, T. 1982. "Tense-Aspect-Modality: The Creole Prototype and Beyond". Tense-
Aspect. Between Semantics and Pragmatics, ed. by P. Hopper, 114-63. Amsterdam:
John Benjamins.
Givón, T. 1984. Syntax. A Functional-Typological Approach. Introduction. Vol. 1. Amsterdam:
John Benjamins.
Hall, R.A. 1943. Melanesian Pidgin English: Grammar, Texts, Vocabulary. Baltimore:
Linguistic Society of America.
Hall, R.A. 1955. Hands Off Pidgin English. Sydney: Sydney and Melbourne Publishing
Company Pty, Ltd.
James, D. 1982. "Past Tense and the Hypothetical: a Cross-Linguistic Study". Studies in
Language 6. 375-403.
James, D. 1991. "Preterit Forms in Moose Cree as Markers of Tense, Aspect, and
Modality". International Journal of American Linguistics 57.281-297.
Jourdan, C. 1985. Sapos lumi Mitim lumi. Urbanization and Creolization in the Solomon
Islands. Ph.D. dissertation. Australian National University.
Kay, P. and Sankoff, G. 1974. "A Language Universals Approach to Pidgins and Creoles".
Pidgins and Creoles: Current Trends and Prospects, ed. by D. De Camp and I.
Hancock, 61-72. Washington,D.C: Georgetown University Press.
Keesing, R. 1988. Melanesian Pidgin and the Oceanic Substrate. Stanford: Stanford
University Press.
426 Suzanne Romaine
Labov, W. 1977. "On the Adequacy of Natural Languages: The Development of Tense".
Linguistic Agency University of Trier Paper No. 23. Series B.
Leland, C.G. 1876. Pidgin-English Sing Song. London: Trübner and Co.
Ligo, G. 1987. Lanwis long nyus. Introdaksen long Stadi blong Bislama: Buk Blong
Ridim. ed. by T. Crowley, 81-4. Suva, Fiji: Ekstensen Sevis, Yunivesiti blong Saot
Pasifik.
Liklik Katolik Baibel. 1934. Alexishafen: Catholic Mission Press.
Morchese, L. 1986. Tense/Aspect and the Development of Auxiliaries in Kru Languages.
SIL Publications in Linguistics, 78. Dallas: Summer Institute of Linguistics.
Markey, T.L. 1982. "Afrikaans: Creole or Non-Creole?" Zeitschrift für Dialektologie und
Linguistik IL. 169-207.
Mihalic, F. 1957. Grammar and Dictionary of Neo-Melanesian Pidgin. Westmead, NSW:
The Mission Press.
Mihalic, F. 1971. The Jacaranda Dictionary and Grammar of Melanesian Pidgin. Port
Moresby: The Jacaranda Press.
Mihalic, F. 1986a. Konstitusen Bilong Independen Kantri Papua Niugini. Boroko, PNG:
Word Publishing Company.
Mihalic, F. 1986b. Stail Buk bilong Wantok Niuspepa. [reprint].
Mühlhäusler, P. 1978. "Samoan Plantation Pidgin English and the Origin of New Guinea
Pidgin". Papers in Pidgin and Creole Linguistics No.1. Pacific Linguistics. Series A-
24.67-119. Canberra: Australian National University.
Mühlhäusler, P. 1986. Pidgin and Creole Linguistics. Oxford: Blackwell.
Murphy, J. 1966. The Book of Pidgin English. Brisbane: W.R. Smith and Paterson Pty.
Nupela Testamen bilong Bikpela Jisas Krais na Buk bilong Ol Sam. 1966. Port Moresby:
The Bible Society of Papua New Guinea.
Praed, Mrs. Campbell 1885. Australian Life: Black and White. London: Chapman and Hall.
Roberts, J.R. (1990). "Modality in Amele and other Papuan Languages". Journal of
Linguistics 26: 363-401.
Romaine, S. 1988. Pidgin and Creole Languages. London: Longman.
Romaine, S. 1989a. "Some Differences Between Spoken and Written Tok Pisin". English
World Wide 9.243-69.
Romaine, S. 1989b. "Change and Variation in the Lexicon of Tok Pisin in Papua New
Guinea". Synchronic and Diachronic Approaches to Linguistic Variation and Change.
ed. by T. Walsh, 268-80. Washington,DC: Georgetown University Press.
Romaine, S. 1990. "Substratum, Stabilization and Grammaticalization in Melanesian
Pidgin English". Pacific Studies 14: 79-85.
Romaine, S. 1993. "The Decline of Predicate Marking in Tok Pisin". The Atlantic Meets
the Pacific, ed. by F. Byrne and J. Holm, pp. 251-61. Amsterdam: John Benjamins.
Romaine, S. 1992. Language, Education and Development: Urban and Rural Tok Pisin in
Papua New Guinea. Oxford: Oxford University Press.
Romaine, S. and Wright, F. 1987. "Short Forms in Tok Pisin". Journal of Pidgin and
Creole Languages 2.64-7.
Samarin, W.J. 1975. "Historical, Ephemeral and Inevitable Verbal Categories". Paper
presented at the International Conference on Pidgins and Creoles, Honolulu, Hawai'i.
The Grammaticalization of Irrealis in Tok Pisin All
Sankoff, G. and Laberge, S. 1980. "On the Acquisition of Native Speakers by a Lan
guage". The Social Life of Language. G. Sankoff. 195-211. Philadelphia: University of
Pennsylvania Press. [Reprinted from Kivung (1973) 6.32-47].
Schuchardt, H. 1883. "Kreolische Studien V: Über das Melaneso-englische". Sitzungs
berichte der kaiserlichen Akademie der Wissenschaften zu Wien, Philosophisch-his
torische Klasse, 105.151-61.
Steele, S. 1975. "Past and Irrealis: Just What Does it All Mean?" International Journal of
American Linguistics 41.200-217.
Taylor, D. 1971. "Grammatical and Lexical Affinities of Creoles". Pidginization and
Creolization of Languages. 293-6 ed. by D. Hymes. Cambridge: Cambridge University
Press.
Troy, J. 1985. Australian Aboriginal Contact with the English Language in New South
Wales: 1788-1845. B.A. thesis. University of Sydney.
Vogel, H. 1911. Forschungsreise im Bismarckarchipel. Hamburg.
The Evaluative Function of the Spanish
Subjunctive
Patricia V. Lunn
Michigan State University
1. Introduction
have shown that the best synchronic description of the Spanish mood system
links choice of the Indicative (I) to assertion and choice of the Subjunctive (S)
to non-assertion. A proposition may be unworthy of assertion because the
speaker has doubts about its veracity:
(1) Dudo que sea (S) buena idea.
'I doubt that's a good idea'
or because the proposition is unrealized:
(2) Necesito que me devuelvas (S) ese libro.
'I need you to return that book to me.'
or because the proposition is presupposed:
(3) Me alegra que sepas (S) la verdad.
'I'm glad that you know the truth.'
Assertion and non-assertion are not the only options, however. Klein
(1980) has shown that the territory between assertion and non-assertion is
occupied by a cline of speaker attitudes, with choice of the Subjunctive
becoming more likely as speakers report diminished degrees of certainty with
respect to propositions.
Lunn (1989) has shown that Subjunctive-marked propositions, whether
true or false, share the semantic quality of low-information value. Low-
information value is an instantiation of what Sperber and Wilson (1986) term
low relevance. Sperber and Wilson argue that the expectation that communi
cated propositions will modify and improve existing information is the motive
force in discourse comprehension: "Every act of ostensive communication
communicates the presumption of its own optimal relevance" (1986:158). In
Spanish, propositions that are less than optimally relevant, either because they
add nothing new to existing information or because they add nothing true to
existing information, are marked with the Subjunctive.
Subjunctivizable information can be characterized as not possessing
certain qualities, a suggestive characterization in view of the language-univer
sal link between negation and irrealis modalities. 2 Potentially assertable
information must have two qualities: in the speaker's opinion, it must be both
reliable as to truth value and informative as to news value. Information that is
believed to be flawed in truth value or flawed in news value is unlikely to be
asserted.
Evaluative Function of the Spanish Subjunctive 431
Clauses marked with the Indicative, in contrast, appeared after the main verb.
(5) Lang soslaya el hecho mayor de que las ciencias físicas están (I)
penetradas de visiones filosóficas del universo y de todo lo que se
encuentra en él.
'Lang overlooks the major fact that the physical sciences are shot
through with philosophical visions of the universe and of all that is
in it.' (Krakusin and Cedeño 1992:1291)
With respect to the linear distribution of theme and rheme, Contreras
(1976:26) established that "the highest ranking rheme occurs in final posi
tion". Clearly, the information value of these factive clauses determines both
their mood marking and their position in the sentence. Guitart (1991) has
suggested a similar approach to the Spanish Subjunctive by distinguishing
between pragmatic presupposition, based on speaker assumptions about
hearer knowledge, and semantic presupposition, based on sentence-level truth
conditions.
'Almost eleven years after Munich Bayern won its third and last
European Cup, the current champions of the FRG and Spain are
going to meet one another in an official match.'
(7) La pareja, que se hiciera (PS) famosa por interpretar el papel de
marido y mujer en "El pájaro espino", es en la vida real un
matrimonio feliz. {Hola, 6.7.85)
'The couple, who became famous for their role as husband and
wife in "The Thorn Birds", is happily married in real life.'
In this kind of writing, judgments about information value are based on a
very broad definition of discourse context. The discourse contexts in which the
subjunctivized propositions of (6) and (7) are judged to constitute old infor
mation include previous editions and may span a fairly long period of time.
The Subjunctive in (7) appears in a non-restrictive relative clause, where
the grammars say that it should not appear at all (Solé and Solé 1977:187-94).
Given that non-restrictive relative clauses facilitate the introduction of addi
tional information, it is surprising that the non-assertive Subjunctive should
appear in them. This anomaly can be explained in relevance terms: journalis
tic information is marked for low relevance if it is considered to be known to
a faithful readership. The flagging of known information serves to communi
cate the additional message that writers assume readers to be knowledgeable;
this use of the Subjunctive serves not just to classify information but to create
solidarity between a publication and its readers.
In some cases, all that is necessary in order for a proposition to be
presupposed is prior appearance in a headline. Then, when reiterated in the
following text, the information is marked with the Subjunctive.
Another example of the link between mood choice and old/new informa
tion appeared in a recent newspaper report on a notorious murder. Under the
headline Una maleta con un niño muerto 'Suitcase with dead child inside'
appeared the following sentence.
(9) Nadie en la ciudad, ni la policía, ni los vecinos, encontraron los
motivos por los que una mujer ... pudo (I) matar al menor de los
hijos de su vecina ...y enviara (PS) después el cuerpo a Madrid
dentro de una maleta. (El País, 22.5.92)
'No one in the city, on the police force, or in the neighborhood
understood the reasons why a woman could kill her neighbor's
youngest child and then send his body to Madrid in a suitcase.'
The first subordinate verb, pudo, is in the Indicative mood, while the verb
enviara, in the clause which repeats the headline information, is in the
Subjunctive. The uniquely gruesome disposal of the body was talked about all
over Spain, so it is impossible—and unnecessary—to decide whether the
factor controlling mood choice here is the global discourse context or the
format of the article in which the verb appeared. In the terms of category
theory, this is a case of multiple motivation for the explanation of the exten
sion of a category. As Lakoff (1987:86) says, "where motivation is concerned,
the more kinds of motivation, the better."
The discourse context that governs mood choice in the journalistic data
cited in this section and in Section 2 can be as narrow as one article, or as
broad as a chronologically extended series of references to one topic. The use
of the Subjunctive to mark known information serves as acknowledgment of
the shared context of propositions, and in this sense it serves an interpersonal
discourse function.
4. Dialect variation
Because of its indicative history, the -ra Past Subjunctive is used in ways that
the -se form usually is not. It is quite conventional to use the -ra form of
certain modal verbs in main clauses in order to make polite requests and
suggestions.
(20) Quisiera hacerle una pregunta.
T would like to ask you a question, if I may.'
The -se form sounds obsequious and pompous in this formula, and few
speakers report having heard it. It is crucial, however, that it is the exagger
ated politeness of the -se form that is objectionable here. The -ra form is fully
identified with the Subjunctive mood, and the alternate Past Subjunctive -se
form is, at least in theory, substitutable for it.
The -ra form is also likely to be chosen for the 'old information' journal
istic uses discussed in Section 5 above. The -se form is occasionally used in
this way, however, which shows that the criterion for substitutability is
morphological: speakers identify both the -ra and the -se forms as Subjunc
tives. Examples (18) and (19) above illustrate the old information use of the
-se form.
If main-clause and pluperfect uses of the -ra form are classified as non-
Subjunctive, as most traditional grammars would have it, the difference
Evaluative Function of the Spanish Subjunctive 439
between the -ra and the -se forms is hard to describe. Gili y Gaya, for
example, says that "both forms are interchangeable as long as they are
Subjunctive" (1964:179). If, however, all of the -ra uses are considered
Subjunctive, then it is possible to make the generalization that the -se form
encodes less assertion than the -ra form both because it rarely appears in the
main clause and quasi-pluperfect environments of the -ra form, and because
of contrasts like that between (21) and (22).
In this text, the -se form is used in the protasis of a conditional sentence:
si hubiese podido 'if he had been able'. The contrastive value of the -se form
is thus exploited by having it appear in a subordinate clause with past time
440 Patricia V. Lunn
reference, while the -ra form appears in a main clause with present time
reference. This chain effect is often to be observed when, in a given context,
there are a number of syntactic slots for which degrees of modality may be
selected. Klein-Andreu (1986) explains the various possibilities in detail.
Lunn (1988) discusses other data which show the -se form to be a less
assertive variant of the -ra form.
The -se form is the minority form in all dialects, though its frequency varies a
great deal from dialect to dialect. In this section, -se usage in modern novels
from Peru, Argentina and Spain is examined. The novels were chosen because
their authors are famous for their ability to write prose that evokes the spoken
language.
Listening to Peruvian Spanish, one gets the impression that the -se form
is virtually unused in that dialect. This impression is borne out by the novel
Un mundo para Julius, written in 1970 by the Peruvian novelist Alfredo
Bryce Echenique. In this book, the -se form appears only 21 times in nearly
500 pages, which amounts to slightly less that 2% of all Past Subjunctives.
None of these -se forms appears in the Lima speech of the characters. The
verb hubiese, from the auxiliary haber 'to have', accounts for 10 of the 21
cases, or 48% of all -se verbs. Only one of the verbs in the -se form is a
transitive verb. Also, 8 of the -se forms, 38%, appear in negative clauses. In
Un mundo para Julius, residual -se usage can be described as linked with—
but not triggered by—negation and haber.
In the novel, the -se form is often used in the description of highly
unlikely or fanciful situations, which may or may not be marked with an
introductory irrealis conjunction.
(24) Muchas veces tropezó la chola con los mayordomos o con el
jardinero que yacían muertos alrededor de la carroza, para que
Julius, Jesse James o Gary Cooper según el día, pudiese (PS)
partir tranquilo a bañarse. (Bryce Echenique 1991:13)
'Often the Indian girl tripped over the servants or the gardener who
were lying dead around the coach, so that Julius, Jesse James or Gary
Cooper, whoever he was that day, could go off quietly to his bath.'
Evaluative Function of the Spanish Subjunctive 441
the Spanish text, which suggests its continued usage in the Madrid dialect of
the characters. However, the -se form accounts for only 13% of all Past
Subjunctive verb forms in the book, about half of the Argentine figure. As in
the Peruvian and the Argentine texts, the -se form appears most often with the
auxiliary haber, here in 55% of all cases.
In this text, as in the Peruvian text, there is an alternation of extended
non-use of the -se form and closely spaced occurrences of it. Such evidence
shows how a form is sporadically used even as it falls out of use. As speakers
produce longer and longer periods of non-use, the form disappears from
learners' input.
As in the Argentine text, a significant percentage (39%) of the -se forms
in Si te dicen que caí are introduced by a counterfactual conjunction. How
ever, it is not the case in any of the texts examined for this paper that the -se
form must appear in counterfactual clauses; the -ra form may appear there as
well. That a high percentage of the minority -se forms should appear in
contrary-to-fact clauses is logical; in this highly non-assertive environment, a
form that encodes a high degree of non-assertion should be used.
Klein-Andreu (1991) traces the genesis of the Subjunctive use of the
originally indicative -ra form to its early backgrounding functions. In her
data, use of the -ra form in a fourteenth-century text correlated significantly
with the 'low focus' factors of negation and the use of the copular and
auxiliary verbs ser, estar and haber. It is striking that, in the modern language,
use of the most non-assertive of the Subjunctive forms continues to be linked
with these factors.
In Table 1, the first number shows the percentage of Past Subjunctive
forms that carry -se morphology. Note that, despite the differences in absolute
-se usage, the co-occurrence of -se morphology and auxiliary haber is remark-
ably similar. The number on the right shows the percentage of -se forms that
appear in main clauses. Fully 100% of these main-clause uses, in all three
dialects, are realized as hubiese', -se morphology can travel with haber to the
main clause, even though the semantic value of the morphology would seem
to exclude it from this locus of assertion.
I have done the same analysis on other novels by the same authors, and
found that the percentage of Past Subjunctives realized as -se varies from
book to book, as authors adopt the point of view of very different narrators,
and as the authors themselves live for long periods away from their native
countries. The ranking, however, from least to greatest use of -se, does not
change. 3
9. Conclusions
Heine (this volume) has discovered that, for the German modal verbs,
"the highest values of agent-oriented and the lowest values of epistemic
modality are found with items associated with high verbality". This is sugges
tive in view of the fact that, in Spanish, the copulas and auxiliaries—which
lack some of the characteristics of full verbs—have a special behavior with
respect to modality. In the texts examined, the low-assertiveness -se form
appears preferentially with copular and auxiliary verbs. The next form up in
assertiveness, the -ra form, appears freely in main clauses only as hubiera,
from haber. The verbs that appear in Participle form with this auxiliary are
usually non-action verbs. The list deseado 'desired,' esperado' hoped,'
pensado 'thought,' querido 'wanted' (all mental act verbs) and sido 'been,'
covers the majority of the possibilities. The one case of a second-person
subject has already been discussed as (27); here, significantly, the modality is
agent-oriented.
Finally, there is a hard-to-define but very real performance factor at work
in all of this. In modern Spanish, the mood system is a tremendously sophisti
cated rhetorical device, and individual speakers are more or less competent
with respect to using it in a sophisticated way. There are native speakers of
Spanish who are masterful manipulators of the mood system, and there are
native speakers who are clumsy at using mood to produce rhetorical effects.
These varying degrees of rhetorical—as opposed to grammatical—compe
tence constitute input for learners of the language, and this loop serves to
further subjectify the system.
Recent work in categorization has shown that the structure of categories
varies depending on what the goal of categorization is. According to Barsalou,
"it appears that people use a variety of differences between exemplars when
judging typicality" (1987:104); with respect to mood in Spanish, speakers are
presented with syntactic, semantic, discourse and metalinguistic differences
from which to judge what constitutes a typical Subjunctive. Concepts are
temporary constructs in working memory which typically "vary widely as a
function of goals, current context, and recent experience" (1987:135). The
data presented in this paper exemplify some of the goals and contexts which
influence choice of the Spanish Subjunctive. The grammar of mood in Span
ish "emerges" (Hopper 1988) from the complex interaction of many factors.
Evaluative Function of the Spanish Subjunctive 447
NOTES
REFERENCES
Chafe, Wallace. This volume. "The Realis-Irrealis Distinction in Caddo, the Northern
Iroquoian Languages, and English".
Contreras, Heles. 1976. A Theory of Word Order with Special Reference to Spanish.
Amsterdam: North-Holland.
DeMello, George. 1993. "-Ra Vs. -Se Subjunctive: A New Look at an Old Topic".
Hispania 76.235-44.
Donoso, José. 1965. Los mejores cuentos de José Donoso. Santiago de Chile: Editora Zig-Zag.
Fleischman, Suzanne. 1982. The Future in Thought and Language: Diachronic Evidence
from Romance. Cambridge: University Press.
Fleischman, Suzanne. 1989. "Temporal Distance: A Basic Linguistic Metaphor". Studies
in Language 13.1-50.
Gili y Gaya, Samuel. 1964. Curso superior de sintaxis española (9a ed.). Barcelona:
Bibliograf.
Gsell, Otto & Ulrich Wandruszka. 1986. Der Romanische Konjunctiv. Tübingen: Nie
meyer.
Guitart, Jorge M. 1991. "Aspectos pragmáticos del modo en los complementos de predi
cados de conocimiento y de adquisición de conocimiento en español". Indicative y
subjuntivo ed. by Ignacio Bosque, 315-29. Madrid: Taurus Universitaria.
Guo, Jiansheng. Tis volume. "The International Structuring of Meaning: Children's Use
and Development of the Manderin Model Neng (Can)".
Harris, Martin. B. 1986. "The Historical Development of si-clauses in Romance". On
Conditionals ed. by E. C. Traugott, A. ter Meulen, J. S. Reilly & C. A. Ferguson, 265-
84. Cambridge: University Press.
Heine, Bernd. This volume. "Agent-Oriented vs. Epistemic Modality: Some Observations
on German Modals".
Hopper, Paul. 1988. "Emergent Grammar and the A Priori Grammar Postulate". Linguis
tics in Context: Connecting Observation and Understanding ed. by Deborah Tannen,
117-34. Norwood, NJ: Ablex.
James, Deborah. 1982. "Past Tense and the Hypothetical: A Cross-Linguistic Study".
Studies in Language VI. 375-403.
Kany, C. E. 1951. American-Spanish Syntax (2nd ed.). Chicago:University Press.
Klein, Flora. 1980. "Experimental Verification of Semantic Hypotheses Applied to Mood
in Spanish". Georgetown University Papers in Language and Linguistics 17.15-34.
Klein-Andreu, Flora. 1986. "Speaker-Based and Reference-Based Factors in Language:
Non-Past Conditional Sentences in Spanish". Studies in Romance Linguistics ed. by
Carmen Silva-Corvalán & Osvaldo Jaeggli, 99-119. Dordrecht: Foris.
Klein-Andreu, Flora. 1991. "Losing Ground". Discourse Pragmatics and the Verb ed. by
Suzanne Fleischman & Linda Waugh, 164-178. London: Croom Helm.
Krakusin, Margarita and Cedeño, Aristófanes. 1992. "Selección del modo después de
el hecho de qué". Hispania 75.1289-1293.
Lakoff, George, 1987. "Cognitive Models and Prototype Theory". Concepts and Concep
tual Development: Ecological and Intellectual Factors in Categorization ed. by Ulrich
Neisser, 63-100. Cambridge: University Press.
Lavandera, Beatriz R. 1983. "Shifting Moods in Spanish Discourse". Discourse Perspec
tives on Syntax ed. by Flora Klein- Andreu, 209-31. New York: Academic Press.
Evaluative Function of the Spanish Subjunctive 449
F.R. Palmer
University of Reading
Introduction
Even a brief look at the forms used for modality accompanied by negation in a
few languages will show that there is a great deal of irregularity, in the sense
of lack of regular correspondence between form and meaning. This paper
investigates the negative modal forms of a number of diverse languages
within a simple typological paradigm, and shows that several kinds of supple-
tion are used and that there is a strong tendency for some terms in this
paradigm to be regular and other forms to be irregular.
As the title of this paper indicates, it is concerned only with the modals of
possibility and necessity, and, moreover, only with epistemic and deontic
modality, deontic possibility and necessity being interpreted in terms of the
modals for permission (English may/can) and obligation (English must) re
spectively. The can of ability (dynamic possibility) is not included for two
related reasons, that it is less modal and more like verbs such as wish, intend
etc., and it does not raise the same kind of problems with negation as the other
forms.
There are three reasons for this delimitation of the area of discussion:
(i) It is comparatively easy to identify such modals across languages,
though it must be allowed that there are some differences of meaning. In
particular, in some languages it would seem that there is no distinction
between 'mustn't come' and 'oughtn't to come', while a Hebrew speaker felt
that the form he gave me for 'He must be in his office', might be better
translated as 'Of course he is in his office.' But problems of this kind beset
any kind of cross-linguistic comparison.
(ii) It is possible to paraphrase them in terms of such expressions as 'It is
possible/necessary that ...', and thereby to indicate where the negation is
semantically located (its scope) — 'It is not possible/necessary that...' and 'It
is possible/necessary that... not....' This is important, because it is assumed
that in the ideal or regular situation the grammatical placement of the negative
456 F.R. Palmer
indicates the scope of the negation. If the modal is negated, the expected
paraphrase will be 'It is not possible/necessary that...', while if the full verb is
negated, the paraphrase will be 'It is possible/necessary that... not ....' One
concern of this paper is the way in which modal systems diverge from this
ideal and the forms are, in this sense, irregular.
(iii) Possibility and necessity are related logically in terms of negation,
and these relations are to some degree mirrored in language. This is clearly
seen from English. Consider:
(4) He may be in his office
He can't be in his office
He may not be in his office
These express 'possible that', 'not possible that' and 'possible that not'
respectively. (There is suppletion of can for may, but both are essentially
modals of possibility). However, as noted in Section 1 (ii), the second and
third examples can be seen as the negative forms of He must be in his office
('necessary that', indicating a firm conclusion). This can be explained in
terms of the logical equivalence Not Possible = Necessary Not and Possible
Not = Not Necessary.
Because of irregularity, the lack of correspondence between form and
meaning, it is essential to distinguish between the characterization (in terms of
negation and possibility/necessity) of the forms and the meanings of the
modals. This can be achieved by using quotation marks for the meanings, but
not for the forms; thus it can be said of the examples above that the not-
possible form (can 't) is used to express 'necessary-not' and the possible-not
form (may not) is used to express 'not-necessary' (but see Section 7 for the
problem with may not).
In my analysis there are twelve different categories (see Palmer 1990:
39). These are, with the relevant English forms:
(5) Epistemic 'possible' may
'not-possible' can't
'possible-not' may not
'necessary' must
'not-necessary' (may not)
'necessary-not' (can 't)
Negation and the Modals of Possibility and Necessity 457
I did not discover a language with a perfectly regular system. The closest to it
was found in Modern Greek, where only one form, that for deontic 'not-
necessary', was irregular. The regular forms were:
(8) bori na ine sto yrafio tus (epistemic possibility)
it.can that they.are in.the office theirs
'They must be in their office'
Sen bori na ine. (not-possible)
not can that they.are...
'They can't be ...'
bori na min ine... (possible-not)
it.can that not they.are...
'They may not be ...'
prepi na ine sto yrafio tus (epistemic necessity)
it.must that they.are in.the office theirs
'They must be in their office'
Sen prepi na ine... (not-necessary)
not it.must that they.are...
'They may not be ...'
prepi na min ine... (necessary-not)
it.must that not they.are...
'They can't not be ...'
boris na fivis (deontic possibility)
you.can that you.leave
'You can (may) leave'
460 F.R. Palmer
Where the 'ideal' situation is not to be found, i.e., where the scope of the
negation is not reflected in the grammatical placement of the negative, there
are several possibilities:
(i) The simplest is that another verb is used. Two examples of this are
found in English: the use of can in place of epistemic may for 'not-possible'
{can't) and the use of need in place of deontic must for 'not-necessary'
{needn't).
(ii) The logically equivalent form is used. An example is the use of
('not- possible') can't in English as the form for epistemic 'necessary-not'
corresponding to must (see (2) and, for other examples, Section 5).
(iii) The negative is 'misplaced', placed on the 'wrong' verb. This was
found commonly and almost exclusively with 'necessary-not' (mostly with
deontic modality); this was signalled grammatically by the not-necessary
form. This too was illustrated, from French and Italian, in Section 1, and there
are further examples in Section 5.
(iv) There is a rather different construction, but using the relevant modal.
Thus for Latvian, the form proposed for deontic 'possible-not' (permission
not to) was literally 'You can come in, or not':
(11) tu drīksti nākt ieksa vai ne
you can come in or not
'You needn't come in'
(v) The form used is not related grammatically to the modal system.
5. Logical suppletion
One of the most interesting points is the suppletion in terms of the logical
equivalences. For most of the languages, suppletive forms of this kind were
offered for at least one place in the paradigm, but there is a problem with some
languages, in that there were gaps in the paradigm and no suppletive forms
were offered. Yet it cannot be said with any certainty that suppletive forms
462 F.R. Palmer
could not have been used, for it is possible the native speaker simply did not
immediately think of them (and I carefully refrained from suggesting them).
There was only one language with no suppletive forms of this type —
Modern Greek, where all the forms except that for deontic 'not-necessary'
were regular (see Section 4).
Where suppletive forms were offered, they followed two main patterns:
(i) Suppletive forms are used for both types of negation of necessity (both
'not-necessary' and 'necessary-not'), possibility forms being used for both.
This is especially common with epistemic modality, as already illustrated for
English by (3) which is repeated here:
(3) He must be in his office
He can't be in his office
He may not be in his office
Similar forms were found for Assamese, Italian and one variety of
Welsh. Examples from the first two are:
(12) Assamese
ofis-ot thakibc lage
office-in to.be he.must
'He must be in the office'
ofis-ot na thakibc pare (possible-not → 'notnecessary')
office-in not to.be he.may
'He may not be in the office'
ofis-ot thakibo nware (not-possible →'necessary-not')
office-in to.be he.NEG+may
'He can't be in the office'
Italian
Deve essere nell ufficio
He.must be in.the office
'He must be in the office'
Pud non essere nelV ufficio (possible-not —»
He.can not be in.the office 'not-necessary')
'He may not be in the office'
Negation and the Modals of Possibility and Necessity 463
The most striking result of this exercise lies in the comparison of the forms for
deontic 'not-possible' (denial of permission) and 'necessary-not' (obligation
not to). In every one of the languages except Oromo, which, as was noted
earlier, barely has any system, and Tamil, the 'not-possible' form was regular
in that negation was formally marked on the modal. Some examples are:
(18) Tigrinya
Iətə 'atto 'aytəkə'ələn 'ika
that.you.come.in you.not.being.able you.are
'You cannot/may not come in'
Latvian
tu ne drīksti nākt ieksa
you not can come in
'You cannot/may not come in'
By contrast, in all the languages except two, the form used for 'neces
sary-not' was irregular. The possibilities were:
(i) In ten of them, Arabic, Assamese, Danish, English, Farsi, French,
Italian, Kinyarwanda, Tamil and Tigrinya, it is the modal that was formally
negated, i.e., there was misplacement of the negative (although in Italian and
Tamil the same form was used for 'not-necessary', while in Farsi the regular
form with negation on the full verb was also possible — and see Section 6 for
a discussion of Danish and English). Examples are (see also (1) for Italian):
(19) Arabic
la:zim jizi
must you.come.in
'You must come in'
ma:/mu laizim jizi
not must you.come.in
'You mustn't come in'
Kinyarwanda
agomba kwinjira
he.must to.come.in
'He must come in'
466 F.R. Palmer
ntagomba kwinjira
NEG+he.must come.in
'He mustn't come in'
The form used for expressing 'not-necessary' in these languages varied.
While Italian and Tamil (see above) used the same form, Kinyarwanda and
Assamese used rather different constructions, while Arabic, Danish, English
and Farsi employed a different verb and the others had no form that was
clearly modal.
(ii) The form given in five languages, Bahasa Malaysia, Chinese, He
brew, Kikuyu and Japanese, was the suppletive not-possible form, and this
was also given as an alternative to the form with misplaced negative for Italian
and Tamil. In all of these languages except Chinese and Japanese, negation on
the necessity modal had the regular 'not-necessary' sense (Chinese and Japa
nese had no necessity forms with negation, and used the suppletive 'possible-
not' for 'not-necessary'). Examples are:
(20) Bahasa Malaysia
dia mesti/perlu masuk
he/she must come.in
'He/she must come in'
dia tak mesti/perlu masuk
he/she not must come.in
'He/she needn't come in'
dia tak boleh masuk
he/she not can come.in
'He/she mustn't (can't/may not) come in'
Chinese
ta mín tjen bi ∫y lé
he tomorrow must come
'He must come tomorrow'
ta mín tjen kŏe i bu Jy Jé
he tomorrow can/may not come
'He needn't come tomorrow'
ta mín tjen bu kŏe i Jy Jé
he tomorrow not can/may come
'He mustn't come tomorrow'
Negation and the Modals of Possibility and Necessity 467
There is probably also similar suppletion in German (but see Section 7):
(21) Du musst herein kommen
You must in.here come
' You must come in'
Du musst nicht herein kommen
You must not in.here come
'You needn't come in'
Du darfst nicht herein kommen
You can/may not in.here come
'You mustn't come in'
The point here is that the verb dürfen (darfst) is probably best seen as a
possibility verb (though translated 'need' as well as 'can' by dictionaries),
used both in positive forms and as the lexically suppletive verb for mögen
(although können is also used):
(22) Du magst herein kommen
You can/may in.here come
'You can (may) come in'
Du darfst nicht herein kommen
Du kannst nicht herein kommen
You can/may not in.here come
'You can't (may not) come in'
This shows that the possibility verb dürfen is used suppletively to express
necessity, in place of müssen (not-possible to express 'necessary-not').
Welsh has another alternative. Instead of the suppletive 'not-possible'
form given in (17), the 'necessary' form may be used if followed by the
negative verb peidio, which is, slightly misleadingly perhaps, glossed as
'refrain' below:
(23) Rhaid i chi beidio â ddod yfory
Must to you refrain from to.come tomorrow'
'You mustn't come tomorrow'
The two languages in which negation is regular, with the negation of the
full verb giving the 'necessary-not' interpretation were Modern Greek and
one version of Welsh. The Greek example has already been given and dis-
468 F.R. Palmer
cussed. What is surprising is that it is the form used for 'not-necessary', rather
than the form for 'necessary-not', that is irregular in Greek. This is expressed
not by negation on the modal, but by a different, probably non-modal, verb, as
illustrated in (9), which is repeated here:
(9) den ine anangi na fiyis
not is necessary that you.leave
'You needn't leave'
den xriazete na fiyis
not is.needed that you.leave
'You needn't leave'
There is a problem with some of the Germanic languages in that it is not at all
certain that it can be determined whether it is the modal or the full verb that is
negated. In English, it is to be assumed that the negative is formally associated
with the modal, since it is generally cliticised. In that case, there is misplace
ment of the negative in You mustn't' come. If this were regular, it would have
the 'non-necessity' meaning, which is, in fact, expressed by needn't. The
position in Danish and German is less obvious. In Danish skulle is commonly
used for obligation (as well as mätte), but the negative forms, with beh0ve
used suppletively, are (Davidsen-Nielsen 1990: 200-1):
placed between the modal and the full verb, it may be that there is no way of
determining which of these two they formally negate.
In Danish, however, there is some evidence to suggest that the negative
belongs formally with the modal: in certain subordinate clauses it precedes the
modal. Examples are (Davidsen-Nielsen personal communication):
(25) Jeg mener at man ikke skal/behøver dømme
]I think that one not must pass judgment on
nogen så hurtigt
someone so quickly
'I think one mustn't/needn't pass judgment on someone that quickly'
I know of no similar evidence to decide the issue in German. Since,
however, negatives normally follow other verbs in German, it is not unreason
able to associate the negative formally with the modal.
There is, perhaps, a problem with English epistemic may not. this is not
normally cliticized, so that the negative is not so clearly associated formally
with the modal. Consistency with the other modals, however, would suggest
that it is the modal that is negated. In that case, however, the form is irregular,
since it is a not-possible form expressing 'possible-not;' moreover, it is the
only example from all the languages considered in which there is displace
ment of the negation with this meaning. It is tempting to suggest that it is
precisely because it has this meaning that the form is not cliticized, and so
suggests negation of the full verb (and regular negation of the proposition).
However, if that is so, one might expect (falsely) that deontic must + not
should also not permit cliticization, and so similarly indicate negation of the
full verb and of the proposition.
One solution is to say that in these languages it is not possible to signal
the contrast of negation of the modality and negation of the proposition by
differential placement of the negative (negating the modal and negating the
full verb): the negative formally belongs to the whole verbal complex, as in
English with other auxiliaries — with isn't coming and hasn't come. Moreo
ver, the contrast is almost always made by the use of suppletion (e.g., English
must/need, German müssen/dürfen, Danish skulle/beh0ve etc.; it may be
precisely because negative placement cannot signal the contrast that supple
tion is used. However, these languages can still be included within the general
typological framework, because, if the modality is taken to negate the whole
verbal phrase, the most natural interpretation would be 'It is not the case that it
470 F.R. Palmer
is possible/necessary ...;' this would be the same as with the regular marking
of negation of the modal. Judgments about regularity (and about suppletion)
are, then, still possible: it can still be said that the German must nicht,
expressing 'not-necessary', is regular, while the corresponding forms in Eng
lish and Danish (mustn't, skal ikke), expressing 'necessary-not' are irregular.
8. Possible explanations
Is there any explanation for the features discussed in Sections 6 and 7? The
issue of suppletion is, perhaps, easy to account for: economy is achieved by
using the logical equivalences. They are particularly appropriate with epis-
temic modality, where logic and language appear to coincide, but less appro
priate with deontic modality, where, for instance, 'not-possible' (denial of
permission) and 'necessary-not' (obligation not to) are not identical in mean
ing, even though both preclude action.
The comparative irregularity of the 'necessary-not' form is probably due
to two factors. The first is that this is a very commonly used form, and, in
particular, much more common than 'not-necessary.' The second is that
elsewhere in language the scope of negation is not accurately indicated by the
grammar, as shown by the example noted in Section 1,I don't think he 'll come
(= 'I think he won't come'). It is, perhaps, not wholly surprising if the form
that negates the modal is used for this purpose; it would emphasize the
essentially negative aspect of the prohibition. This is especially so if there is
only one formal way of associating the modal verb with negation, as in
English. By contrast, the 'not-possible' form is usually regular, but this too
may be accounted for by its frequent use, in that there would be no reason not
to use the most obviously available form.
REFERENCES
APPENDIX
The table below indicates the type of form offered for each modal type in the languages
investigated. The symbols used are:
■4 Regular
s Suppletive in terms of the possible/necessary equivalences
M Negation misplaced
D Different modal form
c Construction different, but modal form used
No form offered or form not in the modal system
Epistemic Deontic
Not-Pos Pos-Not Not-Nec Nec-Not Not-Pos Pos-Not Not-Nee Nec-Not
Arabic (Hijazi) - √ √ √ √ √ D M
Assamee √ √ s s √ - - M
Bahasa Malaysia √ √ √ √ √ s √ S
Chinese √ √ s s √ √ s S
Danish V s √ s √ s √ M
English D R s s √ s? D M
Farsi √ √ - √/R √ - D √/M
French √ √ s s √ √ - M
German √ s √/D s √ - √ S
Greek
Hebrew √ √ - √ √ √ √ s
Italian √ √ s R/S √ √ √ M/S
Japanes D s s/√ s S
Kikuyu √ s s s S
Kinyarwanda √ √ c R
Latvian √ c √
Luo √ D - - √ - √ -
Oromo - √ - - - - - -
Serbo-Croatian D √ √ s √ s √ D
Tamil √ s D D s D √ M/S
Tigrinya √ √ M
Welsh (i) √ √ s S √ S √
(ii) - - - √ √ S √ s
A Functional Theory of Complementizers*
Zygmunt Frajzyngier
Dept of Linguistics, University of Colorado
1. Introduction
affix whose function is to identify the entity as a complement. Such forms are
known as complementizers'. He further (p.47) lists among English comple
mentizers 'that', 'if', and 'to'. Palmer (1986) in an extensive discussion of
types of complement clauses, uses the term 'conjunction' for what other
linguists call 'complementizers'. He does not define the term, but in discuss
ing conjunctions in various languages he ascribes to them modal functions.
Ransom (1986:87-88) defines complementizers as 'semi-lexical forms that
occur within the complement, setting it off from the main clause and, at the
same time, signaling its modality'. She lists among English complementizers
'that', 'whether', 'for-to', and 'whether-to'. Givón (1990:552ff) refers to the
morphemes in question as 'subordinating —"separating"—morphemes', but
also uses the term 'complementizer' (p. 554) and describes their function as
participating 'in the coding of the event-integration scale'. The underlying
principle in these studies appears to be that for a morpheme to be classified as
complementizer, it has to occur in a specific syntactic position, viz. before the
embedded clause (cf. Bickerton 1981:109), regardless of the relative order of
the two clauses.
In the present paper I discuss morphemes that are usually analyzed, or
likely to be analyzed, as complementizers. I show that complementizers are
lexically separate modality markers that happen to occur in a specific syntac
tic position. As such they are just one of the means by which languages
encode modality.
Descriptions that take separating the main and the embedded clauses to
be the main function of complementizers cannot explain the occurrence of the
same morphemes in simple sentences or in the main clauses of complex
sentences, i.e., in syntactic environments that do not require or allow the
function of 'separation'. The analyses that postulate separation as the main
function of complementizers face the additional problem of motivating the
existence of a morpheme whose function is to 'separate' clauses, since there is
no possibility of clauses ever overlapping in time. It is not therefore clear what
the metaphor of 'separation' stands for. One could argue that the function of
complementizers is to mark a clause as a complement, but even that function
is not well justified in view of the fact that that there are many types of
complement clauses that do not require or even allow a complementizer, even
in languages that have complementizers. A third problem with such descrip
tions is that they do not account for morphemes that occur in positions other
than at the beginning of the embedded clause. These morphemes are, as will
476 Zygmunt Frajzyngier
3. Hypothesis
Even in theories which postulate that every sentence has a COMP, it is recog
nized that the COMP position may remain empty. Bresnan (1970), who first
proposed the rule S' —> COMP S, attempted to explain the empty COMP by
postulating that complementizers are also a part of the underlying structure of
simple sentences but that they are deleted in the surface structure. She associ-
478 Zygmunt Frajzyngier
ated declaratives with the deleted 'that' and interrogatives with the deleted
'WH' (Bresnan 1970:301). For an analysis that does not postulate COMP
deletion in the simple sentence see Emonds (1985:318), who identifies S' and
P' as COMP and P. In the main clause the underlying P is empty.
I explain the absence of complementizers in simple sentences by postu
lating two factors, both of which agree with the proposed hypothesis. One is
that indicative sentences convey what the speaker intends to be taken as his
belief. In other words, indicative sentences have an inherently defined epis-
temic modality. I do not defend this hypothesis here as I have argued for it
extensively elsewhere (Frajzyngier 1985 and 1987; but cf. Palmer 1987).
With respect to the deontic modality, I propose, that the indicative clause is
the unmarked case, i.e., it indicates neither obligation nor wish. The main
support for this analysis comes from the fact that obligation, wish, and all
other types of deontic modalities have to be marked by inflectional markers,
modal verbs, modal adverbs, or other devices that a language may have.
The reason why even non-indicative main clauses sometimes lack com
plementizers is that their modalities are coded by syntactic devices such as
word order, by morphological changes on the verb, or by phonological
devices such as intonation, vowel or consonant length. Thus the Gothic verb-
final -u, which marks the interrogative modality, would most probably not be
considered a complementizer, e.g.:
(1) skuld-u ist kaisaragild giban kaisara
'is it lawful to give tribute money to Caesar?' (Visser 1969, 3:1547)
The following examples illustrate the complementary distribution of comple
mentizers with other devices marking modality in the main clause. As indi
cated earlier, the interrogative 'whether' is considered by most authors a
complementizer in English:
(2) I asked whether she came
(3) I don't know whether she came
Polish czy is also a complementizer, as shown by the following examples:
(4) Spytalem czy przyszta
ask: 1SG:PERF:PAST COMP come:3F:sG:PERF:PAST
T asked whether she came'
A Functional Theory of Complementizers 479
Theoretically, a situation could arise in which the simple clause is marked for
more than one modality, and each of the modalities is encoded by a separate
morpheme. The previously cited examples (7) and (8) from Polish could be
considered instances of two complementizers occurring in a simple clause. In
example (7) the sequence of complementizers is a-by and in example (8) niech by.
The first complementizer encodes obligation and the second a hypothetical
mood. Compare also the following example where the sequence is o-by:1
A Functional Theory of Complementizers 481
(Central Chadic), which does not have inflectional devices and whose marker
of obligation does not occur at the beginning of the embedded clause. In all
three languages, complements of volitional verbs and volitional expressions
do not have markers of obligation if the subjects of the main and the embed
ded clauses are the same, e.g.:
(12) chce-my jesc
want-1PL eat:INF
'we want to eat' (Polish)
(13) chce jesc
want:3sG eat-INF
'he/she wants to eat'(Polish)
(14) Je veux le savoir
lSG want it know
'I want to know it' (French)
In Guider, one of the equivalents of the verb 'to want' is a periphrastic
construction with the literal meaning 'x's mouth is on . . . '. Such an
expression may be followed by an NP to realize the equivalent of 'X wants an
NP',e.g.:
(15) má-w á kà mbáy lá
mouth-lSG PREP on drink
'I want a drink'
Such an expression may be also followed by a clause, to realize the equivalent
o f ' X wants S', e.g.:
(16) má w á kà zəmá
mouth-lSG PREP on eat
'I want to eat'
(17) má-w á kà zəm hlúá
mouth-lSG PREP on eat meat
'I want to eat meat'
In Polish, if the subjects of the main and the embedded clauses are different,
then the embedded clause is marked for the deontic modality of obligation.
The marker is composed of the free complementizer ze and the enclitic by,
which has a hypothetical function, e.g.:
A Functional Theory of Complementizers 483
(20) m l g ní
1PL go OBLIG
'let us go'
(21) m mbát g ni
lPL depart OBLIG
'let us leave'
(22) ki d s w g, n wàyá
2SG cook:-OPT BEN 1SG OBLIG food
'you should cook for me'
Lele (East Chadic) codes obligation for first and third person and the impera
tive for second person through a suffix on the verb. Consequently, as pre
dicted by the hypotheses presented in this paper, the language should not have
complementizers marking these specific deontic functions. The prediction is
borne out by the facts. In the following sentences the modality of obligation in
the embedded clause is coded by the imperative form of the verb ír 'go'. Note
that the sentences have a de dicto complementizer ná since the main verb is a
verb of saying:
(25) ń-yá bé-gè ná gé ir-à
lSG-tell BEN-3PL COMP 3PL gO-IMPER
T told them to go'
(26) yà-di bú-dú ná ir-à
Say-3M BEN-3F COMP gO-IMPER
'he told her to go'
The doubt in truth marker sàn, a marker of epistemic modality, occurs in
clause-final position and therefore there is no sequence of two modality
markers, or complementizers, e.g.:
(27) ná-i né báy gō ségrè sàn
COMP-3M COP person DEM hunter DOUBT
'hej said that hej is a hunter' (it may be that he only pretends to be one)2
cf.
(28) cánígè ná-í nè báy gō ségrè
Canige COMP-3M COP person DEM hunter
'Canige j said that hej is a hunter'
Lele does not, however, have an inflectional form of the verb that indicates
hypothetical or irrealis mood, which in Indo-European languages often con
veys hearsay information. According to our prediction, in the absence of
inflectional markers, if such a mood were to be marked, a complementizer
would be one of the ways to realize it. In what follows I show that Lele indeed
has a complementizer whose function is to mark less than direct evidence.
A Functional Theory of Complementizers 485
7.2 Complementizer go
7.3 Complementizer ná
The complementizer ná (sometimes realized with mid rather than high tone,
cf. Palayer 1981) occurs after verbs of saying and thinking and indicates that
what follows is in the domain de dicto, e.g.:
(36) cánígè ná dá-ì kólôŋ nè báy gō ségrè
canige COMP 3M there COP man DEM hunt
'Canige said that he [one over there] is a hunter'
(37) kìrbí-dí ná-ì nè báy gō ségrè
think-3M COMP-3M COP man DEM hunt
'he i thinks that hei is a hunter'
Embedded interrogative clauses that follow the main verb tón 'ask' are also
introduced by the complementizer ná, e.g.:
(38) ŋ-tón ná é-jé gè wán gà
lSG-ask COMP come-PERF 3PL nevertheless INTERR
'I asked whether they nevertheless came'
(39) rj-tón ná bálè dí gō ôje'-ŋ gà
lSG-ask COMP be able 3M COMP help-lSG INTERR
'I asked whether he could help me'
(40) rj-tón gí ná gí bál gō oje'-rj gà
lSG-ask 2M COMP 2M be able COMP help-lsG INTERR
'I asked you whether you could help me'
(41) ŋ-tôn ná wéî bà bál gō oje'-rj gà
lSG-ask COMP who be able COMP help-lSG INTERR
'I asked who could help me'
A Functional Theory of Complementizers 487
The verb sèn 'to know' may be followed by the single complementizer gō, e.g.:
A Functional Theory of Complementizers 489
(59) sínà ná . . .
know COMP
'know that
The last piece of evidence I present here for the functions of ná and gō is the
fact that one cannot use both complementizers after a verb of perception. Thus
the following sentences are ungrammatical:
(60) *ŋ-gôl-í gō ná wàl-dí kûlbà
lSG-see-3M COMP COMP slaughter-3M cow
for 'Ï saw him slaughter a cow'
(61) *ŋ-gôl-i ná gō wàl-dí kúlbà
lSG-see-3M COMP COMP slaughter-3M cow
for T saw him slaughter a cow'
The ungrammaticality stems from the fact that the main verb of perception is
followed by the direct perception marker gō and also by the indirect percep
tion marker ná, creating an internal contradiction within the clause.
The importance of the data from Mupun lies in the fact that the language has a
clause-internal epistemic marker coding doubt in the truth of the proposition,
and inflectional markers coding imperative and obligation. According to the
proposed hypothesis, neither doubt in truth nor obligation should be realized
by complementizers. The prediction is fully borne out by the data. The
A Functional Theory of Complementizers 491
language also has two other modality markers, both occurring at the begin
ning of the clause: a prohibitive marker and an interrogative marker. Accord
ing to our predictions, such a marker may be preceded by a de dicto
complementizer, resulting in a sequence of two complementizers. The data
support this prediction as well. A discussion of the four cases follows.
The optative and imperative in Mupun are marked by low tone on subject
pronouns in both main and embedded clauses, e.g. (only tones on pronouns
are marked):
(67) mó grdp pupwap
3PL cut:PL fish
'they cut a fish into pieces'
(68) mo grdp
3PL cut:PL
'they should cut it into pieces'
Since the mood of obligation is marked by inflectional devices, there is, in
accord with our prediction, no complementizer marking the mood of obliga
tion. The embedded clause may occur without a complementizer or be pre
ceded by a de dicto complementizer nә, e.g.:
(69) dәm à sat mo grdp
go 2M tell 3PL cut:PL
'go and tell them to cut it into pieces'
(70) mo sat nd yì cin dik
3PL say COMP 2 F do marriage
'they said that you should marry'
cf.
(71) mo sat nd yì ein dik
3PL say COMP 2 F do marriage
'they said that you got married'
(72) mo sat nd à la mat
3PL say COMP 2M take wife
'they said that you should marry'
A Functional Theory of Complementizers 493
cf.
(73) mo sat nә á la mat
3PL say COMP 2M take wife
they said that you married'
8.4 Interrogatives
The importance of the discussion of the Polish data is as follows: There are no
clause-internal markers of epistemic modality nor of deontic modality for the
first and third person. In accordance with the hypothesis of the present paper
one can expect that such a modality may be expressed through modal verbs,
adverbs, or a complementizer. If the modality is expressed through a comple
mentizer one can expect that after verba dicendi, and possibly after verba
sentiendi, a sequence of at least two complementizers may occur, of which
one is a complementizer of verbs of saying and the other codes another
epistemic or deontic modality.
The Polish enclitic by, mentioned earlier in the paper, indicates a hypo
thetical or irrealis mood. It is clearly a modality marker and can be attached to
verbs, nouns, pronouns, adjectives, complementizers, adverbial clause mark
ers, and exclamations, e.g.:
(84) byt by-m w Warszawie gdy-by. . .
be-PERF HYP-ISG in Warsaw when-HYP
T would have been in Warsaw if. . .'
The enclitic by can be added to the morpheme ni to express the epistemic
modality of doubt in truth of the proposition; while by added to the morpheme
o encodes the deontic modality expressing wish. The resulting structures ni-by
and o-by occur in clause-initial position, i.e., in the position of complementiz
ers such as the interrogative czy, e.g.:
(85) Ni-by byt w Warszawie
COMP be:3M:sG:PERF PREP Warsaw:DAT
'Apparently he was in Warsaw'
496 Zygmunt Frajzyngier
11 Conclusions
ABBREVIATIONS
NOTES
* Work on this paper was supported by a grant from the National Endowment for the
Humanities, No. RT-21151-90. I would like to acknowledge the generous help of
colleagues, through conversations and correspondence. For the French data I am grateful
to Anne Abeille, Philippe Alcouffe, and Sylvie Bruno; for information on Quebecois
French and some bibliographic pointers I am grateful to Gary Coen, and for Dutch data
(not used in the present paper) and references I am grateful to Jan Odijk, J. Zwart, and
Gerrit Dimmendaal. I would like to thank Eric Atwell, University of Leeds, for checking
the LOB corpus for me for the occurrence of two complementizers in a sequence in
English (none were found, but cf. Caroll 1983). I gathered data on Lele, an East Chadic
language spoken in Chad, during fieldwork in North Cameroon in the summer of 1990.
I gathered data on Guider, a central Chadic language spoken in North Cameroon, in the
summers of 1990 and 1991. Data on Mupun, collected over a period of many years, are
taken from Frajzyngier 1991b and 1993.
I would like to thank Immanuel Barshi, Karen Ebert, Bernd Heine, Elizabeth
O'Dowd, David Pesetsky, Erin Shay and participants of the Symposium on Mood and
Modality, especially John Haiman, Gillian Sankoff, Eve Sweetser, and Werner Abraham,
for questions raised and comments on various aspects of the present paper. None of them
is, however, responsible for any shortcomings of this paper.
1 For a discussion of by in Russian see Brecht 1977.
2 In Lele, as in many Chadic languages, the verb 'to say' is often omitted. The only trace
of its presence is the complementizer.
REFERENCES
Joan L. Bybee
University of New Mexico
The English modal auxiliaries, would, should, might, and could are histori
cally the Past Tense forms of will, shall, may and can respectively.1 However,
their meaning and usage in Modern English are not derivable from the
combination of past meaning with the meaning of the Present modals. Rather
the uses of the Past Tense modals may be divided into three types: (i)
hypothetical uses; (ii) present tense uses; and (iii) past tense uses.
In Modern English the hypothetical uses are the most common according
to the analysis of spoken and written texts done by Coates (1983). They are
most obvious in the then-clauses of hypothetical conditional sentences. In
some cases the hypothetical meaning occurs along with what is sometimes
called the 'root' meaning of the modal. For instance in (1) would indicates
'hypothetical willingness' (Coates 1983: 211), while in (2) could indicates
'hypothetical ability':
(1) If you had that job lined up, would Fulbright then pay up?
(2) If you helped me, I could finish this in an hour.
In the example in (3), no additional root meaning is present, and the modal
conveys only the hypothetical conditional sense (Coates 1983:214):
(3) / mean we all want be to millionaires, but if we were of course
money wouldn't be worth anything.
The present tense uses of originally Past modals are illustrated in examples (4)
and (5) (from Coates 1983: 58, 152).
(4) You should walk round the ramparts of the old city too.
504 Joan L. Bybee
That should and would are originally past tense in meaning is clear from their
uses in Old English texts, such as Beowulf. There we find many clear in
stances of should signalling destiny, duty or obligation of the subject in the
past, corresponding to sceal, which has the same meaning in the present.
Consider the examples (9) through (11) and their translations, based on
Gordon 1926.
The Semantic Development of Past Tense Modals in English 505
(9) Nœs ßcet forma sid /þœt hit ellen-weorc œfnan scolde. (1464)
'That was not the first time that it (a sword) had to perform a deed
of valor.'
(10) ße he wid ßam wyrme gewegan sceolde. (2400)
'when he had to fight the dragon.'
(11) ßcet ßcet ðeodnes beam geßeon scolde. (910)
'that the king's son was destined to prosper.'
Similarly, wolde is used almost exclusively to signal volition of the subject in
the past, just as wylle was used to express volition in the present. Consider the
examples in (12) through (14). Example (13) refers to a time that is past with
reference to the time of the narrative, which is also past.
(12) wolde self cyning symbel ßicgan. (1010)
'The king himself wished to join in the banquet.'
(13) donne sweorda gelac sunu Healfdenes efnan wolde; (1040-41)
'when the son of Healfdene wanted to practice sword-play.'
(14) woldewig-fruma Wealhßeo secan, cwen to gebeddan. (664-5)
'the war-leader wanted to look for the queen to bed down with.'
By the Middle English period, however, both should and would had made
their way into present contexts, especially with first and second person.
Consider first wolde, which is especially prominent in present usage in Sir
Gawain and the Green Knight, where it can express present volition or
willingness. Consider the following examples:
(15) "I wolde yowre wylnyng worche at my myzt..." (1546)
'"I am willing to do your desire as far as I can...'"
(16) "'Wher is', he sayd, "The gouernour of this gyng? Gladly I wolde
Se that segg in syzt, and with hymself speke raysoun." (224-6)
"Where is', he said, "The lord of this company? I would gladly
See that knight and speak reason with him.'"
To explain the present uses of Past forms, we must refer to the special
properties of modal verbs. Modal verbs, whether they express desire, obliga
tion, necessity, intention or ability, have in common the semantic property
that they do not imply the completion of the action or event expressed by the
infinitive with which they occur. (This property is in contrast with other
complement-taking verbs such as finish and manage, which do imply comple
tion [Givón 1973].) In the present tense, however, these modal verbs do imply
506 Joan L. Bybee
a close relation between the agent and the main predicate, which may imply or
predict a future completion of the event or activity of the main predicate.
Especially in first person, present tense modal verbs such as Modern English /
wanna, I'm gonna, I hafta or / can are used to state intentions, make offers and
promises which our interlocutors can expect to be carried out. The same verbs
in the third person are used to report intentions, offers and promises. Simi
larly, I will and I shall in the Gawain text are used to make promises, state
intentions and resolutions, which one trusts will be carried out.
The situation with these same modal verbs in the past tense is quite
different, however. The relation between the agent and the main predicate is
much more tenuous. He/I wanted to for instance may be used both in contexts
in which the desire was carried out and in contexts in which the desire was not
carried out. As stative verbs, the past forms of modals assert that a state
existed before the moment of speech, but they do not say whether that state
still exists in the present or not. Thus past modals offer two areas of vague
ness: (i) whether or not the predicate event was completed; and (ii) whether or
not the modality remains in effect.
A modal in past time, then, leaves open the possibility that some condi
tions on the completion of the main event were not met, and therefore the
modality may still be in effect. That is why when we say, for example,
(17) / wanted to help you.
in a situation where I didn't, it means that there were some conditions that
were not met: something was standing in my way. Interestingly, we can also
say I wanted to help you in a situation in which you still need help, but I do not
intend to help you because something IS standing in my way. In other words,
the existence of PAST blocking conditions is generalized to include the exist
ence of PRESENT blocking conditions.
Note that the use of a Past Tense in a conditional sentence results in a
hypothetical conditional. The hypothetical conditional is contrasted with the
reality conditional which uses Present tense or a Future auxiliary:
(18) Hypothetical conditional:
If she saw Judy, she would tell her the news.
(19) Reality conditional :
If I see Judy, I will tell her the news.
The difference between the two is that the hypothetical conditional implies
The Semantic Development of Past Tense Modals in English 507
that it is considerably less likely that the condition will still be met. The
difference in likelihood originates from the tense distinction in the following
way: The past in both clauses of the hypothetical conditional sentence signals
that the conditional relation held in past time. The modality in the then-clause
signals that the relation is still in effect. (Cf. If Mike saw her, he told her the
news.) On the other hand, the reality condition with Present tense means that
the conditional relation begins in present time and projects into the future. The
reason that the one framed in the past seems more hypothetical is that the
conditional relation has existed in the past but the condition has not yet been
met, suggesting that it may never be met.
This description of the origins of hypotheticality for past tense modals
predicts that the uses of hypothetical conditionals that span past and present
time are more basic and historically prior to those that are set in the future. As
Suzanne Fleischman (personal communication) has pointed out, a hypotheti
cal conditional may also refer to the future: If in the next year the interests
rates went down again, I would definitely consider refinancing. No uses of
this sort are found in the texts examined in the study reported below, suggest
ing that this use is a less common extension of the hypothetical use derived in
the way described above.
The if-clause of a hypothetical conditional always contains a past form,
and the past modals show up here as well. However, it is their use in the main
clause that is more important for explaining how they come to be used in
present contexts.
It is interesting that the then-clause of the hypothetical may take other
modal verbs besides would, such as those shown in (20), even though some of
them are not very highly grammaticized:
(20) I wanted to
I was gonna
If I saw Judy, I was supposed to tell her the news.
I intended to
I might
The Past tenses of all these modal verbs behave in a similar manner: They
allow the interpretation that the modality is still in effect, and the predicate
action will be carried out if the right conditions are met.
For this reason it seems appropriate to interpret Modern English wanted
to as an analogue to wolde in the Late Old English and Middle English period.
508 Joan L. Bybee
(24) And I wolde loke on that lede, if God me let wolde. (1063)
'And I want to see that knight, if God would let me.'
(25) Bot wolde ze, lady louely, then leve me grante... (1218)
I wolde boze of this bed, and busk me better; (1220)
'But if you would, lovely lady, grant me leave...
I would get out of bed and dress myself better.'
The conclusion, then, is that the hypothetical and the present uses of Past
modals are basically the same in semantic content. With the present uses of
wolde, however, the conditions are not stated, they are only implied.
Turning now to schulde in Middle English we find a similar situation.
Shall refers to what is to be. The deontic source may vary: it may be divine
destiny, social obligation or mutual arrangement. Schulde refers to what was
to be and carries the same vagueness of implication as the other modalities:
the action may or may not have been completed; the modality may or may not
still be in effect. In Middle English we find many uses of schulde to refer to
what was to take place, without any implication that it did take place:
(26) ßere watz much derue doel driuen in the sale
That so worthé, as Wawan schulde wende on that ernde...(558-9)
There was great lamenting made in the hall
That so worthy as Gawain was to go on that errand.'
(27) And went on his way with his wyze one,
ßat schulde teche hym to tourne to pat tene place...(2074-5)
'And (he) went on his way with the man
That was to show him how to get to that perilous place.'
As with wolde, all the conditions necessary for the completion of the main
predicate may not be met, so the use of schulde is appropriate in a hypothetical
conditional.
(28) 'For were I worth all fie wone of wymmen alyue... (1269)
fier schulde no freke vpon folde bifore yow be chosen. ' (1275)
'"For if I were worth all the host of women alive...
No man upon earth would be chosen before you.'"
The fact that a hypothetical conditional relation still holds in the present
makes the few present uses of schulde found in Gawain possible. In (29) I
assume the condition to be 'if I looked for you.'
510 Joan L. Bybee
(29) "Where schulde I wale ße, " quoth Gauan, "where is ßy place?"
(398)
'"Where shall I find thee," said Gawain, "where is thy place?'"
(30) And quy ße pentangel apendez to ßat prynce noble
I am in tent yow to telle, ßof tary hyt me schulde: (623-4)
(Author speaking)
'And why the pentangle belongs to that noble prince
I intend to tell you, though it (should) slow my tale.'
(31) "paz I hade nozt of yourez,
zet schulde ze haue of myne." (1815-6)
'"Though I have nothing of yours,
Yet you should have something of mine.'"
One very interesting example, (32), refers to an obligation established in the
past, which is being met in the present. This example shows clearly that the
use of a Past modal may imply the continuation of the modality into present
time.
(32) "At pis tyme twelmonyth ßou toke ßat ße failed,
And I schulde at zis Nwe zere zeply ze quyte. " (2243-4)
'"At this time a year ago you took what fell your lot,
And I am/was obliged at this New Year to promptly repay you.'"
In an embedded or hypothetical context, the original lexical meaning of
obligation or destiny has weakened considerably even in Middle English, so
that it contributes little additional meaning:
(33) Quat! hit clatered in Pe clyff, as hit cleue schulde. (2201)
'Quat! It (a noise) clattered on the cliff as if it would cleave it.'
(34) And he asoyled hym surely and sette hym so clene
As domezday schulde haf been dizt on the morn. (1883-4)
'And he absolved him surely and made him so clean
As (if) doomsday should have been the following morn.'
A very interesting point to note about the distribution of schulde in the
Gawain text is that all of the past uses save one are in embedded clauses — in
complements, relative clauses, as if oriƒclauses — where the main clause is
marked for Past tense. The one exception is where schulde occurs in a main
clause in the main past narrative line, but in this case it has haf with it:
The Semantic Development of Past Tense Modals in English 511
In this Middle English text, schulde never has to convey a past sense by itself.
It was compatible with a past context, but to convey pastness, it had to be
bolstered with have.
If we continue to follow should and would into Early Modern English, and
consider a Shakespearean play such as The Merchant of Venice, we find that
the trend toward using should and would in present and hypothetical contexts
has continued to the point where both modals appear in these contexts almost
to the exclusion of past contexts. 3 Consider Table 1.
The present uses continue to be much the same for would although they
grow in frequency. They occur mainly in the first person singular and mean T
want' or T would like'. The present uses of should include the first singular
question, as in (36), the second singular statement of obligation, as in (37),
and a new development, the occurrence of should in complement clauses
where no past time is signalled or implied, as in (38).
(36) What should I say to you ? Should I not say
"Hath a dog money? ..." (1.2.115-6)
(37) You should in all sense be much bound to him. (V.l. 136)
(38) You teach me how a beggar should be answer'd. (IV. 1.436)
It will be recalled that the hypothetical uses of modals may involve a
retention of some of their lexical meaning, such as hypothetical obligation and
hypothetical willingness. Thus in Gawain we find examples of would which
signal hypothetical or conditional willingness, as examples (15) and (24).
There is also an example in Gawain which is devoid of any volitional meaning
and might be characterized as a conditional prediction:
rarely in if-clauses and then only with volitional meaning. Thus should has
come to signal pure conditional or hypothetical in conditional sentences,
while would maintains some of its volitional content in first person and if-
clauses.
The difference between Elizabethan and Modern English with regard to
these modals is that would is replacing should in the first person in then-
clauses, and has in fact already done so in American English, and would is
now appearing in //"-clauses with only hypothetical meaning, where formerly
it conveyed willingness. 6
As for the present uses, my proposal is that the Past modals come to be
used in the present via the hypothetical conditional: originally the present uses
are hypothetical then-clauses without the conditions stated explicitly. As
should illustrates, however, the conditional sense can be lost. The resulting
present tense modal expresses obligation, as shall and should originally did,
but in a weakened sense. The hypothetical meaning has had the effect of
weakening the force of the obligation, through the implication that there are
outstanding conditions. Thus modern should expresses weak obligation. This
may be seen by comparing it to the stronger have to, in examples (44).
(44) I should mail this today, vs. I have to mail this today.
The other English Past tense modals, could and might, when used in a present
context, also express a meaning that is weaker than their Present tense
counterparts, can and may.1 If Past modals evolve universally into present
uses in the way proposed here, they will always express a meaning that is
similar to their Present counterparts, but weakened by hypotheticality.
3. Implications
These facts about the gradual development of the hypothetical and present
senses of Past modals and their loss of past meaning have a number of
implications for the synchronic and diachronic analysis of Past tense modals.
In particular, we have seen that it is the combination of the modal sense and
the past sense that produces the hypothetical reading, and not past alone, nor
the modality alone. Moreover, the hypothetical sense replaces the past sense
over time. Therefore a synchronic analysis of Modern English such as the
classical Chomskyan one (Chomsky 1957), which derives would and should
514 Joan L. Bybee
by combining Past Tense with will and shall cannot be justified. The case for
should is the clearest, since should no longer has any past readings at all and
has needed have added to it to attain a past reading ever since the Middle
English period. 8
A case can also be made for would not containing any past sense, except
for its past habitual use. 9 Note that the past uses of would in the corpus of
spoken Modern British English examined by Coates represent only 16% of
the total, with 82% of the uses being hypothetical. Furthermore, the small
percentage of past uses occur in past contexts in which would does not on its
own signal past time. I would argue, then, that would agrees with or is
compatible with a past context, but does not contain as part of its own
semantics a reference to past time. The gradual semantic changes that take
place among the English Past tense modals are similar, but do not affect all the
modals at the same time. Despite having some syntactic properties demon
strating classhood, each modal verb follows its own path and timetable of
development (Bybee 1986).
There are also consequences for the analysis of the meaning of past tense
morphemes. It has been suggested, for instance by Steele (1975), Waugh
(1975), Langacker (1978) and James (1982) that in languages in which a past
tense is used as a hypothetical that the past tense actually means 'remote from
present reality' rather than 'preceding the moment of speech'. James points
out a problem with this approach, which is that in normal contexts, the past
tense, say in English or French, continues to mean 'before the moment of
speech' and cannot be interpreted as merely 'remote' or 'distal'. That is, there
is no justification for claiming that the normal sense of a past morpheme has
become more general and no longer refers to past time just because in some
contexts it means hypothetical.
What all of these authors have failed to stress is that it is not the past tense
alone that is contributing the hypothetical meaning, but rather the past in
combination with a modal verb, a subjunctive mood, a hypothetical marker
(such as if), or, in some cases, the imperfective aspect (Fleischman, this
volume). Moreover, I have argued that the diachronic progression shows that
it is not the 'remoteness' of the past that leads to the hypothetical sense, but
rather the fact that past combined with modality leaves open the possibility
that certain conditions on the completion of the predicate have not been met.
Thus there is no justification for claiming that the meaning of past tense in
some languages is 'removed from present reality'.
The Semantic Development of Past Tense Modals in English 515
NOTES
1. An earlier version of this paper was presented at the Eighth International Conference on
Historical Linguistics in Lille, France, August 1987, and appeared in the Buffalo
Working Papers in Linguistics, A special issue for Paul Garvin, 1990, 13-30. Special
thanks are due to Suzanne Fleischman, William Pagliuca and the participants in the
Mood and Modality Symposium for discussion of ideas contained herein. This work was
supported by a fellowship from the John Simon Guggenheim Memorial Foundation.
2. Wolde also occurs in the if-clauses in these examples. Here it means 'want' or 'be
willing' in the past.
3. The Merchant of Venice is written partly in poetry and partly in prose, making it more
appropriate than one of the plays written entirely in verse. Of course, we cannot assume
that Shakespeare's texts give a true representation of spoken Early Modern English. At
the time, however, they apparently sounded natural enough for audiences to accept them,
and moreover, they are the best evidence we have concerning spoken English of that
period.
4. For the Gawain text, the numbers for wolde include nolde (ne + wolde).
5. The other uses of would in The Merchant of Venice are three generic uses in phrases such
as 'as one would say' and four uses to mean 'I wish', in which the complement clause
has a different subject than the main clause.
6. In American English, examples such as the following, where would is purely hypotheti
cal and conveys no sense of willingness, are frequently heard in colloquial speech:
(i) If I would see her, I would tell her.
1. Must is also considered Past tense in some analyses. However, its strong meaning
suggests that it might derive from the Second Singular Present form, moste.
8. In American English was supposed to conveys past obligation (I was supposed to go
yesterday) and should does not (*/ should go yesterday). Even should with have has
specialized to counter-factual obligation (/ should have gone yesterday).
9. The past habitual uses of would date from the Old English period and develop independ
ently of the hypothetical uses.
The Semantic Development of Past Tense Modals in English 517
REFERENCES
Suzanne Fleischman
University of California, Berkely
1. Introduction
Before proceeding with my analysis, and in light of the fuzziness that comes
to surround grammatical categories once we start comparing them across
languages, let me propose some working definitions of the macro-categories
at issue here, imperfective and irrealis.
These various avatars of irrealis will become clearer as they are illustrated in
the paragraphs that follow. But before proceeding to the analysis, a point of
clarification is in order concerning the basis for my classification of a context
or usage as irrealis.
Imperfectie e and Irrealis 523
4. Motion-derived futures
(11) a. J: ... Pretend I was moving this up and down and up and down
[moves appendage on a doll] [...]
Z: And pretend somebody was iceskating on the rink
a. Sp. Yo era [IMP] el jefe de la banda y tu eras [IMP] el hermano
'(Pretend) I was the leader of the gang and you were my brother'
c. Fr. Moi, j'étais [IMP] le gendarme, et tu me volais [IMP] mon vélo
'(Pretend) I was the policeman and you were stealing my bike'
The key word here is 'make-believe': for in effect, the imperfective verbs
serve to locate the situations they refer to in the realm of fantasy, of fictional
activity. 12 Interesting to note in this connection is a link various investigators
(Le Goffic 1986, Cappello 1986, Adam 1992) have posited between the
French Imparfait and fictionality, particularly as this form is used in the
protasis of unreal conditionals.
In a thought-provoking discussion, Adam (1992) proposes that the
Imparfait should be viewed not so much as a tense of the past but as an
"operator of fictionality." He bases this proposal on the modal uses of imper
fective pasts that are of concern in the present inquiry. 13 Cappello (1986)
refers similarly to the Imparfait as "the fictional analogue of the Present
Indicative," 14 a tense which is likewise pressed into service to express non-
temporal meanings, including meanings that come under the heading of
conditional or imaginary. (The present, we recall, is commonly found in the
protasis of real, or open, conditions and is also the tense most frequently used
across languages for generic or gnomic statements.) Seeking out a common
property that will explain why the Present and the Imparfait in French can
express non-temporal (i.e., modal) meanings, whereas the Passé Simple can
not, Le Goffic (1986) points to their shared imperfective — or as he calls it,
non-completive — aspect.15 It is this property, he argues, that explains,
moreover, why the imperfective past (the Imparfait) and not its perfective
counterpart (the Passé Simple) is chosen as the vehicle for free indirect
discourse (see Section 9).
6. Politeness forms
the speaker refuses to take seriously the words she utters. She says, in effect,
for the benefit of an adult who is or might be listening — or for her own
benefit, out of basic human self-respect: "Pay attention! What I'm saying is
not serious. This is a game; this is fiction."22
Through this usage, then, the adult speaker distances herself from the
words of her own utterance, acknowledging that they are not cast in the
conventional adult forms. She marks the utterance in question as an inauthentic
token of adult discourse. In line with Gougenheim's statement, Wilmet (1968-
69:105) points out that the hypocoristic Imperfect is always a 'fiduciary'
form: its use rests on an understood linguistic contract among speakers, from
which young children — and, obviously, animals — are excluded inasmuch
as they do not qualify as full-fledged interlocutors. In sum, the hypocoristic
imperfect operates as a kind of metalinguistic evidential signaling the non-
authenticity or non-canonicity of the speech situation itself — a usage I
include under the last category of irrealis meaning given in (4).
Commenting on the hypocoristic Imperfect, Martin (1971:102) describes
it as creating a world "centered around the child or pet addressee," "a reality in
which the [adult] speaker is not really immersed." This brings us to the use of
imperfective aspect to signal the irrealis status of represented worlds.
egory of the definition of irrealis given in (4) —, are routinely told in the
present tense. Any speaker of English immediately recognizes a story that
starts out "This guy walks into a bar and says to the bartender..." as a joke,
whereas the same information reported in the past tense might be construed,
initially at least, as a narration of realis events. Both the imperfect and the
present are selected across languages as the basic reporting tense of genres
that refer to a legendary or mythical past, e.g. the Sanskrit Rgvedas, the Tamil
Puranic stories, and epic universally (see Fleischman 1990:123-24).
The imperfective aspect of the present and the imperfect23enables these
tenses to represent past experience as if it were in the process of occurring,
unmediated by the post-hoc reflective consciousness of a reporting speaker. 24
Thus in many languages (e.g., Spanish, French, Italian, Rumanian, Dutch)
these two tenses are used to report the contents of dreams, hallucinations, and
other semi-conscious states, as seen in (2) above, repeated for convenience as
(18):
(18) Now Winder was wanting to know what the hell the stage was
doing on the pass at night anyway. (Walter Van Tilburg Clark,
The Ox-Bow Incident, p.126)
This statement occurs at a point in the story just after the (first-person)
narrator has been shot by a stagecoach guard. As Chafe (1994:242) observes,
Winder's statement would normally have been perceived as a punctual event,
but it entered the narrator's impaired consciousness "as if he were tuning in on
part of a speech event that was diffuse and out of focus." What is captured by
the imperfective aspect of the main-clause predicate is the semi-conscious
state of the narrator, specifically his inability of focus in on particular events.
In the best known of medieval French epics, The Song of Roland, the very
few Imperfects that occur at all are concentrated in the dream visions. 25 The
passages given in (19) and (20), though taken from literary fiction, are
representative of the reporting style used to narrate the contents of dreams and
fantasies. The passage in (19), from a Spanish novel of the 19th century,
reports a dream of one of the characters:
(19) Oía [IMP] el reloj de la catedral dando las nueve; veía [IMP] con
júbilo a la criada anciana durmiendo con beatífico sueño, salía
[IMP] del cuarto muy despacito para no hacer ruido; bajaba [IMP]
la escalera tan suavemente que no movía un pié [IMP] hasta no
estar segura de poder evitar el más ligero ruido. Salía [IMP] a la
Imperfective and Irrealis 531
The hierarchy given in (21) below, adapted from Haiman (1989), lists several
strategies for reporting speech, ordered according to their ostensible degree of
fidelity to the original speaker's utterance, 28 a parameter which lends itself to
interpretation along a realis/irrealis cline.
(21) Speech and thought reports: hierarchy of attenuation
a. DIRECT QUOTATION: She said/thought: "What the hell are you
doing here in my hotel room ?"
b. SEMI-INDIRECT QUOTATION (including free indirect discourse):
What the hell was he doing here in her hotel room ?
c. INDIRECT QUOTATION: She wondered what he was doing in her
hotel room.
d. EVIDENTIALS (including quotatives): He is said to have been/
was reportedly found in her hotel room (adapted from Haiman
1989:149; examples mine)
These are not the only categories of speech and thought representation, but
they will suffice for our purpose. 29
9.1. The 'framework' 30 of the original speaker is best represented (i.e., the
representation is maximally realis) in the case of direct speech, where, accord
ing to the quotation convention (see n.32), the original speaker's words, at
least, are reproduced verbatim. As we move down the hierarchy, the quoted
speaker's framework becomes progressively more attenuated, i.e. more re
moved from the reality of the original utterance, until finally, in the case of
quotative evidentials, the original speaker's very identity becomes immaterial
(Haiman 1989:149). The quotative evidential examples given in (22), from
newspaper journalism in French, use forms of the Conditional, which is
formally and historically imperfective:
Imperfective and Irrealis 533
9.3. The quotation style illustrated in (21b) has been variously labeled 'style
indirect libre', 'erlebte Rede', 'represented speech and thought', 'narrated
monologue', 'semi-indirect quotation', or 'free indirect discourse'. Addi
tional examples are given in (23) and (24), from French and Italian respec
tively (sentences of free indirect discourse are bolded):
(23) Jamais Frédéric n'avait été [PLUPERF] plus loin du manage.
D'ailleurs, Mlle Roques lui semblait [IMP] une petite personne
assez ridicule. Quelle différence avec une femme comme Mme
Dambreuse! Un bien autre avenir lui était réservé! [IMP] Il en
avait [IMP] la certitude aujourd'hui. (Flaubert, Sentimental Educa
tion, Part 3, Chapter 2, p. 350)
534 Suzanne Fleischman
called for), but when what is being reported on is rather how the speaker feels
about something. Imperfective verba dicendi formally release the quoting
speaker from a commitment to the exact replication of what the original
speaker said — this is in line with category (c) of the characterization of
irrealis given in (4) — and provides instead "a characterization that reflects
the quoted speaker's feelings" (Bybee, p.c.).
Both of the above interpretations of imperfective inquit formulas (ha
bitual and continous) are plausible. And both motivate the use of verbs with
imperfective aspect to mark this style of quotation as irrealis, in the sense of
being an inexact representation of a quoted speaker's utterance(s).
Commenting on Bulgarian forms that can mark either the modal Condi
tional or the aspectual Iterative, Aronson (1977:15) sees the connection as so
close and so widespread as to constitute a potential universal. In support of
this position he offers the semantically equivalent pairs of sentences given in
(28), from English, Serbocroatian, and Hebrew respectively:
(28) a. Eng. He would play golf every day when [ITER] / if
[COND] he lived in Chicago
b. Serbobcr. Svakog bi dana igrao golf dokje [ITER] / kadbi
[COND] zivevo u Cikagu
c. Heb. Hu haya msoxek golf yom-yom kaašer [ITER] / ilu
[COND] gar bSikago
For each pair of examples, the main clause is identical; only the particle in the
subordinate clause tells us whether the sentence is iterative/habitual or condi
tional (cf. also Galton 1987).
Lazard (1975) cites parallel evidence from a spectrum of ancient and
modern Indo-European languages in which a single morpheme expresses both
past habitual and irrealis/conditional (and in certain cases also future-of-the
past, analogous to Eng. would)43. He points out (pp.225f.) that where a
generalized imperfect is used to express certain irrealis meanings, it is the
habitual meaning, among the various possible meanings a durative past can be
used to express, that specifically correlates with irrealis. He sees this affinity
as particularly striking in that it is realized by a diverse range of morphologi
cal structures (the data he cites have nothing in common from the standpoint
of form, their only commonality being their shared functions). Even in geneti
cally related languages, the respective developments are apparently independ
ent of one another — what Sapir referred to as 'drift'. Seeking to explain this
affinity, Chung & Timberlake (1985:221) observe that if habitual subevents
are indefinite both in number and in time (cf. n.42 above), they can also be
viewed as extending over possible worlds.
In the grammatical environment under consideration here, the apodosis
of unreal conditionals, a number of investigators (Lazard 1975, James 1982,
Restan 1989, Dahl Forthcoming) have noted that in languages that distinguish
perfective and imperfective aspect, what we invariably find is the imperfec-
tive past or one of its subtypes (progressive or, in particular, habitual), and not
the perfective form that the context would predict (given that the main-clause
Imperfective and Irrealis 539
verb most commonly denotes a single completed act). On the basis of broad
cross-language investigation, Dahl speculates about a general tendency to use
imperfective aspect in counterfactual constructions.
Seeking to account for why many languages use past tenses — specifically
imperfective pasts — to express the hypothetical, James (1982) points to the
shared semantic feature of non-completion. That is, imperfective and hypo
thetical both present predicated situations as unrealized or not fully realized at
the time of reference. 44 An alternative explanation, not incompatible with the
non-completion hypothesis, involves the discourse notion of backgrounding.
The correlation between imperfective aspect and backgrounding, first
proposed by Weinrich (1964/1973) as an alternative to traditional interpreta
tions of the Imparfait/Passé Simple opposition in French, became popularized
among linguists through several papers by Hopper from the late '70s and early
'80s (notably Hopper 1979, 1981). By now it has become a commonplace in
the discourse literature. Background information, Hopper (1981:238) ob
serves, involves a reduced assertion of the finite reality of an event. It is not
involved in "the asserting of events in the story line, but makes statements that
are CONTINGENT [his emphasis] and dependent on story line events. Typically,
therefore, one finds in backgrounding those forms associated with a lower
degree of assertiveness and even forms designated as irrealis" (1981:215, my
emphasis). So if imperfective aspect is indeed associated in discourse with a
reduced degree of assertiveness, then, as James observes, this would provide a
natural explanation for why many languages choose it over perfective aspect
to encode the hypothetical, since hypothetical statements likewise do not
assert the truth of their propositions. Or, as Bybee, Perkins, and Pagliuca
(1994:239) argue, mood is not about truth values, but about the speaker's
choice between assertion and contrasting functions.
The foregrounding and backgrounding of information in discourse like
wise correlates with the visual notion of focus, which, as I suggested in the
introduction, provides a well motivated link between imperfective and
irrealis. Simply put, information a speaker chooses to foreground is presented
as in focus, background information as defocused, or out of focus. 45 In a now
540 Suzanne Fleischman
classic paper on transitivity, Hopper & Thompson (1980) make the case for a
language-universal correlation between aspect and focus. 46 Their findings
have subsequently been confirmed by a number of studies on narrative show
ing the focus and non-focus roles of perfective and imperfective aspects
respectively.
In an illuminating paper contrasting the Preterit and Imperfect in Span
ish, Lunn (1985) distinguishes these two categories of the Spanish past system
in terms of the conferring vs. witholding of focus. There are two points of
view, she notes, from which things are not in focus. On the one hand, if an
observer is at a point from which foreground is in focus, then that observer is
too far away for background to be in focus. This would explain the conven
tional connection between imperfective aspect and background. On the other
hand, imperfective forms are also used to describe situations that are too close
— physically and/or psychologically — to be sharply in focus. Thus Lunn
explains novelist Juan Rulfo's use of the Imperfect to render the obsessive
fantasy life of his protagonist Pedro Páramo, offering examples analogous to
those cited in (18)-(20) above. The same out-of-focus perspective is strikingly
conveyed in the passage in (29) below from Robbe-Grillet's novel Jealousy,
which relies on a series of Present tenses (bolded in the example) to report the
disjunctive perceptions of a jealous husband as he peers through the Venetian
blinds at his wife and her surroundings:
12. Conclusion
NOTES
tense, aspect, and modality are clearly distinct categories at the theoretical level, in
practice (i.e., in actual usage) they cannot always be teased apart and studied in isolation
from one another.
6 In languages that contrast perfective and imperfective aspect, this opposition may not be
overtly expressed across the tense spectrum (in languages that have tense). Where this is
the case, the opposition is explicitly (i.e., morphologically) expressed most often in the
past. Thus Dahl (1985:92) observes that whereas progressives tend to appear in all tenses
(in a language that has tense), general imperfectives tend to be limited to past tenses.
7 'Situation' will be used here as a cover term for the predicate classes traditionally
referred to as events, processes, and states, or, according to the now familiar typology
originally proposed by Vendler (1967), as states, activities, achievements, and accom
plishments.
8 Trask adds (p. 147) that Palmer, in Mood and Modality (1986), recommends avoiding
the term irrealis altogether in linguistic theory on grounds that it corresponds to "no
consistent linguistic content." Like Chafe (this volume), I have searched Mood and
Modality to corroborate this statement but was unable to locate it.
9 The French and Spanish Conditional forms (which double as Futures-of-the-Past)
evolved from a Latin periphrasis combining Past Imperfective forms of 'have' with an
infinitive: scribere habeba(m) (lit. to-write I-had [IMP]) > Fr. écrirais/Sp. escribiría.
Standard Italian scriverei departs from this common Romance pattern in having chosen
the Perfective form of the 'have' auxiliary, though both northern and especially southern
Italian dialects show forms traceable to the Imperfective periphrasis as well as hybrid
paradigms containing forms historically Imperfective alongside forms historically Per
fective (see Rohlfs 1968:§§593-599).
10 Historically, this form began life as a Pluperfect Indicative. On this frequently analyzed
development, see Lunn & Cravens (1991), Klein-Andreu (1991).
11 Among many commentaries on these constructions, see Givón (1973), Fleischman
(1982), Bybee & Pagliuca (1987), Lichtenberk (1991), Bourdin (1992) and Forthcoming.
12 An alternative explanation, suggested by Oesten Dahl (p.c.), is that the verb form
children select in this context will be whatever form their language happens to use in the
protasis of unreal conditionals, e.g., the Conditional in Finnish, the Subjunctive in
German, etc. This usage, in other words, would represent a special type of truncated,
mono-clausal conditional. Undermining this explanation, however, is the situation in
Spanish. Standard Spanish uses the Imperfect Subjunctive in the protasis of unreal
conditionals, as in (5), yet Spanish children use the Imperfect Indicative in 'pre-ludic'
contexts.
13 A less radical statement of this position, formulated with respect to Italian, is that of
Bertinetto (1986:390), who argues that in the modal uses of the Imperfetto the meaning
of temporal pastness is, if not neutralized, at least attentuated to the point of conveying
'non-actuality' or simply 'virtuality'. Cf. also Bazzanella (1987).
14 In his words, "comme le conditionnel, qui prend souvent des valeurs de 'futur fictionnel'
(ou hypothéthique), l'imparfait de l'indicatif semble pouvoir assumer dans certains
emplois non temporels une valeur de présent fictionnel" (38f.).
15 Commenting on hypotheticality in Slavic, Restan (1989) similarly suggests 'incomplete
ness' as the common feature linking this modality to imperfective past forms in many
Imperfective and Irrealis 545
32 Similarly Tannen (1986). I agree with this statement in theory. Nonetheless, and as
Chafe acknowledges, speakers often use the direct-speech format to report language
they know was not uttered in precisely the form in which they quote it, as evidenced by
the following example, from a natural narrative, where the direct quote is preceded by an
explicitly hedged inquit formula: I was saying things like: "Well, could we stop by my
house so that I can tell my family that I have to go to jail haha an ' I can't make the
funeral?" That speakers act as if they can remember distal speech accurately and
reproduce it verbatim is also suggested by the following utterance, from the same natural
narrative (reference in n.38): I remember him saying: "Son, you're under arrest, you
have to go to jail".
33 For a more detailed analysis of the formal features and constraints of free indirect
discourse, see Banfield (1978/1993). Like many analysts, Banfield identifies utterances
of free indirect discourse on the basis of clause-level formal features. For a compelling
challenge to this position, relevant in particular to languages like English that lack a
distinctive imperfective aspect, see Ehrlich (1991), which establishes important dis
course-based criteria for identifying this style of quotation.
34 A distinctive feature of fictional narrative that sets its apart from nonfiction is the
cognitive access the narrator has to the mental states of other individuals; only in
constructed narratives (i.e. fiction) which are true in constructed worlds can one indi
vidual enter — and represent through language — the consciousness of another. When
conversational narrators appear to do this — and we often do — the thought quotation
must technically be viewed as an inference, not an assertion.
Imperfective and Irrealis 547
35 Echoing Kuno (1972), Wierzbicka (1974:271) insists that "direct speech is in some
important and intuitively clear way more basic than indirect speech. To see this it is
sufficient to think of all those languages which do not have indirect discourse at all
(whereas the existence of direct discourse seems to be universal), and also of the
historical priority of direct discourse in those languages in which the gradual develop
ment of the indirect discourse is historically attested."
36 The most vehement advocate of this position is Banfield (1982, 1985, and elsewhere in
her writings), who insists on the chronological coincidence of free indirect discourse
with the emergence of the modern novel in the 19th century. For discussion, see
Fleischman (1990, 230ff.), (Haiman 1989).
37 From "Jeff goes to jail," a narrative tape-recorded and transcribed by Annie Jaisser. In
the transcribing of an oral narration editorial decisions inevitably have to be made, e.g.,
about punctuation, capitalization, and — most important for our purpose — conventions
for quotation (cf. Chafe 1988). The quotation conventions chosen will, of course, reflect
the transcriber's interpretation of the style(s) of quotation used. In (25) and (26) I have
changed the transcriber's double quote marks to the stylistically neutral convention <Q
... Q>.
38 In Fleischman (1990, Chap. 7) I discuss the influence of the grammatical person of a
narration (first vs. 'third') on what are here referred to as irrealis quotation styles.
39 An alternative reading of the imperfective aspect of these inquit formulas is presented
below.
40 This should occasion no surprise. As shown in the papers in this volume by Mithun and
Chafe, grammatical categories (e.g. future, imperative) or constructions (questions,
negation) are not uniformly classified as realis or irrealis across languages, nor even at
times within the same language (e.g. imperatives in Jamul Diegueño and Alamblak, as
reported by Mithun).
41 This explains why the Imparfait will be chosen when the past events in question are
construed collectively, whereas the Passé Simple (or Passé Composé in speech) will be
chosen when they are construed individually. The collective construal corresponds to
what is here referred to as habitual, the individual construal to what is here referred to as
iterative. Hacking (1993:158) offers a particularly nice formulation of this distinction
(with reference to Slavic, but easily generalizable), observing that imperfective aspect is
consistent with a "global, stative overview of a series of repeated events," i.e. habitual,
while perfective is called for if the focus is on "the individual quality of the subevents,"
i.e. iterative. Though there are languages that formally distinguish iterative from ha
bitual (e.g. Old Persian, per Lazard 1975), for our purpose it is unnecessary to insist on
the distinction. For the record, in Old Persian it is the Habitual marker -i that is also used
to express irrealis meanings, notably in unreal conditionals of the type illustrated in (5).
42 Givón (1984:285) notes that the most common test for realis/irrealis involves the
referentiality of indefinite arguments under the scope of these modalities.
43 Regarding Eng. would, Bybee, Perkins, & Pagliuca (1994:239) point out that historically
the habitual and irrealis (conditional) uses are "entirely independent developments."
Whereas for Macedonian, Galton (1987:365) opines that "the two probably developed
alongside each other."
548 Suzanne Fleischman
44 As James observes, "Imperfective aspect is usually held to have as its essential feature
the fact that it indicates that an action or state is not being viewed as a complete whole,
but rather as something going on in time and not yet completed. But likewise, if an action
or state is hypothetical, then it is unrealized and in that sense is not an actual complete
action or state. Thus, imperfective aspect and the hypothetical share a semantic feature in
that they both indicate something which is in some way not fully realized" (1982:p. 399).
45 One difficulty with the backgrounding hypothesis, pointed out to me by Ekkehard
König, is that background information tends to be presupposed, i.e., it is presented as
being not subject to challenge. It would thus seem more likely to correlate with realis
than with irrealis.
46 At issue here is 'focus' in the visual, perceptual sense, not as a category of information
structure contrasting with 'topic'.
47 Chafe (1994 and p.c.) suggests that these various contexts, or the irrealis meanings they
instantiate, have as a common denominator the marking of an event as imagined rather
than directly experienced. This distinction figures as the last of a hierarchy of opposi
tions he sets up, according to which experiences are divided into those that are immedi
ate and those that are displaced, and within the latter category, those that are remem
bered and those that are imagined, though he acknowledges the fuzziness of this last
distinction.
48 Like Bulgarian, Macedonian has a dual aspectual system in the past involving two
oppositions: Perfective/Imperfective and Aorist/Imperfect, though the status of the latter
opposition — tense or aspect — remains controversial (see Fielder 1993, §§2.4.7-8).
According to Aronson (1977), the latter opposition has remained fundamentally aspec
tual, while the former, superordinate opposition has become more modal, with the
(suffixally derived) Imperfectives giving rise to the new "Simple Conditional" forms (cf.
also Galton 1987, Hacking 1993, Fielder 1993).
49 Though these two categories of the linguistic metalanguage are differentiated at the
theoretical level, in practice (i.e. when applied to real-language data) the distinction
often blurs (see Willett 1988).
REFERENCES
Fleischman, Suzanne & Linda R. Waugh, eds. 1991. Discourse Pragmatics and the Verb:
The Evidence from Romance. London & New York: Routledge.
Galton, Herbert. 1987. "Unreal Hypothetical Periods in South Slavic and Greek and
Related Features." In Honor of Use Lehiste ed. by R. Channon & L. Shockey, 357-368.
Dordrecht: Foris.
Givón, T. 1973. "The Time-Axis Phenomenon." Language 49.890-925.
Givón, T. 1984. Syntax. A Functional-Typological Introduction. Vol. 1. Amsterdam:
Benjamins.
Gougenheim, Georges. 1970. "La valeur psychologique des temps dans le 'Monologue de
Figaro'." Etudes de grammaire et de vocabulaire français, 149-54. Paris: Picard.
Hacking, Jane F. 1993. A Comparative Typology of Conditionals in Russian and Macedo
nian. Unpublished dissertation, University of Toronto.
Haiman, John. 1989. "Alienation in Grammar." Studies in Language 13,1.129-170.
Hale, Kenneth. 1969. "Papago/ci-m." Cambridge, MA: MIT (mimeo).
Harris, Martin B. 1978. The Evolution of French Syntax. A Comparative Approach.
London: Longman.
Heine, Bernd & Mechthild Reh. 1984. Grammaticalization and Reanalysis in African
Languages. Hamburg: Buske.
Heine, Bernd, Ulrike Claudi & Friederike Hünnemeyer. 1991. Grammaticalization. Chi
cago: University of Chicago Press.
Hopper, Paul J. 1979. "Some Observations on the Typology of Focus and Aspect in
Narrative Language." Studies in Language 3,1.37-64.
Hopper, Paul J. 1981. "Aspect and Foregrounding in Discourse." Discourse and Syntax (=
Syntax and Semantics, 12) ed. by T, Givón, 213-41. New York: Academic Press.
Hopper, Paul J. & Sandra A. Thompson. 1980. "Transitivity in Grammar and Discourse."
Language 56,2.251-99.
James, Deborah. 1982. "Past Tense and the Hypothetical: A Cross-Linguistic Study."
Studies in Language 6,3.375-403.
Klein-Andreu, Flora. 1991. "Losing Ground: A Discourse-Pragmatic Solution to the
History of -ra in Spanish." Fleischamn & Waugh, 1991. 164-178.
Kuno, Susumo. 1972. "Pronominalization, Reflexivization, and Direct Discourse." Lin
guistic Inquiry 3,2.161-195.
Lass, Roger. 1990. "How To Do Things With Junk: Exaptation in Language Evolution."
Journal of Linguistics 26,1:79-102.
Lazard, Gilbert. 1975. "La catégorie de l'éventuel." Mélanges linguistiques offerts à
Emile Benveniste, 221-232. Louvain: Peeters.
Leech, Geoffrey N. & Michael H. Short. 1981. Style in Fiction. London: Longman.
Le Goffic, Pierre. 1986. "Que l'imparfait n'est pas un temps du passé." Le Goffic, 1986.
55-69.
Le Goffic, Pierre, ed. 1986. Points de vue sur l'imparfait. Caen: Université de Caen.
Lichtenberk, Frantisek. 1991. "Semantic Change and Heterosemy in Grammaticaliza
tion." Language 67.475-509.
Lunn, Patricia V. 1985. "The Aspectual Lens." Hispanic Linguistics 2.49-61.
Lunn, Patricia V. & Thomas Cravens. 1991. "A Contextual Reconsideration of the
Spanish -ra 'Indicative'." Fleischman & Waugh, 1991. 147-163.
Imperfective and Irrealis 551
and past tense modals 504,513- 48, 142-149, 165, 208, 209, 429,
514,516 446, 455-457, 462-464, 470, 478,
pathways of 3, 5, 349, 380,445 480,481,543
and pidgins/creoles 11, 401- crosslinguistic expression of 6
402,410,416,420 defined 4,6,55,165,293
and polysemy 5,325 development of 321-322
and realis/irrealis 10, 352, 375- epistemic necessity. See under neces
377, 385-386,429 sity
and signed language 140, 141, epistemic possibility. See under pos
144, 150, 151, 157 sibility
social factors in 398, 400 evaluation. See speaker evaluation
direct experience 168,173,485,489, evidentiality 4, 13, 77, 85, 101, 165,
490, 542, 548 458, 529, 532, 542, 543. See also
discourse quotative
discourse context 8,433, 434 and child language 194-196,
discourse interactive functions 197
3, 199-201, 212, 214, 216, expression of 168-169, 172,
219,221,225,227,229-230, 173, 175
234,432,434 {see also under indirect evidence 481,484,487,
child language development; 490
obligation) evolution (of language), 137
genres of 85 exaptation 543
interactive 3, 8, 9, 11, 12, 169, existential subject 26, 35-36, 58
173, 175 expected/unexpected 169, 357
doubt 333,430,484,490-491,495
dreams (narration of), 519, 530-531, F
542 facial expression. See non-manual
drift 538 signs under signed language
dubitative modality 2 factive 432
dummy subject 35-36 factual 29, 88, 92, 94, 359, 363
dynamic modality 209 fantasy (narration of), 530-531, 540
fear 296,297,302,304,305-306,307,
E 309, 310, 311, 314, 315, 319, 320,
emergent grammar 446 322. See also apprehensional-
emphasis 332, 337, 343 epistemic modality
enabling conditions 5, 77 fiction, fictionality 520, 526, 529,
endearment 339-340, 528. See also 530, 534, 535, 536, 541, 544, 546
hypocoristic focal sense (of a category). See under
epistemic downtoner 294-295, 301- meaning
302, 316-317, 318, 320, 321, 322 focus (contrastive), 280, 281, 282
epistemic modality 4,5,7,12,13,17- focus (visual), 521, 539-542, 548
Index of Languages 557
hypothetical 43, 44-45, 92, 96, 103, inference 74, 82, 168, 173, 546, 542
395-396, 397, 413, 480, 482, 491, information value 429,430-432,433.
495-496, 520, 523, 529, 539, 542, See also knowledge status; new in
543,544,548. See also counterfac- formation; old information
tual; unreal conditionals intention 120,123-124,131,149-150,
and past tense 503-504, 506, 260, 262, 263-264, 395, 413-414,
509-514,516 420, 508, 528
interior monologue 520,541
interrogative mood. See questions
iconicity 136, 152,281,444 intonation. See suprasegmental fea
vs. arbitrariness 135, 151, 152, tures
344 invariant meaning. See under mean
and child language 151-152 ing
erosion of 145, 151, 152-153 inversion (subject-auxiliary), 477,
and signed language 7,138-139, 497-498
144, 151-158 irony 333
illocutionary force 43, 44, 336, 528 irrealis 11, 282, 369-370, 397, 410,
image schemata 137-138 440, 495. See also realis/irrealis
imagination 10, 349, 350, 358, 363, distinction
368, 384, 386, 548 categories classified as 396,412
imperative mood 3, 6, 147, 358-359, contexts conditioning 353-358,
363, 376-378, 396, 412, 447, 484, 362-363, 370-371, 373-375,
490, 492, 545 377,384-385,413
from future 410,412,420,421 crosslinguistic variability of 3
vs. obligation 275, 280-283 defined 9-10, 12-13, 350-351,
imperfective aspect 11, 12, 514. See 522-523
also perfective/imperfective dis and future tense 395-396,419
tinction grammaticalization of 353,354,
defined 521 359, 389-390, 393-394, 395,
and irrealis 12,519-548 400, 410,422
impersonal 107, 116, 117, 118, 120, and imperfective aspect 12,519-
126,127,130,458-459 548
impersonal subject/marker 35,36,57 and negation 430
implicature, conventionalization of redundancy in marking of 10
38, 39, 322 irregular, irregularity 11, 453, 456,
incorporation 402,408,423 465, 468-470
indefinite 10,436, 498, 538, 547 systems of 459-461
indicative mood 430,431,478,480 types of 453-455
indicative/subjunctive contrast iterative aspect 521, 536, 537-538,
11,430-431,434,436 547
Index of Languages 559
mood 3, 13, 91, 108, 139, 329, 341, and subjunctive mood 440
429,431,473 neutralization (of distinctions), 40,
choice of 11, 443, 445, 446, 249,251,256,353,524,531
431-432,434 new information 174-175, 431. See
conditional {see conditionals) also information value; knowledge
defined 2,70-71,329 status; old information
imperative (see imperative mood) noteworthiness 172
indicative (see indicative mood)
irrealis (see irrealis) O
realis (see realis) obligation, obligative modality 4, 5,
subjunctive (see subjunctive 6,8,33,38,55,90,107-131,139-
mood) 140, 258, 356, 359, 363
motion verbs 48,389 and complementizers 479-480,
into future tenses 37, 38, 125, 481-484,490,492,496
415,422,524-525 crosslinguistic comparison of 9,
movement rules (formal theory), 474 240, 245-246, 269-271, 283,
284-285
N development of 7, 107-108,
narrative 8, 429, 443-444, 519, 520, 115-120,159,289
529-532, 534-536 degree of 140, 241-242, 247,
necessity 4, 107, 120, 121-123, 131, 272
140, 455,456-457, 459,462, 467 discourse interactive functions
and constraints 120, 121, 128, of 9,240,241,247-257,259-
129 271,273-277,283,284
epistemic 4, 55, 59, 60, 86, 87- into other categories (epistemic
89, 458-460 modality, future tense, infer
root/deontic 55, 59, 60, 128- ence) 38,239,243-244,245,
130, 455, 458-460, 462, 463 278, 279, 284
negation 11-12, 63, 143, 150, 453- and negation 454, 455, 457,
470. 465-468,470
de dicto (metalinguistic)/de re and past tense 504, 510, 512,
102,329,331-332 513,515,516
and future tense 361-362 word order marking of 240,246,
(mis)placement of 12,332,454, 278, 279-283, 288
461,463,465,466,468,469 old information 431, 432-433, 434,
and realis/irrealis 430,447,354- 438. See also information value;
355,360,374-375,380,381- knowledge status; new information
384 optative modality 329, 359, 361,492
scope of 11,74,78, 102,455-
456,461,470
Index of Languages 561
A
Acholi 107-117,119-127,130-132
c
Alamblak 378, 547 Caddo 10, 349, 351, 353-354, 358-
Albanian 333 365, 367, 378, 380-383, 385
Cape Verde Creole 422
Alsea 368
Catalan 436
Alur 108-111, 113, 117-120, 125-
Cayuga 359
126, 128-130, 132
Central Siberian Eskimo 320
Amele 375-376, 380
Chadic languages 486, 498, 500
American Sign Language 7,135,138-
Chinese 240-242,245-247,253,257-
140, 142, 144-146, 148, 150-159
258, 260, 264-274, 277-278, 285-
Antiguan Creole 228
286, 288, 466 (see also Mandarin)
Arabic 80-81,465-466
Chinese Pidgin English 408, 424
Assamese 462, 465-466
Chinese Pidgin Russian 169, 542
Austronesian languages 9, 294, 315,
Cree 523
325, 367, 396
Czech 302,309,319
B D
Bahasa Malaysia 466 Danish 463-466,468-470
Balkan languages 523 DhoLuo 108,110,113,118-120,129-
Bargam 537 131
Basque 304 Dhopadhola 108,109,111,113,115,
Bauan 315 119,126, 130,132
Betelgeuse 334 Diegueño
Bikol 537 Jamul 368, 377-378, 383-384,
Bislama 403, 422 547
Boiken 403 Mesa grande 383-384
Boumaa Fijian 315,317-318,320 Diyari 306
Brazilian Cities Sign Language (BCSL) Dutch 474, 500, 530
155
Bulgarian 542, 545, 548 E
Buriat 2 East Sepik 378
English 4, 5, 7-10, 12-13, 17-20, 26-
566 Index of Languages
27,33-35,37,39,40,43-44,47,55- F
61, 65, 68, 70-71, 78, 85, 91, 94, Farsi 465,466
96, 100, 102, 118, 120, 124-125, Fijian 315,322,325
129,166,170,176,201,204, 206- Boumaa 315,317-318,320
208, 212, 227-228, 232, 235-236, Nabukeleva 325
240-249, 252-253, 257-260, 262- Standard 315-318
266, 268-275, 277-281, 283, 285- Wayan 320
286, 288-291, 294, 302, 304, 313, Finnish 544
320, 325, 333-334, 342, 349, 350- French 12, 102, 330, 352, 403, 437,
351, 361-365, 378, 382, 389, 391, 453, 458, 460-461, 465, 479-483,
396-397, 401-404, 408-410, 417- 500, 504, 514-515, 520-521, 523-
419, 422, 424-427, 454-457, 461- 526, 528, 530, 532-533, 537, 539,
462, 465-466, 468-470, 474-479, 544-545, 550
485, 497, 498, 500, 503, 504-511, Old 480
513-516, 520, 524, 527, 530, 535- French Sign Language (FSL) 158-159
538,546-547 549,551 Old 140
American 61, 63-64, 330-331,
513,516 G
British 47, 60, 63-64, 244, 331, German 7, 17-20, 23-29, 34-36, 42,
541 46-48, 50, 56, 58, 76, 352, 446,
Early Modern 12,511,516 467-470, 523, 544
Elizabethan 513 High 27,48, 52
Late Old 507 Modern 48
Middle 12, 505, 507, 509-511, Gothic 478
514 Greek 166, 289, 311, 460, 467-468,
Modern 43, 315, 503, 506-508, 523
511,513 Classical 311,320
Old 314, 479, 497, 498, 504, Homeric 311
507,516 Modern 459, 462, 467
English-based pidgins and creoles Guider (Central Chadic) 481-483
389, 391, 407, 427 (see also
Chinese Pidgin English H
Hawai'i Creole English Hawai'i Creole English (HCE) 391,
Hawai'i Pidgin English 422
New South Wales Pidgin English Hawai'i Pidgin English 422
Pacific Jargon English Hebrew 455, 466, 538
Pacific Pidgin English) Biblical 9, 240-241, 246-247,
European languages 7,10-11,13,367 277-279, 281-283, 285-286,
Ewe 343 288
Index of Languages 567
T
Tamil 465, 466, 530
Tibetan 542
Tigrinya 457,458, 465
To'aba'ita 294, 299, 301-306, 315,
320, 325
Tok Boi 424
TokPisin 11,389-391,393,395-397,
400-401, 403-405, 407, 409-411,
416-424 {see also Melanesian
Pidgin)
Author Index
A Biskup,R 424
Abraham, W. 18,19 Bloom, H. 289
Adachi,T. 343,334 Bloom, L. 176,187,201,289
Adam, J. 526, 543 Bolinger, D. 429,445-477, 485
Adams, D. Q. 334 Borchardt,K. 403,424
Aijmer, K. 38 Borges, J. 338-339
Akatsuka, N. 168, 201 Borsley, R. 474
Aksu-Koç, A. 166, 168, 333, 543 Bosch, P. 73
Albert, R. 288 Bouchard, D. 474
Alcantra, S. 369 Bourciez, E. 480
Alerotek 122 Bourdin,R 544
Andrews, J. 504,515 Boyes-Braem, R 140, 159
Ansre, G. 343 Brecht, R. 500
Aristotle 313 Bresnan,J. 474,477-478
Amstrong, D. 138, 154 Brito,F. 155
Aronson, H. 538, 542, 543, 548 Brody,G. 176,201
Austin, R 306,307 Browman, C. 137
Brown, P. 149
B Bryce,A. 440
Baker, C. 144 Buckley, E. 367, 368
Baker, P. 424 Bull,W. 72
Banfield, A. 535, 537, 546, 547 Buscha,J. 19
Barnes, A. 287,288 Bybee, J. 1,2,4-6,12-13,17,20,33-
Barsalou, L. 446 35, 37, 39, 42-47, 62, 64, 70, 73,
Bavin, E. 7, 109 75, 77, 93-94, 102, 107-108, 115-
Bazzanella, C. 543,544 116,118,120-121,124,126, 129,
Bello, A. 72 131, 133, 137, 156, 159, 165-166,
Bellugi, U. 136, 152, 153, 158, 159 177, 193, 229, 236, 239, 243-244,
Benet, J. 437 279, 284, 287, 293, 321, 351, 367,
Bergmann, U. 393 395, 396, 409-412, 415-421, 423-
Bertinetto, P. 524, 534, 536, 543- 424,447,476, 514, 521, 525, 527,
544, 545 536-537, 539, 543-544, 547
Bickerton, D. 389-391,396,416,474- Byrnes, J. 175
475, 477
570 Index of Names
c Dasgupta, P. 474
Davidsen-Nielsen, N. 457,463,468-
Calbert,J. 34
469
Cao, Y. 258-259, 261-262, 266-268,
Deane,P. 137-138
270, 287
DeLancey, S. 168-169
Cappello, S. 526,543
DeMello,G. 441,447
Cedeño,A. 431-432
Demers, R. 19,47
Chafe, W. 4, 10, 13, 165, 168, 333,
Dempwolff, O. 367
349, 352, 361-362, 368, 378, 380-
Dench,A. 307-309
383, 385,429, 522, 530,533,543-
Detzner, H. 403,424
544, 546-548
Dixon, R. 295,317,318
Choi, S. 8,17,166-167,172,181,183
Donaldson, T. 305
Chomsky, N. 151,474,513
Donoso, J. 435
Chowning, A. 406
Dore, J. 203
Chung, S. 367,538
Driberg,J. 112,115,119
Churchill, W. 402
Duff,M. 175
Churchward, C. 316
Cichocki,W. 474
E
Clancy, P. 201
Eatough, A. 367
Clark, W. Van Tilburg 519,530
Edelman, G. 136
Claudi, U. 21, 37, 38, 40, 42,45, 48,
Ehrlich, S. 546
543
Emonds, J. 478
Coates, J. 7,18,31,27,33-35,39-41,
47,55-56,61,64,68,71-72,74,78,
F
94, 100, 103, 108, 124, 206, 236,
Faltz,L. 73
242-245, 284, 286-287, 313, 503-
Fielder, G. 548
504,514
Fischer, S. 139
Cokely,D. 144
Flaubert, G. 533,534
Collins 313,315
Fleischman, S. 11-12, 43-45, 92,
Comrie,B. 367,378,521,525,536
108, 158, 429, 447, 507-508, 514-
Conteras, H. 432
515, 525-526, 529-530, 541, 544-
Cooreman, A. 239
545, 547
Craig, C. 367
Foley, W. 367,396
Cravens, T. 436,544
Fowler, C. 136-137
Crazzolara, J. 112-113, 115
Fox, A. 239,280,303
Criper,C. 109
Frajzyngier, Z. 12,476,477-478,481,
Crowley, T. 401-402,422
485,487,491,498
D Freeland,L. 504,516
Dahl,Ö. 1,2,523,537-539,544 Friedman 333
Daniels, K. 369 Frishberg,N. 140,145,150-151,157
D'Annunzio, G. 531 Furrow, D. 175-176
Index of Names 571
G Helbig,G. 19
Galton, H. 523, 538, 547-548 Hernández, C. 67
García, E. 72-73 Herring, S. 239
Geertz,C. 273,276 Heschel,A. 289
Geraghty, P. 317-318 Higgins,D. 141, 145
Gerhardt, J. 8, 236, 228, 235 Hirst, W. 175
Gili y Gaya, S. 439 Hood,L. 176,187,201
Givón,T. 48,77, 135-137, 146, 149, Hooper, J. (see Bybee) 429
151-152, 157, 159, 228, 239, 243, Hopper, P. 1,154,236,239,320,325,
281-282, 318, 325, 351, 362, 367, 446, 539, 540
378, 395-396, 412, 416, 474-475, Horn,L. 299,331,343
505, 537, 544, 547 Humphries, T. 140,147, 158
Glick,R. 109 Hiinnemeyer, F. 21,37,38,40,42,45,
Goldstein, L. 137 48, 543
Goodwin, W. 311-312 Huntingford, G. 118-119
Gopnik,A. 167,181
Gordon, L. 376-377,504 J
Gordon, R. 504 Jack,F. 369
Gougenheim, G. 528, 529, 545 Jackendoff, R. 362
Gough, B. 139, 150 Jaisser, A. 547
Greenbaum, S. 497, 498 Jakobson, R. 13
Gregerson, E. 118 James. D. 397, 514, 523, 538-539,
Grice,R 545 543, 548
Grondona, M. 431 Janssen, T. 18-19
Gsell,0. 431 Jasperson 476-477,481, 485,487
Guitart,J. 432 Jelinek, E. 19,47
Guo,J. 8,64,437 Jenkins, L. 19
Jensen, E 480
H Jespersen, O. 29, 32,42, 56, 58,135,
Hacking, J. 542-543, 545, 547-548 314
Hackman, B. 303 Johnson, M. 137
Haegeman, L. 55,71, 102 Johnson, R. 159
Haiman,J. 9,151,159,309,330,332, Joseph, B. 474
343, 532, 533, 535, 546-547 Jourdan,C. 401,422
Hale,K. 528,545
Hall, J. 314,404 K
Halliday, M. 68,70,94,102,205,228 Kac,M. 474
Harris, M. 524 Kany,C. 436
Heine, B. 7,20-21,37-38,40,42,45, Kay, P. 390
47-48, 56-58, 65, 76-77, 158,446, Keesing, R. 303,403,408
543 Kemmer, S. 331
572 Index of Nantes
L M
Laberge, S. 390, 394, 398-402, 406, Maingueneau, D. 533
408,411,419 Mallinson, G. 334
Labov,W. 390,402,408 Malo, S. 119
Ladefoged,P. 109 Malotki,E. 273-277,288
Lakoff, G. 137 Marchese, L. 409,421
Lakoff,R. 137,434,236 Marcos Marín, F 67
Lambdin, T. 286 Markey,T. 390
Langacker, R. 17, 135, 137, 514 Marsé, J. 441
Langdon, M. 383-384 Martin, R. 529, 545
Lange, D. 536 Mayes, R 546
Lass,R. 543 McCay,W. 331
Lavandera, B. 443 McDonald, B. 157
Lawal,N. 474 Meier, R. 151-152
Lazard, G. 538,547 Meltzoff,A. 167,181
Lau,J. 287 Mihalic, F. 393,401,402
Le Goffic, R 521, 524, 526, 543 Miller, A. 367-368, 377-378, 384
Leben, W. 498 Miller, G. 102-103
Lee, H. 169, 172-175, 177, 187, 194 Mitchell, B. 314
Lee,K. 172 Mithun,M. 10,354,429,522,545,547
Index of Names 573
71 Nedjalkov, Vladimir P. (ed.): Reciprocal Constructions. With the assistance of Emma Š. Geniušienė and
Zlatka Guentchéva. Expected June 2007
70 Zúñiga, Fernando: Deixis and Alignment. Inverse systems in indigenous languages of the Americas.
xii, 309 pp. Expected November 2006
69 Aranovich, Raúl (ed.): Split Auxiliary Systems. A cross-linguistic perspective. viii, 268 pp. + index.
Expected December 2006
68 Abraham, Werner and Larisa Leisiö (eds.): Passivization and Typology. Form and function. 2006.
x, 553 pp.
67 Veselinova, Ljuba N.: Suppletion in Verb Paradigms. Bits and pieces of the puzzle. 2006. xviii, 236 pp.
66 Hickmann, Maya and Stéphane Robert (eds.): Space in Languages. Linguistic Systems and Cognitive
Categories. 2006. x, 362 pp.
65 Tsunoda, Tasaku and Taro Kageyama (eds.): Voice and Grammatical Relations. In Honor of
Masayoshi Shibatani. 2006. xviii, 342 pp.
64 Voeltz, F. K. Erhard (ed.): Studies in African Linguistic Typology. 2006. xiv, 426 pp.
63 Filimonova, Elena (ed.): Clusivity. Typology and case studies of the inclusive–exclusive distinction.
2005. xii, 436 pp.
62 Couper-Kuhlen, Elizabeth and Cecilia E. Ford (eds.): Sound Patterns in Interaction. Cross-
linguistic studies from conversation. 2004. viii, 406 pp.
61 Bhaskararao, Peri and Karumuri Venkata Subbarao (eds.): Non-nominative Subjects. Volume 2.
2004. xii, 319 pp.
60 Bhaskararao, Peri and Karumuri Venkata Subbarao (eds.): Non-nominative Subjects. Volume 1.
2004. xii, 325 pp.
59 Fischer, Olga, Muriel Norde and Harry Perridon (eds.): Up and down the Cline – The Nature of
Grammaticalization. 2004. viii, 406 pp.
58 Haspelmath, Martin (ed.): Coordinating Constructions. 2004. xcv, 578 pp.
57 Mattissen, Johanna: Dependent-Head Synthesis in Nivkh. A contribution to a typology of
polysynthesis. 2003. x, 350 pp.
56 Shay, Erin and Uwe Seibert (eds.): Motion, Direction and Location in Languages. In honor of Zygmunt
Frajzyngier. 2003. xvi, 305 pp.
55 Frajzyngier, Zygmunt and Erin Shay: Explaining Language Structure through Systems Interaction.
2003. xviii, 309 pp.
54 Aikhenvald, Alexandra Y. and R.M.W. Dixon (eds.): Studies in Evidentiality. 2003. xiv, 349 pp.
53 Givón, T. and Bertram F. Malle (eds.): The Evolution of Language out of Pre-language. 2002. x, 394 pp.
52 Güldemann, Tom and Manfred von Roncador (eds.): Reported Discourse. A meeting ground for
different linguistic domains. 2002. xii, 425 pp.
51 Newman, John (ed.): The Linguistics of Sitting, Standing and Lying. 2002. xii, 409 pp.
50 Feigenbaum, Susanne and Dennis Kurzon (eds.): Prepositions in their Syntactic, Semantic and
Pragmatic Context. 2002. vi, 304 pp.
49 Wischer, Ilse and Gabriele Diewald (eds.): New Reflections on Grammaticalization. 2002.
xiv, 437 pp.
48 Shibatani, Masayoshi (ed.): The Grammar of Causation and Interpersonal Manipulation. 2002.
xviii, 551 pp.
47 Baron, Irène, Michael Herslund and Finn Sørensen (eds.): Dimensions of Possession. 2001.
vi, 337 pp.
46 Aikhenvald, Alexandra Y., R.M.W. Dixon and Masayuki Onishi (eds.): Non-canonical Marking of
Subjects and Objects. 2001. xii, 364 pp.
45 Bybee, Joan and Paul J. Hopper (eds.): Frequency and the Emergence of Linguistic Structure. 2001.
vii, 492 pp.
44 Voeltz, F. K. Erhard and Christa Kilian-Hatz (eds.): Ideophones. 2001. x, 436 pp.
43 Gildea, Spike (ed.): Reconstructing Grammar. Comparative Linguistics and Grammaticalization. 2000.
xiv, 269 pp.
42 Diessel, Holger: Demonstratives. Form, function and grammaticalization. 1999. xii, 205 pp.
41 Frajzyngier, Zygmunt and Traci S. Curl (eds.): Reciprocals. Forms and functions. Volume 2. 2000.
xii, 201 pp.