Disjunctive Questions Intonation and Highlighting
Disjunctive Questions Intonation and Highlighting
net/publication/221016387
CITATIONS READS
42 845
2 authors, including:
Floris Roelofsen
University of Amsterdam
106 PUBLICATIONS 1,464 CITATIONS
SEE PROFILE
All content following this page was uploaded by Floris Roelofsen on 22 January 2014.
Amherst/Amsterdam
This paper examines how intonation affects the interpretation of disjunctive questions. The semantic effect
of a question is taken to be three-fold. First, it raises an issue. In the tradition of inquisitive semantics, we
model this by assuming that a question proposes several possible updates of the common ground (several
possibilities for short) and invites other participants to help establish at least one of these updates. But apart
from raising an issue, a question may also highlight and/or suggest certain possibilities, and intonation
determines to a large extent which possibilities are highlighted/suggested. We will introduce a composi-
tional version of inquisitive semantics, and extend this framework in order to capture the highlighting- and
suggestion potential of sentences. This will lead to a systematic account of the answerhood conditions and
implications of disjunctive questions with different intonation patterns.
?
This paper has benefited enormously from discussions with Ivano Ciardelli and Jeroen Groenendijk, for which we
are very grateful. We would also like to thank Maria Aloni and Kathryn Pruitt for helpful feedback.
1
The semantic significance of these prosodic features has been established experimentally by Pruitt (2007).
2
Previous work on disjunctive questions usually distinguishes block intonation from closed intonation, but does not
take the open intonation pattern into account (cf. Bartels, 1999; Han and Romero, 2004a,b; Beck and Kim, 2006).
(4) Intonation patterns for wide-scope disjunctive interrogatives:
a. Open intonation: Does Ann↑ play the piano, or Bill↑?
b. Closed intonation: Does Ann↑ play the piano, or Bill↓?
Focus and closure. We take it that emphasis in the acoustic signal is a reflex of a focus feature in the logical
form, and that the rising-and-falling pitch contour in (3c) and (4b) correlates with a closure feature in the
logical form. It seems that this closure feature affects the pronunciation of the entire sentence (not just of,
say, the contrastive elements in both disjuncts). Therefore, we assume that it is adjoined to the sentence
as a whole. The ensuing logical forms are listed in the table below. Focus features, closure features, and
interrogative complementizers are denoted by F, C, and Q, respectively.
Basic data. Our theory should capture, at the very least, the effects of intonation on answerhood condi-
tions. The basic empirical observations are summed up in (5), (6), and (7) below (wide-scope disjunctive
interrogatives are not explicitly listed here; they behave exactly like their narrow-scope counterparts in the
relevant respects). Notice that open intonation behaves in some ways like block intonation, but in others
more like closed intonation: it licenses a no answer, but it does not license a yes answer. To the best of our
knowledge, this observation has not been taken into account before.
(5) Does Ann-or-Bill↑ play? (6) Does Ann↑ or Bill↑ play? (7) Does Ann↑ or Bill↓ play?
a. No. ⇒ neither a. No. ⇒ neither a. #No.
b. Yes. ⇒ at least one b. #Yes. ⇒ yes what?! b. #Yes.
c. (Yes,) Ann does. c. Ann does. c. Ann does.
d. (Yes,) Bill does. d. Bill does. d. Bill does.
A further observation that should be accounted for is that disjunctive interrogatives with closure intonation
convey that the speaker expects that exactly one of the disjuncts is true. In this respect, disjunctive inter-
rogatives with closure intonation are similar to the corresponding disjunctive declaratives. However, there
is also an important difference, as illustrated in (8) and (9):
(8) Ann↑ or Bill↓ plays the piano. (9) Does Ann↑ or Bill↓ play the piano?
a. No, neither of them does. a. #No, neither of them does.
b. Actually, neither of them does.
The difference is subtle but clear: (8) really excludes the possibility that neither Ann nor Bill plays, while
(9) merely conveys an expectation on the speaker’s part that at least one of them does. In the first case,
disagreement can be expressed with no; in the second case, actually must be used instead.
The next section presents an analysis of disjunctive interrogatives in inquisitive semantics. This will
not directly account for the above observations, but it will serve as a useful basis.
2 Inquisitive Semantics
In inquisitive semantics, a sentence is taken to propose one or possibly several ways to update the common
ground of a conversation. Formally, the proposition expressed by a sentence is a set of possibilities, each
of which is in turn a set of indices, and represents a possible update of the common ground.
In previous work (Groenendijk, 2009; Mascarenhas, 2009; Groenendijk and Roelofsen, 2009; Ciardelli
and Roelofsen, 2009; Ciardelli, 2009; Balogh, 2009, among others), inquisitive semantics has been defined
for the language of propositional logic and the language of first-order predicate logic, largely abstracting
away from issues of sub-sentential syntactic and semantic composition. In the present paper, we are specif-
ically interested in this process of semantic composition at the sub-sentential level, and especially in the
role that certain prosodic features play in that process. So, to start with, we need to define a compositional
inquisitive semantics for a suitable fragment of English. Fortunately, much of the technical machinery that
we need is familiar from alternative semantics (Hamblin, 1973; Kratzer and Shimoyama, 2002; Alonso-
Ovalle, 2006, among others).
Basic ingredients. As usual, we will say of each expression in our language that it is of a certain type. The
basic types are e, s, and t, and whenever σ and τ are types, (στ) is also a type. Our semantics will map each
expression to a certain model-theoretic object. The type of an expression determines the kind of object
that it is mapped to. Each model-theoretic object belongs to a certain domain. There is a domain De of
individuals, a domain D s of indices, and a domain Dt consisting of the truth values 0 and 1. Furthermore,
for every complex type (στ) there is a domain D(στ) consisting of all functions from Dσ to Dτ . As in
alternative semantics, each expression of type τ is mapped to a set of objects in Dτ . The semantic value of
an expression α will be denoted by [[α]]. Notice that [[α]] is always a set. Therefore, we will refer to it as
the denotation set of α.
Semantic values are composed by means of pointwise function application:
(10) Pointwise Function Application
If [[α]] ⊆ D(στ) and [[β]] ⊆ Dσ , then
[[αβ]] B [[βα]] B {d ∈ Dτ | ∃a ∈ [[α]]. ∃b ∈ [[β]], d = a(b)}
Basic lexicon. Most lexical items are mapped to singleton sets, consisting of their standard denotations.
Disjunction. Disjunction introduces alternatives. The denotation set of a phrase ‘α or β’, where α and β
are two expressions of some type τ, is the union of the denotation set of α and the denotation set of β:
(12) For any type τ, if [[α]], [[β]] ⊆ Dτ , then [[α or β]] B [[α]] ∪ [[β]]
For example:
Notice that the denotation set of a complete sentence, such as ‘Ann or Bill plays’ is a set of objects in
D(st) . Such objects are functions from indices to truth values, or equivalently, sets of indices. In inquisitive
semantics, sets of indices are referred to as possibilities, and a set of possibilities is called a proposition.
So complete sentences express propositions.
Visualization. As long as we limit our attention to a language that contains, besides disjunction, just two
names, ‘Ann’ and ‘Bill’, and a single intransitive verb ‘play’, the propositions expressed by the sentences
in our language can be visualized in a helpful way. For instance, the sentence ‘Ann plays’ expresses the
proposition {λw.playw (Ann)}, which contains a single possibility consisting of all indices in which Ann
plays. This proposition is depicted in figure 1(a), where 11 is the index in which both Ann and Bill play, 10
11 10 11 10 11 10 11 10 11 10
01 00 01 00 01 00 01 00 01 00
(a) Pa (b) Pa ∨ Pb (c) ?Pa (d) ?Pa ∨ ?Pb (e) ?(Pa ∨ Pb)
the index in which only Ann plays, etcetera. Figure 1(b) depicts the proposition expressed by ‘Ann or Bill
plays’. As we saw in (13b), this proposition consists of two possibilities: the possibility that Ann plays, and
the possibility that Bill plays.
Excluded possibilities. Recall that the possibilities for a sentence α embody the ways in which α proposes
to update the common ground. If some index i is not included in any possibility for α, then we say that
i is excluded by α. For in this case, i will be eliminated from the common ground by any of the updates
proposed by α. If α excludes any indices, then we refer to the set of all such indices as the possibility
excluded by α. If α does not exclude any indices, then we say that it does not exclude any possibility. We
use [α] to denote the set of possibilities excluded by α (which is always either a singleton set, or empty).
For example, the proposition expressed by the simple polar interrogative ‘Does Ann play?’ consists of two
possibilities: the possibility that Ann plays, and the possibility that she does not play. These possibilities
embody two possible updates of the common ground, and the responder is invited to provide information
such that either one of these updates can be established.
Disjunctive interrogatives. Given these assumptions, the propositions expressed by wide- and narrow-
scope disjunctive interrogatives are the following:
(16) Wide-scope disjunctive interrogative: Does Ann play or does Bill play?
[[Q-does Ann play or Q-does Bill play]]
= [[Q-does Ann play]] ∪ [[Q-does Bill play]]
λw.playw (Ann), λw.playw (Bill),
( ) ( )
= ∪ ⇒ see figure 1(d)
λw.¬playw (Ann) λw.¬playw (Bill)
So much for the compositional treatment of our basic fragment in inquisitive semantics. Notice that this
treatment does not yet say anything about the licensing and interpretation of yes/no answers, or about
the ‘exactly one implication’ of disjunctive interrogatives with closure intonation. The following sections
propose an extension of the system that will allow us to capture these phenomena.
According to inquisitive semantics, as it has been developed so far, (18a) and (18b) are equivalent: they
both express a proposition consisting of two possibilities, the possibility that the door is open, and the
possibility that the door is closed. However, there is a clear empirical difference between the two: in reply
to (18a), yes means that the door is open, while in reply to (18b), it means that the door is closed.3
This difference is captured straightforwardly if we assume that (18a) highlights the possibility that the
door is open, that (18b) highlights the possibility that the door is closed, and that the interpretation of yes
and no is as hypothesized above. Our aim is to give a similar explanation of the licensing and interpretation
of yes/no answers in response to disjunctive questions. In order to do so, we must first specify how the
possibilities highlighted by a given sentence are compositionally determined, and in particular how focus
affects this process.
Proposing and highlighting. We will henceforth assume that the semantic value of a sentence α consists
of two components, [[α]]P and [[α]]H . Both [[α]]P and [[α]]H are sets of possibilities; [[α]]P embodies the
proposal that α expresses, and [[α]]H consists of the possibilities that α highlights.
The semantic value of subsentential expressions will also consist of these two components. For any
expression α, sentential or subsentential, we will refer to [[α]]P as its P-set, and to [[α]]H as its H-set. Both
P-sets and H-sets are composed by means of pointwise function application.
What we used to call the denotation set of an expression, then, is now called its P-set. As far as names,
verbs, and disjunction are concerned, H-sets are defined just as P-sets. However, as soon as interrogative
complementizers enter the derivation, P-sets and H-sets start to diverge. Recall that the proposal expressed
by [Q α] consists of the possibilities for α itself, plus the possibility that α excludes:
(19) [[Q α]]P B [[α]]P ∪ [α]
We will assume that [Q α] simply highlights the possibilities that α itself highlights, not the possibility that
α excludes:
(20) [[Q α]]H B [[α]]H
These assumptions are sufficient to capture the contrast between opposing polar questions:
3
This is sometimes taken to be a general argument against ‘proposition set’ approaches to questions—which include,
besides inquisitive semantics, the classical theories of Hamblin (1973), Karttunen (1977), and Groenendijk and
Stokhof (1984)—and in favor of alternatives such as the ‘structured meaning’ approach or the ‘orthoalgebraic’
approach (cf. Krifka, 2001; Blutner, 2009). Here, we choose not to pursue a full-fledged alternative to the proposition
set approach, but rather to extend it in a suitable way.
(21) [Q-is the door open] (22) [Q-is the door closed]
Proposes: open/closed Proposes: open/closed
Highlights: open Highlights: closed
yes ⇒ the door is open yes ⇒ the door is closed
no ⇒ the door is closed no ⇒ the door is open
Highlighting and focus. We will assume that focus affects the computation of H-sets. To see why, consider
the two focus structures that give rise to block intonation and open intonation, respectively:
Recall that (23a) licenses both yes and no as an answer, while (23b) only licenses no. Our hypothesis about
the interpretation of yes and no captures this contrast if we assume that (23a) highlights a single possibility
(the possibility that Ann or Bill plays), while (23b) highlights two possibilities (the possibility that Ann
plays, and the possibility that Bill plays). But this can only be if focus affects the computation of H-sets.
For, apart from their focus structures, (23a) and (23b) are perfectly identical.
The intuitive idea that we will pursue is that ‘focus makes H-sets collapse’. Let us first make this more
precise for the case where α is a complete sentence, of type (st):
If α is of type (st), then every element of [[α]]H is a possibility π, a set of indices. The focus feature collapses
all these possibilities into one big possibility, π∈[[α]]H π. This, then, is the unique possibility in [[αF ]]H .4
S
If α is a sub-sentential expression, of some type σ different from (st), then the elements of [[α]]H are
not full-fledged possibilities, so we cannot simply take their union. However, following Partee and Rooth
(1982), we can take their ‘generalized union’:
For our examples, the relevant case is the one where α is of type e. In this particular case, we have:5
S
(26) [[αF ]]H B {λP. y∈[[α]]H P(y)} where P is a variable of type (e(st))
Let us first consider what this means for some disjunctive declaratives with different focus structures:
λw.playw (Ann),
( )
(27) [[ [Ann]F or [Bill]F plays ]]H =
λw.playw (Bill)
(28) [[ [Ann or Bill]F plays ]]H = {λw.playw (Ann) ∪ λw.playw (Bill)}
With narrow focus on each individual disjunct, ‘Ann or Bill plays’ highlights two possibilities. But, as
desired, focus on the whole disjunctive subject NP collapses these two possibilities into one. Now let us
turn to disjunctive interrogatives. First consider the narrow-scope variant. Recall that, by definition, an
interrogative clause [Q α] highlights the same possibilities as α itself. So we have:
4
Notice that this is reminiscent of what is called non-inquisitive closure in inquisitive semantics (cf. Groenendijk and
Roelofsen, 2009), and what is called existential closure in alternative semantics (cf. Kratzer and Shimoyama, 2002).
5
Computing the H-set of a sentence with a focused expression of type e in object position runs into type matching
trouble in the present setup. The ‘problem’ is exactly the same as the one that arises for the interpretation of quan-
tified noun phrases in object position in any system that starts with ‘low types’ (in particular, (e(et)) for transitive
verbs, cf. Heim and Kratzer, 1998). It also has the same solutions: type-lifting, function composition, quantifier
raising, or simply starting out with higher types. For simplicity’s sake, we will not implement any of these possible
solutions here, and simply focus on examples with focused noun phrases in subject position.
λw.playw (Ann),
( )
(29) [[ Q-does [Ann]F or [Bill]F play ]]H =
λw.playw (Bill)
(30) [[ Q-does [Ann or Bill]F play ]]H = {λw.playw (Ann) ∪ λw.playw (Bill)}
Thus, it is predicted that the question ‘Does Ann or Bill play?’ only highlights two distinct possibilities if
it has narrow focus on ‘Ann’ and on ‘Bill’. Wide-scope disjunctive interrogatives on the other hand, always
highlight two distinct possibilities:
λw.playw (Ann),
( )
(31) [[ Q-does [Ann]F play or Q-does [Bill]F play ]]H =
λw.playw (Bill)
The analysis so far yields a number of satisfactory predictions:
(32) Does [Ann or Bill]F play?
a. Highlights the possibility that Ann or Bill plays.
b. yes ⇒ at least one of them plays
c. no ⇒ neither Ann nor Bill plays
(33) Does [Ann]F or [Bill]F play?
a. Highlights the possibility that Ann plays and the possibility that Bill plays.
b. yes ⇒ presupposition failure (the question highlights more than one possibility)
c. no ⇒ neither Ann nor Bill plays
(34) Does [Ann]F play or does [Bill]F play?
a. Highlights the possibility that Ann plays and the possibility that Bill plays.
b. yes ⇒ presupposition failure (the question highlights more than one possibility)
c. no ⇒ neither Ann nor Bill plays
We seem to have obtained a better understanding of the basic difference between block intonation and open
intonation. Now let us consider the effect of closure.
Our basic intuition is that closure suggests that exactly one of the highlighted possibilities can be realized.
(Recall that possibilities embody possible updates of the common ground; as such it makes sense to speak
of them as ‘being realized’.) To see what this amounts to, consider our running examples (35a) and (35b):
(35) a. Does Ann↑ or Bill↓ play the piano? b. Does Ann↑ play the piano, or Bill↓?
These questions both highlight two possibilities: the possibility that Ann plays, and the possibility that Bill
plays. To suggest that exactly one of these possibilities can be realized is to suggest that exactly one of Ann
and Bill plays the piano. In particular, it is to suggest that at least one of them plays, and that they do not
both play. Such a suggestion does indeed seem to be part of what (35a) and (35b) communicate.
There are several ways to formalize this intuition. We will assume here that the meaning of a sentence
α does not just consist of [[α]]P and [[α]]H , but has a third component, [[α]]S , which is the set of possibili-
ties/updates that α suggests. We will refer to [[α]]S as the S-set of α.
We will assume that the S-set of expressions that do not bear a closure-feature is always empty, and
define the semantic contribution of the closure-feature to be as follows:
(36) The effect of closure: [[ αC ]]P B [[α]]P [[ αC ]]H B [[α]]H [[ αC ]]S B EX( [[α]]H )
The definition of [[ αC ]]S makes use of the exclusive strengthening operator EX. For any set of possibilities
Π, and for any possibility π ∈ Π, the exclusive strengthening of π relative to Π is defined as:
EX(π, Π) B π − {ρ | ρ ∈ Π and π * ρ}
S
(37)
and the exclusive strengthening of Π itself is defined as:
11 10 11 10 11 10
01 00 01 00 01 00
(39) Ann↑ or Bill↓ plays the piano. (40) Does Ann↑ or Bill↓ play the piano?
a. No, neither of them does. a. #No, neither of them does.
b. Actually, neither of them does.
The declarative licenses a no response; the interrogative does not. What is the relevant difference between
the two? —The answer is that the declarative really asserts that at least one of Ann and Bill plays the
piano (in the sense that it excludes—technically speaking—the possibility that neither Ann nor Bill plays),
whereas the interrogative merely suggests that at least one of Ann and Bill plays. Thus, this example
illustrates that no can be used to deny an assertion, but not to cancel a suggestion. As illustrated in (40b),
cancellation of a suggestion requires a ‘weaker’ disagreement particle such as actually or in fact (if a
disagreement marker is used at all).8
Thus, no, in response to a question α, denies the possibilities that α highlights, but is felicitous only
if denying these possibilities does not cancel the suggestion that α expresses. This accounts for the con-
trast between (39) and (40), and also for the licensing and interpretation of no in response to disjunctive
interrogatives with block intonation or open intonation.
6
It should perhaps be emphasized that closure is not interpreted here as signaling exhaustivity (as in Zimmermann,
2000). That is, it does not imply that ‘nobody else plays the piano’ or something of that kind. And this is for a
good reason: disjunctive interrogatives with closure intonation generally do not exhibit any exhaustivity effects.
Therefore, closure intonation and exhaustivity effects should be seen as (at least partly) independent phenomena.
7
For reasons of space, we cannot spell out explicitly here how the common ground, and updates thereof, are modeled.
Groenendijk (2008) and Balogh (2009) discuss the notion of a ‘suggestion’ that we make use of here in more detail,
and provide formal definitions of acceptance and cancellation in the broader context of a dialogue management
system.
8
See (Groenendijk, 2008) and (Groenendijk and Roelofsen, 2009) for closely related observations.
Sincerity requirements. Grice’s (1975) quality maxim, formulated in our present terms, says that if a
cooperative speaker s utters a sentence α, then s must take himself to know that at least one of the updates
proposed by α can indeed be established (informative sincerity). In inquisitive pragmatics (Groenendijk and
Roelofsen, 2009), it is further assumed that if α is inquisitive, then for each update that α proposes, s must
be genuinely uncertain as to whether that update can indeed be established or not (inquisitive sincerity).
In the present setting there is a third requirement, namely that if α suggests certain updates, then s must
genuinely expect that exactly one of these updates can indeed be established (expectative sincerity).
One consequence of this is that denying an assertion is much more likely to give rise to conflicts than
canceling a suggestion. For, in the first case, the speaker’s supposed knowledge is contradicted, while the
second case may require merely a revision of expectations. This is illustrated by the following contrast:
(41) A: Ann↑ or Bill↓ is coming tonight. (42) A: Is Ann↑ or Bill↓ coming tonight?
B: No, neither of them is. B: Actually, neither of them is.
A: What?! (# Oh, thanks) A: Oh, thanks.
5 Repercussions
The proposed analysis may shed light on a much wider range of phenomena than the ones explicitly dis-
cussed here. Let us end by briefly mentioning some such phenomena:
Disjunctive declaratives. The analysis directly accounts for the ‘exclusive component’ of declarative dis-
junctions. In particular, it makes the right predictions for sentences like (43), which have received much
attention in the recent literature (see Alonso-Ovalle, 2006, chapter 3, and references given there).
(43) Ann↑ is coming, or Bill↑, or both↓.
Might. Ciardelli, Groenendijk, and Roelofsen (2009) provide an analysis of might in inquisitive semantics.
Adopting this analysis, and assuming that a sentence might α highlights exactly the same possibilities as α
itself, seems to give a satisfactory account of sentences like:
(44) a. Jim might talk to Ann-or-Bill. d. Jim might talk to Ann↑, or he might talk to Bill↑.
b. Jim might talk to Ann↑ or to Bill↑. e. Jim might talk to Ann↑, or he might talk to Bill↓.
c. Jim might talk to Ann↑ or to Bill↓. f. Jim might talk to Ann↑, or to Bill↑, or to both↓.
Ignorance implicatures. Inquisitive pragmatics (in particular the inquisitive sincerity requirement men-
tioned above) accounts for ignorance implicatures triggered by disjunction, questions, and might in a uni-
form way. This account carries over straightforwardly to the extended semantic framework presented here.
Closure variability. One aspect of the data that we abstracted away from entirely is that the rising-and-
falling pitch contour that was taken to signal closure may be pronounced more or less dramatically, and
this seems to correlate with the strength of the corresponding ‘exactly one’ suggestion. This could be
captured by construing the closure feature not as a binary-valued feature—that is either ‘on’ or ‘off’—but
rather as a continuous-valued feature—with values, say, between 0 and 1. Phonologically, this value would
then determine the sharpness of the rising-and-falling pitch contour, and semantically it would determine
the strength of the corresponding ‘exactly one’ suggestion.
Cross-linguistic application. Of course, the syntactic structure and phonological characteristics of dis-
junctive questions differ widely across languages. However, the interpretation of disjunctive questions in
different languages is usually reported to be similar or identical to the interpretation of their English coun-
terparts. Therefore, we suspect that the general semantic mechanisms of proposing, highlighting, and sug-
gesting possibilities may play a role cross-linguistically, even though the way in which these mechanisms
are ‘implemented’ will differ from language to language. To give one example, it seems quite reasonable
to hypothesize that while closure is signaled in English by intonation, it is expressed in other languages
by designated lexical items. Haspelmath (2007) and Alonso-Ovalle (2006, chapter 5) provide data from
Basque, Mandarin Chinese, Finnish, and several other languages that seems to support such a hypothesis.
Bibliography