deterministic4
deterministic4
Abstract
A multi-player Dynkin game is a sequential game in which at every stage one of the players is
chosen, and that player can decide whether to continue the game or to stop it, in which case all
players receive some terminal payoff.
We study a variant of this model, where the order by which players are chosen is deterministic,
and the probability that the game terminates once the chosen player decides to stop may be strictly
less than 1.
We prove that a subgame-perfect -equilibrium in Markovian strategies exists. If the game is not
degenerate this -equilibrium is actually in pure strategies.
© 2003 Elsevier Science B.V. All rights reserved.
JEL classification: C73
1. Introduction
Dynkin (1969) introduced the following zero-sum game of optimal stopping. The game
involves two players, and two stochastic processes: (in )n∈N N is a {1, 2}-valued process,
which indicates which player is active at stage n, and (rn )n∈NN is a R -valued process, which
2
0304-4068/03/$ – see front matter © 2003 Elsevier Science B.V. All rights reserved.
doi:10.1016/S0304-4068(03)00021-1
2 E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19
-optimal strategies for the two players. Dynkin’s -optimal strategies are subgame-perfect
in the sense that after every finite history, the continuation strategy is -optimal in the
subgame defined by that history.
An extensive literature developed from this seminal work. In a discrete time framework,
much attention was paid to the case where the players are allowed to stop simultaneously.
In the zero-sum case, several authors, including Kiefer (1971) and Neveu (1975), pro-
vided sufficient conditions for the existence of the value, when players are restricted to
stopping times. Rosenberg et al. (2001) proved (under a minimal boundedness condition)
that the value always exists, provided the players are allowed to use randomized stop-
ping times. In the two-player non-zero-sum case, Shmaya and Solan (2002) proved that an
ε-equilibrium always exists in randomized stopping times (again, under some boundedness
condition).
Dynkin’s (1969) result implies that in every multi-player Dynkin game (without simulta-
neous moves) an ε-equilibrium exists. Indeed, let σ i be a pure ε-optimal strategy of player i
in the zero-sum game in which player i maximizes his expected payoff, and all other players
try to minimize player i’s payoff. Let σi−i be a pure ε-optimal strategy of i’s opponents in
this game. One can verify that the strategy profile in which each player i follows σ i until
a deviation occurs (since each σ i is pure, a deviation is detected immediately), and upon
−j
deviation of player j all his opponents switch to σj , is a 2ε-equilibrium.
The model of multi-player Dynkin games offers a stylized framework to analyze various
issues of timing games. For example, in situations of shrinking markets (see, e.g. Ghemawat
and Nalebuff, 1985; Fine and Li, 1989), n firms have to decide when to exit a shrinking
market. Once a firm exits, we remain with a market with n − 1 firms, which can be solved
inductively, hence the overall game reduces to a Dynkin game.
A similar situation occurs in takeover games, where n firms strategically decide to make
a takeover attempt on opponent firms.
Another related model is that of multi-player duels, or n-uels (see, e.g. Kilgour, 1975,
1977 or Kilgour and Brams, 1997). In this model, n gunners alternately have the option to
shoot one of their opponents or to abstain. Since once a gunner hits one of his opponents
we are left with a game with n − 1 players, which can be solved inductively, the game is
essentially reduced to a Dynkin game where players have several stop actions.
As the ε-equilibrium we presented above involves threats of punishment, which might
be non-credible, it is desirable to know whether a subgame-perfect ε-equilibrium exists for
every ε > 0. To this day, it is still not known whether every multi-player Dynkin game has
an ε-equilibrium.
When |I| = 2, the proof of Shmaya and Solan (2002) can be used to show the existence
of a subgame-perfect ε-equilibrium. Solan (2002) uses the theory of differential inclusions
to prove the existence of a subgame-perfect ε-equilibrium when (i) the sequence (in ) is
i.i.d., and (ii) rn depends only on in (so that the terminal payoff depends only on the identity
of the player who terminates the game).
In the present paper, we analyze the following class of I-player games. A deterministic
sequence (in , pn , rn ) ∈ I × [0, 1] × R I is given. At each stage n, player in chooses whether
E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19 3
to continue or to stop. If he continues, the game continues to the next stage, while if he
stops a lottery is performed. With probability pn the game terminates, yielding the payoff
rn , while with probability 1 − pn the game continues.
The assumption that the order of players is deterministic is restrictive but sometimes
relevant. On the other hand, allowing the probability of termination to be strictly less than
1 is quite natural: a takeover attempt is not always successful, and the accuracy of a gunner
is not always perfect.
Our main result states that if the sequence (rn ) of payoffs is bounded, a subgame-perfect
ε-equilibrium in Markovian strategies exists. Moreover, unless the game is degenerate,
this ε-equilibrium is in pure strategies. However, in degenerate cases, a subgame-perfect
0-equilibrium need not exist. Since the subgame-perfect -equilibrium we identify is in
Markovian strategies, it is robust to the information players receive along the game; all they
need to know is the stage of the game. Translated to the n-uel model, this means that there
is a subgame-perfect -equilibrium which is also a subgame-perfect -equilibrium in the
silent n-uel, in which players do not observe missed shots.
In degenerate cases, there need not be subgame-perfect ε-equilibria in pure strategies. This
is to be contrasted with (i) finite games of perfect information and (ii) two-player zero-sum
Dynkin games, where a subgame perfect (ε-) equilibrium in pure strategies always exists.
We hope that the combination of the arguments we use here with the techniques presented
by Shmaya and Solan (2002) and Solan (2002) can be used to further study multi-player
Dynkin games.
Another motivation to our study is linked to the observation that deterministic Dynkin
games form a simple class of stochastic games. By now, some results are available on
the existence of equilibrium payoffs in multi-player stochastic games, see Solan (1999)
and Vieille (2000). By contrast, apart from few classes of games, there are no results on
the existence of subgame-perfect equilibrium payoffs and useful techniques are yet to be
found. We hope that this paper will contribute to this emerging literature.
The paper is arranged as follows. In Section 2 we present the model and the main result.
Several examples appear in Section 3. The proof of the main result appears in Section 4.
the game, including random choices by the players, are independent). With probability pn
the game terminates, and the terminal payoff for the players is given by the vector rn . With
probability 1 − pn the game continues to stage n + 1. If the game never terminates, the
payoff is zero for all players.1
We denote by θ the termination stage of the game, i.e. the first stage in which a player
decides to stop and the game terminates. Thus, the payoff to player i ∈ I is rθi 1 θ<∞ .
A strategy of player i ∈ I maps the set of information sets of player i to the set of mixed
moves of player i. We let Ni = {n ∈ N |in = i} be the set of stages in which player i is
active.
We are going to restrict the players to Markovian strategies; namely, strategies that depend
only on the stage, and not on the history. We will prove below that the game admits a
subgame-perfect ε-equilibrium in Markovian strategies. By a general observation (see,
e.g. Fudenberg and Tirole (1991, p. 501)), this subgame-perfect ε-equilibrium remains a
subgame-perfect ε-equilibrium without the restriction to Markovian strategies.
In the present context, a (behavior Markovian) strategy of player i is a function σ i : Ni →
[0, 1], where σ i (n) is the probability assigned by player i to stop at stage n, provided the
game does not terminate before that stage. We denote the set of strategies of player i by Σ i .
A strategy profile (or simply a profile) is a vector σ = (σ i )i∈I of strategies, one for each
player.
Every strategy profile σ ∈ ×Σi∈I i induces a probability distribution P over the space of
σ
plays, or infinite histories. The corresponding expectation operator is E σ . Thus, the expected
payoff to player i given a strategy profile σ is
γ i (σ) ≥ γ i (σ −i , τ i ) − ε.
We mention that, for any ε > ε, an ε-equilibrium is a uniform ε -equilibrium; that is, it is
an ε -equilibrium (a) in every discounted game, provided the discount factor is sufficiently
small, and (b) in every N-stage game, provided N is sufficiently large. Indeed, the proof
provided in Solan and Vieille (2001, Proposition 2.13) does adapt to the present framework.
For n ∈ N , we denote by γn (σ) the expected payoff induced by the strategy profile σ in
the subgame starting at stage n.
A strategy profile is a subgame-perfect (ε-) equilibrium of a game if it induces an (ε-)
equilibrium in any subgame. In the present context, this amounts to the following definition.
1 Equivalently, we may assume that, with probability p , player i is given the opportunity to stop for sure. For
n n
each strategy profile, the payoff is the same under both interpretations of the game.
E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19 5
Theorem 1. Let Γ = (I, (in , pn , rn )n∈NN ) be a deterministic Dynkin game. If the se-
quence (rn )n∈N
N is bounded, then for every ε > 0 the game Γ admits a subgame-perfect
ε-equilibrium in Markovian strategies.
3. Examples
In the present section we provide several examples that illustrate the main features of the
model.
In words, at the first stage, player 1 can stop the game, thereby yielding the payoff vector
(1, 0, 3). If player 1 chooses to continue, at the second stage player 2 can stop the game,
yielding the terminal payoff (3, 1, 0). If player 2 chooses to continue as well, at the third
stage player 3 can stop the game, yielding the terminal payoff (0, 3, 1). The process then
repeats itself cyclically. This game is a variation upon a game studied by Flesch et al. (1997).
We will characterize all pure subgame-perfect 0-equilibrium profiles of that game, using
backward induction.
Let σ be such a 0-equilibrium. Assume that at stage 3n, for some n ≥ 2, player 3 stops
with probability 1; that is, σ 3 (3n) = 1. In particular, γ3n (σ) = (0, 3, 1).
Consider the subgame starting at stage 3n−1. In that subgame, player 2 receives γ3n 2 (σ) =
On the other hand, if σ 3 (3n) = 0 for some n ≥ 2, then σ 3 (3n − 3) = 1 and the previous
analysis holds.
Thus, there are two pure subgame-perfect 0-equilibria: (a) at odd stages the active player
stops, and at even stages the active player continues, and (b) at even stages the active player
stops, and at odd stages the active player continues.
In each pure equilibrium, the players agree on who shoots first. We believe that the
interpretation of these two equilibria is quite appealing: suppose there are three gunners. If
gunner 1 thinks that gunner 2 is going to shoot tomorrow gunner 3 (or gunner 1 if gunner
3 is already dead), he has no reason to shoot today: he is better off by letting gunner 2 be
done with gunner 3, and shoot gunner 2 the next time he can. On the other hand, if gunner
1 thinks that gunner 2 is not going to shoot tomorrow if gunner 3 is still alive, but shoot
gunner 1 if gunner 3 is already dead, and that gunner 3 is going to shoot him the day after,
he is indifferent between shooting and not shooting gunner 2, as he is going to die anyway,
so he can as well shoot gunner 2 today.
Remark 1. This game admits other subgame-perfect equilibria. In particular, the profile in
which each player stops with probability 1/2 whenever active, is a subgame-perfect equili-
brium. In a sense, it corresponds to the cyclic equilibrium constructed by Flesch et al. (1997).
Example 2. Consider the following modification of Example 1, where I = {1, 2, 3}, and
(1, 1, (1, 0, 3)),
n = 1 modulo 3,
(in , pn , rn ) = (2, 1/2, (3, 1, 0)), n = 2 modulo 3,
(3, 1/2, (0, 3, 1)), n = 0 modulo 3.
Thus, when player 1 stops the game terminates with probability 1, while when either player
2 or player 3 stops the game terminates with probability 1/2.
As we did in Example 1, we characterize the set of subgame-perfect 0-equilibrium in
pure strategies. Let σ be such a strategy profile. Let n > 0 and i be the active player at stage
n. By the subgame-perfect equilibrium condition, σ i (n) = 1 if γn+1i (σ) < 1 and σ i (n) = 0
i
if γn+1 (σ) > 1.
Let n ≥ 3, and assume that σ 1 (3n + 1) = 1. Then γ3n+1 (σ) = (1, 0, 3), and therefore
σ 3 (3n) = 0. This implies that γ3n (σ) = γ3n+1 (σ) = (1, 0, 3), and therefore σ 2 (3n−1) = 1.
It follows that
γ3n−1 (σ) = 21 (3, 1, 0) + 21 (1, 0, 3) = (2, 21 , 23 ),
and therefore σ 1 (3n − 2) = σ 3 (3n − 3) = 0 and σ 2 (3n − 4) = 1.
E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19 7
Then,
γ3n−4 (σ) = 21 (3, 1, 0) + 21 (2, 21 , 23 ) = ( 25 , 43 , 43 ),
Fix ε ∈ (0, 1), and let σ be the strategy profile defined by σ 1 (2n+1) = 1 and σ 2 (2n+2) =
ε for every n ≥ 0. We claim that σ is a subgame-perfect ε-equilibrium. One should verify
that player 1 (respectively player 2) cannot profit by deviating in the subgames that start
at odd (respectively even) stages. Consider first the subgame that starts at stage 2n + 1,
for some n ≥ 0. By stopping at stage 2n + 1 player 1 receives −1, while, since player 2
eventually stops with probability 1, player 1’s payoff is at most −1, whatever he plays. In the
subgame starting at stage 2n + 2, player 2’s expected payoff under σ is ε + 2(1 − ε) = 2 − ε,
whereas the maximal payoff to player 2 in the game is 2.
8 E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19
We next prove that the game has no subgame-perfect ε-equilibrium in pure strategies.
Assume to the contrary that there exists such a profile σ.
We first claim that there is an infinite set of even stages in which player 2 chooses to
stop. Otherwise, let N be the maximal integer such that player 2 stops at stage 2N (set
N = 0 if player 2 never stops). Consider now the subgame that starts at stage 2N + 2.
By the definition of N, player 2 never stops in this subgame. Since σ is a subgame-perfect
ε-equilibrium, this implies that under σ player 1 never stops in this subgame: by never
stopping he receives 0, while by stopping he receives −1. But this leads to a contradiction,
as it implies that player 2 can profit 1 by deviating: by never stopping he receives 0, while
by stopping he receives 1.
We next claim that there is at most one even stage in which player 2 chooses to stop. To-
gether with the previous paragraph, this shows that there cannot be a subgame-perfect
ε-equilibrium. Assume that player 2 stops at stage 2N, with N > 1. Since σ induces an
ε-equilibrium in the subgame that starts at stage 2N − 1, and since player 2 stops at stage
2N, under σ player 1 stops at stage 2N − 1. However, since player 1 stops at stage 2N − 1,
under σ player 2 continues in all stages 2k for k < N: by continuing in all these stages he
receives 2, while his payoff upon stopping is 1.
This example shows that pure subgame-perfect ε-equilibria need not exist. Such a case
may arise when there is a player i who by stopping gives everyone else high payoff, but
he himself receives low payoff. It is then in the interest of his opponents to threaten him
that if he does not stop, one of them will eventually stop and punish player i. The punisher,
however, stops with low probability, so that player i has a chance to correct his behavior
and stop the game at a later stage.
We finally prove that there is no subgame-perfect 0-equilibrium. We argue by contra-
diction, and we let σ be a subgame-perfect 0-equilibrium. For i = 1, 2, we denote by ci
the strategy that always continues, i.e. ci (n) = 0 for each n ∈ Ni . Note first that, for each
n ∈ N , one has
Indeed, the sequence (PP σ (θ < +∞|θ ≥ n))n∈N N would otherwise decrease to zero, hence
the sequence (γn (σ))n∈NN would converge to zero, and player 2 would have a profitable
deviation in the subgame starting at stage n, for n large enough. By (1) the game terminates
with probability 1, hence at least one of the players eventually stops with probability 1:
or
If (2) holds, then c2 is the best reply to σ 1 in all subgames, hence σ 2 = c2 . Since the unique
best reply of player 1 to c2 is c1 , one gets σ = (c1 , c2 )—a contradiction to (1).
If (3) holds, there are infinitely many even integers n such that σ 2 (n) > 0. By optimality
of σ 1 , and since (3) holds, one has σ 1 (n − 1) = 1 for any such n. Therefore, (2) holds—a
contradiction.
E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19 9
4.1. Preliminaries
In this subsection, we analyze few degenerate cases, and slightly rephrase the problem.
The core of the proof of Theorem 1 is in Section 4.4.
Let Γ = (I, (in , pn , rn )n∈N N ) be a deterministic Dynkin game. Since the sequence (rn )n∈N N
is bounded, we can assume w.l.o.g. that payoffs are bounded by 1.
Let Γ̃ = (I, (in , pn , r̃n )n∈N
N ) be another game with the same sequence of active players
and the same probabilities of success. Since the payoff functions of the two games differ
by at most supn∈N N rn − r̃n , any subgame-perfect ε-equilibrium of Γ̃ is a subgame-perfect
ε -equilibrium of Γ , where ε = ε + supn∈N N rn − r̃n .
Since we are looking for an ε-equilibrium, and since payoffs are bounded, there is no
loss of generality in assuming that the range of the sequence (rn )n∈N N is finite, and that if
(i, r) and (j, r̃) are two distinct elements in that range then r k = r̃k for every k ∈ I.
Notice now that Theorem 1 will follow if we prove that there is a subgame-perfect
ε-equilibrium in some subgame of Γ . Indeed, the conclusion for Γ will then follow by
applying backward induction to the first stages of the game. Moreover, since finite extensive
games with perfect information have pure subgame-perfect equilibria, the resulting profile
will be pure when the subgame-perfect ε-equilibrium of the subgame is pure.
Let IR be the finite range of the sequence (in , rn )n∈N N . For each (i, r) ∈ IR define
π(i, r) = {pn |n ∈ N , (in , rn ) = (i, r)}
and set IR∞ = {(i, r) ∈ IR|π(i, r) = +∞}. If π(i, r) = +∞ then if player i stops whenever
(in , rn ) = (i, r), and all players continue in all other stages, the game will eventually
terminate, and the terminal payoff will be r.
We now argue that we may assume w.l.o.g. that
π(i, r) = +∞ for each (i, r) ∈ IR. (4)
As a first step, we prove that we may assume w.l.o.g. that
π(i, r) = 0 for each (i, r) ∈
/ IR∞ . (5)
Choose first N ∈ N large enough such that
pn < ε/|IR| for each (i, r) ∈
/ IR∞ . (6)
n≥N:(in ,rn )=(i,r)
Such an N exists since IR is a finite set. Denote by ΓN the subgame that starts at stage
N. Let Γ̃N = (I, (in , p̃n , rn )n∈N
N ) be the game that coincides with ΓN except that p̃n = 0
whenever (ĩn , r̃n ) ∈
/ IR∞ .
By (6), the payoff functions of the two games ΓN and Γ̃N differ by at most 2ε. Therefore,
any subgame-perfect ε-equilibrium of Γ̃N is a subgame-perfect 3ε-equilibrium of ΓN , and,
by backward induction, yields a subgame-perfect 3ε-equilibrium of Γ .
10 E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19
As the game Γ̃N satisfies (5), one can assume w.l.o.g. that (5) holds.
Thus, we are led to analyze games such that, for each (i, r) ∈ IR, either π(i, r) = 0
or π(i, r) = +∞. If π(i, r) = 0 for each (i, r) ∈ IR (so that IR∞ = ∅), the payoff
function of the game is identically zero and the conclusion of Theorem 1 follows tri-
vially.
Assume now that IR∞ = ∅. Consider the game obtained by dropping all stages n such that
π(in , rn ) = 0 (and by relabeling stages). Since there are infinitely many stages n such that
π(in , rn ) = +∞, the resulting game is again a deterministic multi-player Dynkin game.
Plainly, any subgame-perfect ε-equilibrium of this new game is also a subgame-perfect
ε-equilibrium of the initial game (with the proper identification of stages, and with an
arbitrary behavior in the stages that have been dropped).
It follows that we can assume w.l.o.g. that (4) holds.
In the present section we fix ε < 1/40. Given ε, we define a partition of the set N of
stages into blocks. This partition will be used in the sequel to prove Theorem 1.
We will use the following technical result.
Lemma 1 (Rosenberg et al., 2002, Lemma 18). Let n ∈ N , and let p1 , . . . , pn be non-
negative reals that satisfy ni=1 pi < 1/20. Then,
n
n 2 n i−1 n
pi − 20 pi ≤ pi (1 − pj ) ≤ pi .
i=1 i=1 i=1 j=1 i=1
Observe that ni=1 pi i−1j=1 (1 − pj ) is the probability that the result of at least one out
of n coins with parameters p1 , . . . , pn is Head. In particular it is equal to 1 − ni=1 (1 − pi ).
Corollary 1. Let ε
< 1/40, n ∈ N , and p1 , . . . , pn be non-negative reals that satisfy
n n i−1
p
i=1 i ≥ ε. Then i=1 pi j=1 (1 − pj ) ≥ ε/2.
Proof.
The proof is divided into three cases.
If ni=1 pi ≤ 1/20 the claim follows from Lemma 1 and since ε < 1/40.
If there is i such that pi ≥ ε the claim holds trivially.
Otherwise, there is a subset I ⊂ {1, . . . , n} such that 1/20 − ε ≤ i∈I pi ≤ 1/20. Then
n
i−1 n
pi (1 − pj ) = 1 − (1 − pi ) ≥ 1 − (1 − pi )
i=1 j=1 i=1 i∈I
2
1 1
≥ − ε − 20 −ε ≥ ε − 20ε2 ≥ ε/2,
20 20
where the second inequality follows from Lemma 1, and the third one holds since the
function x − 20x2 is monotonic decreasing for x < 1/20 and since ε < 1/40. 䊐
E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19 11
We are now ready to define the partition of N into blocks. Set n0 = 1 and, for l ∈ N ,
define the initial stage nl of block l to be
nl = min n > nl−1 | pk ≥ ∀(i, r) ∈ IR .
nl−1 ≤k<n,(ik ,rk )=(i,r)
Under the assumption that π(i, r) = +∞ for each (i, r) ∈ IR, the proof proceeds by
induction over the number of elements in IR. The conclusion is easy if |IR| = 1, and is left
to the reader.
We now analyze a somewhat degenerate case that generalizes Example 2. This is the only
place in the proof where we use the induction hypothesis.
The lemma states that if there is a terminal payoff r that is preferred by each player i to all
terminal payoffs i controls, then a subgame-perfect ε-equilibrium exists.
Proof. We assume w.l.o.g. that ε < 1/40, and we split the discussion into three cases.
Case 1. ri ≥ 0.
Let σ be the pure strategy profile in which player i stops whenever (in , rn ) = (i, r), and
all players continue in all other stages, i.e.
Fix n ∈ N . We prove that σ induces a 0-equilibrium in the subgame that starts at stage
n. Since π(i, r) = +∞, the game eventually terminates, and therefore the expected payoff
is r. Player i cannot gain by deviating, since his payoff is at most ri if he terminates the
game, and 0 ≤ ri if he always continues. Every player j = i cannot gain by deviating either,
since his payoff under σ is r j , while if he deviates his payoff is in the convex hull of r j and
{r̃j , (j, r̃) ∈ IR}, hence at most r j .
Case 2. r i < 0, and there is (j, r̃) ∈ IR such that i = j and r̃i < ri .
In this case, we elaborate upon the construction in Example 3. We will have player i stop
at all stages in {n ∈ N : (in , rn ) = (i, r)}, and player j stop with some small probability at
stages in {n ∈ N : (in , rn ) = (j, r̃)}. The choices of the corresponding probabilities should
12 E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19
fulfill two conditions: (i) these values should be small enough so that γn (σ) − r < ε,
for each n ∈ N and (ii) they should be high enough so that, if player i were to continue at
all stages, the game would still stop a.s. in finite time. These two conditions relate to the
two dual aspects of the threat. By condition (i) the threat will be used on the equilibrium
path with small probability. By condition (ii) it will provide incentives to player i to act as
required.
Recall the partition of N into blocks that was defined in Section 4.2, and that nl is the
first stage of block l, l ≥ 0.
Since for every l ≥ 0 one has nl ≤n<nl+1 :(in ,rn )=(j,r̃) pn ≥ ε, there is a function x:
N → [0, 1] such that for every l ≥ 0 one has nl ≤n<nl+1 :(in ,rn )=(j,r̃) xn pn = ε2 .
We let σ be the strategy profile in which player i stops whenever (in , rn ) = (i, r), player j
stops with probability xn whenever (in , rn ) = (j, r̃), and all players continue otherwise, i.e.
1, if (in , rn ) = (i, r),
in
σ (n) = xn , if (in , rn ) = (j, r̃),
0, otherwise.
Proposition 1. Let Γ be a deterministic multi-player Dynkin game. Assume that for every
(i, r) ∈ IR, (i) π(i, r) = +∞, and (ii) there is (j, r̃) ∈ IR such that r̃j > rj . Then, for every
ε > 0, the game Γ has a subgame-perfect ε-equilibrium in pure Markovian strategies.
E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19 13
Note that Example 3 does not fit into Proposition 1. We do not know whether a subgame-
perfect 0-equilibrium exists or not. The rest of this section is devoted to the proof of the
proposition.
As remarked at the beginning of Section 4.1, we can assume w.l.o.g. that for every
(i, r), (j, r̃) ∈ IR, either (i, r) = (j, r̃), or r k = r̃k for every k.
For every i ∈ I set
mi = max{ri |(i, r) ∈ IR}.
This is the maximal terminal payoff player i can receive when stopping alone. Let ρi ∈ R I
be the unique vector r such that (i, r) ∈ IR and r i = mi (uniqueness is guaranteed by the
preceeding paragraph).
Finally, set
W = {w ∈ R N |wi ≤ mi for some i ∈ I}.
This is the set of all payoff vectors w such that at least one player is better off by stopping
at some stage rather than continuing forever and receiving w.
An important property of the set W is that if the continuation payoff at stage n is w ∈
W, and if player in prefers to stop rather than continue (that is, win ≤ rnin ), then the
expected payoff if player in stops at stage n, (1 − pn )w + pn rn , is in W. Formally, for every
n ∈ N,
w ∈ W and win ≤ rnin imply (1 − pn )w + pn rn ∈ W. (7)
Indeed, under the assumptions, (1 − pn )win + pn rnin ≤ rnin ≤ min , and (7) follows.
We will prove the existence of a subgame-perfect ε-equilibrium. We assume w.l.o.g. that
ε < 1/40, and that furthermore ε < 21 min(i,r)=(j,r̃) |r i − r̃ i |.
Let l ∈ N be given. We will define a pure profile σl up to stage nl . We will simultaneously
nl
construct a sequence (wl (n))n=1 of vectors in W. As a first approximation, the vector wl (n)
may be interpreted as the expected payoff under σ from stage n onwards.
As for now, we fix l ∈ N and we write σ and w instead of σl and wl , respectively.
We define both σ and w backwards. We let w(nl ) be an arbitrary point in W ∩ [−1, 1]I .
We deal with each of the blocks inductively (starting with the lth one). Let k ≤ l. Assuming
w(nk ) ∈ W is already defined, we define now σ and w over the stages n = nk−1 , . . . ,
nk − 1.
Given w(n + 1) and σ in (n), we set w(n) = σ in (n)pn rn + (1 − σ in (n)pn )w(n + 1), so
that we need only define σ in (n). Thus, if w(n + 1) is the expected payoff from stage n + 1
onwards, w(n) is the expected payoff from stage n onwards. We will define σ in (n) such
that (i) σ in (n) is pure, and (ii) σ in (n) = 1 implies rnin ≥ win (n + 1). Since w(nl ) ∈ W and
by (7), this implies that w(n) ∈ W for every n ≤ nl .
Fix i∗ ∈ I such that wi∗ (nk ) ≤ mi∗ . Since w(nk ) ∈ W, such a player exists. We will
define σ so that at the final stages of the block only player i∗ will possibly stop. In earlier
stages, σ will be defined using backward induction as in Case 1.
Formally, let nk−1 ≤ n < nk . Assume that σ has been defined for stages q = n +
1, . . . , nk − 1. We define σ at stage n as follows. Denote by π(n + 1, nk ) the probability
under σ that, starting from stage n + 1, the game terminates under σ before stage nk , i.e.
π(nk , nk ) = 0, and
π(q, nk ) = σ iq (q)pq + (1 − σ iq (q))π(q + 1, nk ) for n + 1 ≤ q < nk .
Then:
• if π(n + 1, nk ) < ε, we set σ in (n) = 1 if both in = i∗ and rni∗ ≥ wi∗ (n) hold. We set
σ in (n) = 0 otherwise;
• if π(n + 1, nk ) ≥ ε, we set σ in (n) = 1 if rnin ≥ win (n), and σ in (n) = 0 otherwise.
We now prove that under σ, the probability of termination in any single block is bounded
away from zero.
P σ (θ < nk+1 |θ ≥ nk ) ≥ 13 ε.
Proof. We will prove that π(nk , nk+1 ) ≥ ε/3. We consider Cases 1 and 2 in turn.
We first assume that Case 1 holds, and we let i∗ ∈ I be a player such that wi∗ (nk+1 ) ≤
i
m ∗ − .
1. If σ i∗ (n) = 1 whenever (in , rn ) = (i∗ , ρi∗ ), one has by Corollary 1 π(nk , nk+1 ) ≥ ε/2.
2. If σ i∗ (n) = 0 for some n such that (in , rn ) = (i∗ , ρi∗ ), then wi∗ (n + 1) > mi∗ . Observe
now that, since payoffs are bounded by one, one has
and ρii∗∗ = mi∗ . Therefore, σ i∗ (n) = 1 whenever (in , rn ) = (i∗ , ρi∗ ), and one gets
π(nk , nk+1 ) ≥ ε/2, as in Case 1, item 1. 䊐
We will now let l vary and we denote by σl and wl the objects that were defined above.
The pure strategy profile σl may be identified with a point in {0, 1}N (the nth component
being the behavior at stage n of the active player in ). Since the product space {0, 1}N is
compact (and metrizable), the sequence (σl )l≥0 has a subsequence that converges to some
pure strategy profile σ∗ . For notational convenience, we still denote this subsequence by
(σl )l≥0 . Since σl is a pure strategy for every l ∈ N , for every fixed n ∈ N the first n
components of σ∗ coincide with the first n components of σl , provided l is sufficiently large.
For such l’s, the behavior in the first n stages of the game under the two strategy profiles σ∗
and σl coincide.
Our goal is to prove that σ∗ is a subgame-perfect ε-equilibrium. We first prove that the play
terminates P σ∗ -a.s. in each subgame. We will then relate the payoff γ(σ∗ ) to the sequence
(wl )l∈N
N (Lemma 4) and prove that no player has a profitable one-stage deviation (Lemma 5)
under σ∗ . The conclusion follows (Proposition 2), after we prove that no single player is
responsible for the termination of the game (Lemma 6).
Proof. Let l > k be large enough so that σ∗ coincides with σl up to stage nk+1 , and apply
Lemma 3. 䊐
Proof. We prove the result for n = 1. The proof is similar for the subgame that starts at
any stage n ∈ N .
Let k ∈ N be given. For each l ≥ k, one has
γ(σ∗ ) = E σ∗ [rθ 1θ<nk ] + P σ∗ (θ ≥ nk )γnk (σ∗ ),
and
wl (1) = E σl [rθ 1θ<nk ] + P σl (θ ≥ nk )wl (nk ).
For l large enough, the two profiles σl and σ∗ coincide up to stage nk . In particular,
E σ∗ [rθ 1θ<nk ] = E σl [rθ 1θ<nk ] and P σ∗ (θ ≥ nk ) = P σl (θ ≥ nk ). By Corollary 2
The next lemma says in substance that no player can increase his payoff by more than
3ε by modifying his strategy in a single stage.
16 E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19
Proof. Let n ∈ N be given. Let l ∈ N be sufficiently large so that nl > n. We first prove a
related statement for the strategy profile σl . Let k < l be determined by nk ≤ n < nk+1 .
By construction, σlin (n) is defined using backward induction, except in one case where
σl (n) is required to be zero. In Case 1 one has σlin (n) = 1 if rnin ≥ wiln (n+1) and σlin (n) = 0
in
otherwise. In Case 2 one has wiln (nk+1 ) ≥ min − ε and π(n + 1, nk+1 ) < ε. Therefore,
|wiln (nk+1 ) − wiln (n + 1)| < 2ε, which yields wiln (n + 1) ≥ min − 3ε ≥ rnin − 3ε. Hence,
in both cases, one has
We now prove that the play terminates a.s., even if a single player chooses to continue
whenever active. Recall that ci is the strategy of player i that always continues.
Proof. We argue by contradiction, and we assume that, for some player i ∈ I, the sequence
P ci ,σ∗−i (θ < +∞|θ ≥ n) converges to zero when n goes to +∞. By Corollary 2 the game
eventually terminates, so that P σ∗ (θ < +∞|θ ≥ n) = 1 for every n. Therefore, it must be
the case that player i terminates the game: P σ∗i ,c−i (θ < +∞|θ ≥ n) = 1 for every n, and
limn→+∞ γn (σ∗ ) − γn (σ∗i , c−i ) = 0.
We first prove that limn→+∞ γn (σ∗ ) = ρi , and then deduce a contradiction with the basic
assumption made on Γ .
if σ∗in (n) = 1. Therefore, for every two positive integers n ≥ m, one has
i
γm (σ∗ ) ≥ γni (σ∗ ) − 2 pq1 iq . (8)
σ∗ (q)=1
m≤q<n;q∈N
/ i
E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19 17
Let ε̃ ∈ (0, 1/40) be given. Choose Nε̃ ∈ N sufficiently large so that P ci ,σ∗−i (θ <
+∞|θ ≥ Nε̃ ) < ε̃/2. For such Nε̃ , one has by Corollary 1 Nε̃ ≤q<+∞;q∈N
/ i pq 1 iq ≤
σ∗ (q)=1
ε̃. Therefore, by (8),
i
γm (σ∗ ) ≥ γni (σ∗ ) − 2ε̃, for every n ≥ m ≥ Nε̃ .
Proof. Let i ∈ I be given. We prove that player i cannot gain more than 3ε by deviating from
σ∗ . The same proof will hold in any subgame, thereby showing the subgame-perfectness
property.
Define the sequence (Xn )n∈N i i
N of random variables by Xn = rθ if θ < n and Xn = γn (σ∗ )
i
if θ ≥ n. Let τ be an arbitrary strategy of player i. By Lemma 6, the sequence (Xn )n∈N N
converges P τ i ,σ∗−i -a.s. to X∞ := rθ 1 θ<+∞ , hence
E
lim E i −i [Xn ] = E τ i ,σ∗−i [rθ 1 θ<+∞ ] = γ i (τ i , σ∗−i ). (9)
n→+∞ τ ,σ∗
18 E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19
On the other hand, let n ∈ N , and denote by Hn the past play up to stage n. We shall
prove that
E τ i ,σ∗−i [Xn+1 |Hn ] ≤ Xn + 3ε11θ=n , a.s. (10)
On the event θ < n, both Xn and Xn+1 are equal to rθi . Consider now the event θ ≥ n. If
in = i, one has
Xn = γni (σ∗ ) = E σ∗ [Xn+1 |Hn ] = E τ i ,σ∗−i [Xn+1 |Hn ],
where the last equality follows since the two profiles (τ i , σ∗−i ) and σ∗ coincide at stage n.
In both cases, (10) follows trivially. Finally, if in = i, one has Xn = Xn+1 = γn+1 i (σ ) if
∗
in i (σ ) otherwise. Inequality (10)
σ∗ (n) = 0 and E τ i ,σ∗−i [Xn+1 |Hn ] = pn rni + (1 − pn )γn+1 ∗
then follows by Lemma 5.
By taking expectations in (10), and by summing over n, one obtains limn→+∞E τ i ,σ∗−i [Xn ]
≤ X1 + 3ε which yields, by (9),
γ i (τ i , σ∗−i ) ≤ γ i (σ∗ ) + 3ε. 䊐
Acknowledgements
References
Dynkin, E.B., 1969. Game variant of a problem on optimal stopping. Soviet Mathematics Doklady 10, 270–274.
Fine, C.H., Li, L., 1989. Equilibrium exit in stochastically declining industries. Games and Economic Behavior
1, 40–59.
Flesch, J., Thuijsman, F., Vrieze, K., 1997. Cyclic Markov equilibria in stochastic games. International Journal of
Game Theory 26, 303–314.
Fudenberg, D., Tirole, J., 1991. Game Theory. MIT Press, Cambridge, MA.
Ghemawat, P., Nalebuff, B., 1985. Exit. RAND Journal of Economics 16, 184–194.
Kiefer, Y.I., 1971. Optimal stopped games. Theory of Probability and its Application 16, 185–189.
Kilgour, D.M., 1975. The sequential truel. International Journal of Game Theory 4, 151–174.
Kilgour, D.M., 1977. Equilibrium points in infinite sequential truels. International Journal of Game Theory 6,
167–180.
Kilgour, D.M., Brams, S.J., 1997. The truel. Mathematics Magazine 70, 315–325.
Neveu, J., 1975. Discrete-Parameter Martingales. North-Holland, Amsterdam.
Rosenberg, D., Solan, E., Vieille, N., 2001. Stopping games with randomized strategies. Probability Theory and
Related Fields 119, 433–451.
Rosenberg, D., Solan, E., Vieille, N., 2002. On the maxmin value of stochastic games with imperfect monitoring.
Discussion Paper 1337. The Center for Mathematical Studies in Economics and Management Science, Kellogg
School of Management, Northwestern University.
Shmaya, E., Solan, E., 2002. Two-player non-zero-sum stopping games in discrete time. Discussion Paper 1347.
The Center for Mathematical Studies in Economics and Management Science, Kellogg School of Management,
Northwestern University.
E. Solan, N. Vieille / Journal of Mathematical Economics 1097 (2003) 1–19 19
Solan, E., 1999. Three-person absorbing games. Mathematics of Operations Research 24, 669–698.
Solan, E., 2002. Subgame perfection in quitting games with perfect information and differential equations.
Discussion Paper 1356. The Center for Mathematical Studies in Economics and Management Science, Kellogg
School of Management, Northwestern University.
Solan, E., Vieille, N., 2001. Quitting games. Mathematics of Operations Research 26, 265–285.
Vieille, N., 2000. Equilibrium in 2-person stochastic games. II. The case of recursive games. Israel Journal of
Mathematics 119, 93–126.