0% found this document useful (0 votes)

13 views

Lecture 04

Uploaded by

b7ng.1119

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views

Lecture 04

Uploaded by

b7ng.1119

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Extensive form games

December 22, 2022

1 Examples
1.1 Extensive form games with perfect information

• An extensive game can usually represented by a tree graph.

• Terminals nodes are outcomes of a game, and payoffs of all players are attached to some outcome.

• Some players are active (have at least two actions) on some nodes, and these nodes are called
moving nodes.
• Actions of players are represented by branches.
• In an extensive form game of perfect information, each player observe all actions that had been
chosen by others before she moves

1.2 Extensive form games with imperfect information

• The dotted line indicate an information set.

• Nodes that some player cannot distinguish are in the same information set of that player.
• In an EFG with perfect information, each information set contains exactly one moving node.
• In an EFG with imperfect information, some information sets contain at least two moving nodes.

1
2 Theory
2.1 Representation of an extensive form game
• A game tree consists of a collection of nodes, X, and a binary relation ≻ such that, for any
x, y ∈ X, x ≻ y means “x come after y”.
– The initial node of the tree is called root.
– For example, recall the bargaining game, X = {1, 2, 3, 4, 5, 6, 7}.
– 2 ≻ 1 , 6 ≻ 3, and 6 ≻ 1, but not 6 ≻ 2.
• The relation ≻ satisfies the following
– asymmetry: there is no x, y ∈ X such that x ≻ y and y ≻ x
– transitivity: for any x, y, z ∈ X, if x ≻ y and y ≻ z, then x ≻ z
– common predecessor for non-initial nodes: for any two nodes y1 , y2 ∈ X, if there exists some
nodes x1 and x2 such that y1 ≻ x1 and y2 ≻ x2 , then there exists a node z ∈ X such that
y1 ≻ z and y2 ≻ z.
– and, if x ≺ y and z ≺ y, then it is either x ≻ z or z ≻ x.
• Using this relation, we further define
– set of predecessors of node x ∈ X by P (x) = {y ∈ X|x ≻ y}.
∗ x ∈ X is the root of X iff P (x) = ∅.
– set of successors of node x ∈ X by S(x) = {y ∈ X|y ≻ x}.
– The set of terminal nodes is denoted by Z = {x ∈ X|S(x) = ∅}, i.e. nodes without succes-
sors.
– The set of moving nodes is Y = X\Z = {x ∈ X|S(x) ̸= ∅}. {Y, Z} is a partition of X.
∗ A partition of set A is a collection
Sn of subsets {B1 , . . . , Bn } such that (a) ∀i, Bi ⊆ A,
(b) ∀i ̸= j, Bi ∩ Bj = ∅, (c) i=1 Bi = A
– A path of a terminal node z is Path(z) = P (z) ∪ {z}, i.e. the nodes that proceeds z and z
itself.
∗ For example, P (6) = {1, 3}, S(2) = {4, 5}, Z = {4, 5, 6, 7}, X = {1, 2, 3}, Path(4) =
{1, 2, 4}
• Let N be the set of players.
• Let (Yi )i∈N be a moving partition of Y such that Yi contains all the nodes that player i chooses
actions.1
• Let Ui be the information partition of player i. That is, Ui is a partition of Yi and each ui ∈ Ui
is an information set of player i, i.e. i do not know exactly in which node she is.
– e.g. N = {B, S}, YB = {1} and YS = {2, 3}, UB = {{1}}, US = {{2}, {3}} (with perfect
information), US = {{2, 3}} (with imperfect information. Because there are two elements
of {2, 3}, player S do not know in which one exactly).
• Let Au be the set of actions at information set u ∈ Ui of player i. (Why do nodes in the same
information set have the same set of actions?)
• e.g. A{1} = {100, 500}, A{2} = A{3} = {A, R} (perfect information), A{2,3} = {A, R}
1 We consider extensive form games such that there is only one player choosing action at each moving node.

2
• A pure strategy of player i is defined by a functions

si : Ui → ∪u∈Ui Au such that ∀u ∈ Ui , si (u) ∈ Au . (1)

Let Si denote the set of pure strategies of player i, and the set of strategy profiles S = ×i∈N Si .
– For example, in the perfect information bargaining (100, RA) means that sB ({1}) = 100,
sS ({2}) = R, sS ({3}) = A, SB = {100, 500}, SS = {AA, AR, RA, RR}, and S = SB × SS
– with imperfect information, (500, A) means that sB ({1}) = 500, sS ({2, 3}) = A, SS =
{A, R}
• The payoff function vi : Z → R is defined on outcome space, i.e. the set of terminal nodes, for
each player i.
D E
• Γ = N, Y, Z, ≻, Yi , Ui , (Au )u∈Ui , vi i∈N completely describes an extensive form game

• For every strategy profile s = (s1 , . . . , sn ), it must lead to some terminal node, and we call it the
terminal node induced by s, denoted by z = ζ(a), where ζ : S → Z is the outcome function.

– This ζ is determined by N, Y, Z, ≻, Yi , Ui , (Au )u∈Ui i∈N .
• The payoff function in terms of strategy profiles is ui = vi ◦ ζ, i.e.

ui (s) = vi (ζ(s)) , (2)

is the composite of vi and ζ.

– e.g. ζ ((500, RA)) = 6, vB (6) = 100 and vS (6) = 450, so uB (500, RA) = 100 and
uS (500, RA) = 450.
D E
• Thus, a complete description of an extensive form game Γ = N, Y, Z, ≻, Yi , Ui , (Au )u∈Ui , vi i∈N
is a mathematical structure that includes:
– N : a set of players,
– Y : a set of moving nodes,
– Z: a set of terminal nodes,
– ≻: a “comes after” relation,
– (Yi )i∈N : a moving partition of moving nodes,
– Ui : an information partition of player i’s moving nodes Yi for every i ∈ N ,
– Au : a set of actions for player i’s each information set u ∈ Ui for every i ∈ N ,
– vi : a payoff function for each player i ∈ N

2.2 Relation between EFG and SFG

D E
Definition 2.1. A reduced strategic form game G(Γ) of an extensive form game Γ = N, Y, Z, ≻, Yi , Ui , (Au )u∈Ui , vi i∈N
is G(Γ) = N, (Si , ui )i∈N where Si is given in (1) and ui is given by (2).

• The RSFG of bargaining game with perfect information is

3
• The RSFG of bargaining game with imperfect information is

• A strategic form game can always be represented as extensive form game with imperfect infor-
mation.
• For example: the strategic form game

can be represented as an extensive form game, the bargaining game with imperfect information.

• There can be various ways to represent a strategic form game by an extensive form game.

2.3 Equilibrium
2.3.1 Nash equilibrium
Definition 2.2. A Nash Equilibrium of an extensive form game Γ is the Nash equilibrium of its
reduced strategic form game G(Γ).

• How to find NE’s of an extensive form game?

– 1) Write the reduced strategic form game of an extensive form game.

– 2) Find NE’s.
• The bargaining game with perfect information has three NE’s
– (100, AA), (100, AR), and (500, RA).

• The bargaining game with imperfect information has a unique NE (100, A).

2.3.2 Subgame perfect equilibrium

Definition 2.3. A subgame of an extensive form game is a part of the game tree such that:
• it starts at a single node;

• it contains every successor to this node;

• if it contains a node in some information set, then it contains all nodes in that information set.
Subgame that is not the entire game is called a proper subgame.

4
• The bargaining with p.i. has three subgames, two of which are proper subgames:

– The bargaining with i.i. has only one subgame and no proper subgame.

Definition 2.4. A strategy profile s is a subgame perfect equilibrium of an extensive form game Γ if
it is a Nash equilibrium in all subgames of Γ.

Backward induction: a tree-pruning algorithm

• Let g be a subgame of Γ. Denote r(g) the root, i.e. initial node, of g.
• Denote the d(g) = ♯P (r(g)) the depth of subgame g, e.g. d(Γ) = 0, d(bl) = d(or) = 1.
• The backward Induction procedure:
– Step 1: start from the deepest subgames, i.e. subgames of the largest depth, of a game tree,
and find NE’s of those subgames with strategy and payoffs recorded.
– Step 2: Prune a game tree: replace the roots of those deepest subgames by a terminal node,
where the equilibrium payoff is attached, and the resulting game is called a pruned game.
– Step 3: repeat step 1 until the deepest subgame has a depth 0.
Example 2.1. An entry game:
• Coke is about to enter a market where Pepsi is the incumbent.
• Coke decides whether or not enter (E for entry and O for out) the market at first stage; if it
enters, then both simultaneously decide between “act though” (T) and “accommodate” (A):

• The entry game Γ has two subgames: the whole game Γ, and the part in red frame gred .
Theorem 2.1 (Zermelo’s Theorem). Every finite extensive form game of perfect information has a
subgame perfect equilibrium, and, hence, a Nash equilibrium, in pure strategy. And all SPE’s can be
obtained from backward induction procedure.

5
2.4 Randomized strategies
• A mixed strategy σi is simply a probability on Si , i.e. σi ∈ ∆(Si )
– In the entry game, Coke has for pure strategies, {OA, OT, EA, ET }, and it may play the
mixed strategy (0.1, 0.2, 0.3, 0.4).

• A behavioral strategy βi : Ui → ∆(Ai ) is defined to be such that, for every information set u ∈ Ui ,
βi (u) ∈ ∆(Au ) is a probability on Au .
– Mixing when a player makes decision at a specific information set, instead of mixing at the
beginning of the game
– In the entry game, a behavioral strategy {(0.3, 0.7), (3/7, 4/7)}, i.e. At the root, Cook
chooses O w/ prob. 0.3 and E w/ prob. 0.7; If Coke enters the market, it choose A w/
prob. 3/7 and T w/ prob. 4/7
• What is the relationship between mixed and behavioral strategies?
– First, it is always easy to obtain a mixed strategy σi from a behavioral strategy βi , say
Y
σi (si ) = βi (u) (si (u)) (3)
u∈Ui

– Second, let Si (u) = {si ∈ Si |∃s−i , ∃x ∈ ui , x ≺ ζ(si , s−i )} be set of strategies such that
player i could arrive the information set u, and let Si (u, ai ) = {si ∈ S(u)|si (u) = ai }, then
say that βi is consistent with σi if, for any u ∈ Ui and any ai ∈ Au ,

σi (Si (u, ai ))
σi (Si (u)) > 0 =⇒ βi (u)(ai ) = (4)
σi (Si (u))

– Consider the entry game, if a Coke’s behavioral strategy {(0.3, 0.7), (3/7, 4/7)} is given,
it is equivalent to its mixed strategy (9/70, 12/70, 0.3, 0.4). The other way around, if
a Coke’s mixed strategy (0.1, 0.2, 0.3, 0.4) is given, then there is a behavioral strategy
{(0.3, 0.7), (3/7, 4/7)} that induces the same outcome (distribution) as the mixed strategy.
∗ S1 (1) = {OA, OT, EA, ET }, S(1, O) = {OA, OT }, S(1, E) = {EA, ET }, so β1 (1)(O) =
0.3 and β1 (1)(E) = 0.7
∗ S1 (2) = {EA, ET }, S(2, A) = {EA}, S(2, T ) = {ET }, so β1 (2)(A) = 0.3/0.7 = 3/7
and β1 (2)(T ) = 4/7
• The set of mixed strategies is larger than those obtained from behavioral strategies
– (0.1, 0.2, 0.3, 0.4) is mixed strategy but cannot be obtained from any behavioral strategies.
• But they are somehow “observationally equivalent”.

Theorem 2.2. (Kuhn) For any profile of mixed and behavioral strategies, σ and β, the following hold:
(a) For any player i ∈ N , if σi and βi satisfy either (3) and (4), then Pr(z|σi , s−i ) = Pr(z|βi , s−i )
for any terminal node z ∈ Z.
(b) For all players i ∈ N , if σi and βi satisfy either (3) and (4), then Pr(z|σ) = Pr(z|β) for any
terminal node z ∈ Z.
Theorem 2.3. (Kuhn) Every finite extensive form game has a subgame perfect equilibrium in behav-
ioral strategies.

• A SPE in behavioral strategies induces a SPE in mixed strategies.

6
• There is a mixed (behavioral) equilibrium in the entry game: {((1, 0), (2/3, 1/3)) , (1/2, 1/2)}

Theorem 2.4 (One-deviation principle). Let Γ be a finite horizon extensive form game with perfect
information. The strategy profile s∗ is a SPE of Γ if and only if, for every player i ∈ N and for every
subgame g of Γ where player i moves at the initial node of g, there exists no profitable deviation by
player i which differs from s∗i only in the action specified at the initial node of g.
Remark 2.1. The one-deviation principle holds for infinite horizon games if certain regularity conditions
(e.g., continuity at infinity (Fudenberg and Tirole, 1991, p.110), which means that what happens in
the distance future has little impact on the payoff). Such conditions hold in all games with compact
action space and continuous payoff functions.

3 Applications
3.1 Pirate game
There are five rational pirates (in strict order of seniority A, B, C, D and E) who found 100 gold coins.
They must decide how to distribute them.
The pirate world’s rules of distribution say that the most senior pirate first proposes a plan of
distribution. The pirates, including the proposer, then vote on whether to accept this distribution. If
the majority accepts the plan, the coins are dispersed and the game ends. In case of a tie vote, the
proposer has the casting vote. If the majority rejects the plan, the proposer is thrown overboard from
the pirate ship and dies, and the next most senior pirate makes a new proposal to begin the system
again. The process repeats until a plan is accepted or if there is one pirate left.
Pirates base their decisions on three factors. First of all, each pirate wants to survive. Second,
given survival, each pirate wants to maximize the number of gold coins he receives. Third, each pirate
would prefer to throw another overboard, if all other results would otherwise be equal.
Solution:
• The final possible scenario would have all the pirates except D and E thrown overboard. Since
D is senior to E, they have the casting vote; so, D would propose to keep 100 for themself and 0
for E.
• If there are three left (C, D and E), C knows that D will offer E 0 in the next round; therefore,
C has to offer E one coin in this round to win E’s vote. Therefore, when only three are left the
allocation is C:99, D:0, E:1.
• If B, C, D and E remain, B can offer 1 to D; because B has the casting vote, only D’s vote is
required. Thus, B proposes B:99, C:0, D:1, E:0.
• With this knowledge, A can count on C and E’s support for the following allocation, which is
the final solution: A: 98, B:0, C:1, D:0, E:1

3.2 An R&D race

Two firms compete with each other to develop a new technology. Firm i, i ∈ {1, 2}, initially has ki
steps from the finish line. On each of its turns, a firm can either not take any steps (at a cost 0), or
can take one step, at a cost of c(1), or two steps, at a cost of c(2). The fist firm to reach the finish
line wins the competition with a payoff vi > 0 to firm i; the losing firm’s payoff is 0. Moreover, if, on
successive turns, neither firm takes any step, the game ends and both firms’ payoff is 0.
In particular, we consider k1 = k2 = 5, v1 = v2 = 7, c(1) = 1, c(2) = 4, and firm 1 moves first.
Solution:
Let us denote Gi (ki , k−i ) the (sub-)game in which firm i moves first with remaining steps ki and
k−i . Then, it is easy to see that:

7
• In subgames Gi (ki , k−i ) with ki , k−i ≤ 2, the unique SPE outcome is that firm i takes ki steps
to win the competition, and the corresponding payoff to firm i is 7 − c(ki ) and to firm −i is 0.
• In subgames Gi (ki , k−i ) with ki ≤ 2 and k−i > 2, the unique SPE outcome is that firm i takes 1
step in each of its turn to win the competition, and the corresponding payoff to firm i is 7 − ki
and to firm −i is 0.
• In subgames Gi (ki , k−i ) with ki > 2 and k−i ≤ 2, the unique SPE outcome is that firm i takes no
steps and let firm −i to win the competition with 1 step in each of its turn, and the corresponding
payoff to firm i is 0 and to firm −i is 7 − k−i .

Now consider a game G1 (3, 3). Firm 1 can take one step into the game G2 (3, 2), in which firm 1 win,
with cost 1. Firm 1 can also take two steps into the game G2 (3, 1) with cost 4. Clearly, in SPE, firm
1 take one step. Similarly,
• in the unique SPE of the game Gi (3, k−i ) with k−i > 2, firm i take one step and win the
competition with a payoff of 4

• in the unique SPE of the game Gi (4, k−i ) with k−i ∈ {3, 4}, firm i take two steps and win the
competition with a payoff of 1.
• in the unique SPE of the game Gi (4, k−i ) with k−i ≥ 5, firm i take one step and win the
competition with a payoff of 3

Then consider a game G2 (5, 3). Firm 2 can take one step into the game G1 (3, 4) or two steps into
the game G1 (3, 3), where it will lose anyway. Hence, in the SPE, firm 2 takes no step and expect a
payoff of 0, and firm 1 expect a payoff of 4. Similar argument applies to G2 (5, 4), in whose SPE firm
2’s payoff is 0 and firm 1’s is 3.
In the game G1 (5, 5) as what we finally want to solve, firm 1 can take one step into the game
G2 (5, 4) with a cost of 1 and its final payoff is 2.

Exercise 3.1. Solve the SPE of Gi (6, 5) and Gi (6, 6).

3.3 Bargaining
3.3.1 Nash bargaining
Nash (1950) considered a bargaining problem and take a cooperative approach to show that there is a
unique solution that satisfies certain desirable properties. Two people, A and B, are bargaining over
a set of possible outcomes, denoted by S ⊆ R2+ . If the individuals fail to reach an agreement they
both receive outcome zero (0, 0), which is called the disagreement point. Nash looked for solutions that
satisfy the following properties:
Axiom 1 (Pareto efficiency (PAR)). No one can improve upon the solution without making the other
person worse off.
Axiom 2 (Symmetry (SYM)). Both individuals receive the same outcome if the bargaining set is
symmetric.
Axiom 3 (Invariance (INV)). If the bargaining set is contracted or expanded by some factor, the shares
are also contracted or expanded by the same factor.
Axiom 4 (Independence of Irrelevant Alternatives (IIA)). Adding alternatives to the bargaining set
that have not been chosen does not change the solution.
Nash’s theorem states that there exists a unique solution that satisfies these properties and that it
is given by
∗ ∗
(πA , πB ) = arg max πA πB
(πA ,πB )∈S

8
3.3.2 Alternating offers bargaining
The game:
Two players, A and B, bargain over a cake of size 1. At time 0 player A makes an offer xA ∈ [0, 1]
to player B. If player B accepts the offer, agreement is reached and player A receives xA and player
B receives 1 − xA . If player B rejects the offer, she makes a counteroffer xB ∈ [0, 1] at time 1. If this
counteroffer is accepted by A, then B receives xB and A receives 1 − xB . Otherwise, player A makes
another offer at time 2. This process continues indefinitely until a player accepts an offer.
If the players reach an agreement at time t on a partition that gives player i a share xi of the cake,
then player i’s payoff is δit xi , where δit ∈ (0, 1) is player i’s discount factor. If players never reach an
agreement, then each player’s payoff is zero.
The solution:
Suppose that (x∗A , x∗B ) is an equilibrium offer, then it must satisfy the following properties:
1. (No delay) Equilibrium offers are accepted in all the subgames.
2. (Stationary) A player makes the same offer in equilibrium.
Therefore, the current value of B rejecting the offer x∗A is δB x∗B , and, in equilibrium,

1 − x∗A = δB x∗B .

Similarly,
1 − x∗B = δA x∗A .
The unique solution to these equations is
1 − δB
x∗A =
1 − δA δB
1 − δA
x∗B =
1 − δA δB
Thus, the following strategy profile (s∗A , s∗B ) constitutes an SPE:
• Player A always offer x∗A and accepts any xB with 1 − xB ≥ δA x∗A
• Player B always offer x∗B and accepts any xA with 1 − xA ≥ δB x∗B .
Proposition 3.1. One-deviation principle holds for the Rubinstein bargaining game.
Proposition 3.2. (s∗A , s∗B ) is a SPE of the alternating offers bargaining game.
Proof. Consider any period (subgame) when A has to make an offer. Her payoff to s∗A is x∗A . If,
instead, A offer xA < x∗A , then B accepts the offer and A obtain a payoff xA < x∗A to this deviation. If
she offer xA > x∗A , B will reject the offer and offer 1 − x∗B = δA x∗A . In this case, A will accept the offer
2 ∗
and obtain a payoff δA xA < x∗A . Hence, we conclude that there is no profitable one-shot deviation. By
a symmetric argument, there is no profitable one-shot deviation for player B either.
The SPE has the following property:
• Unique (try to prove it)
• Efficient (Pareto optimal)
1−δB
• In the unique SPE, the equilibrium payoff of player A is πA = x∗A = 1−δA δB and that of player B
δB (1−δA )
is πB = 1 − x∗A
= 1−δA δB .The
share of each player is decreasing in her discount factor. Suppose
that δA = δB = δ → 1, the Rubinstein bargaining outcome πA = πB = 1/2, which coincide with
Nash bargaining solution with S = {(πA , πB ) ∈ R2+ : πA + πB ≤ 1}.

9
4 Perfect Bayesian equilibrium
To study extensive form games with incomplete information, we can consider an EFG with a dummy
player, Nature. Its choice is random and may or may not be observable. It is indifferent between
outcomes, i.e. payoffs are the same in the different final nodes. An extensive form game with incomplete
information can be considered as an EFG with Nature and Nature’s actions are unobservable. It
is possible to define subgame perfect equilibrium or Bayesian-Nash equilibrium, but they are not
adequate.
The following example is an EFG with complete information, however, it illustrates the idea that
sometimes SPE’s may be unreasonable.
Example 4.1. Consider the following game:

The strategic form of this game is as follows:

There are two SPEs: (T, L) and (O, R). However, (O, R) is unreasonable: if player 2 know that she in
the information set I, she should never choose R since it is dominated by L.
Thus, we further require that players are rational in every continuation game. A continuation game
may start from an information set. In the above example, the information set I and the nodes follow
from that information set. In general, analyzing players’ decision at an information set requires him
forming beliefs regarding which decision node he is at.
To specify a perfect Bayesian equilibrium, we consider an assessment (µ, β) consisting of
• a belief system µ : X → [0, 1] that assigns a probability of each Pnode in each information set,
satisfying that, in every information set u ∈ Ui for some player i, x∈u µ(x) = 1; and

• a profile of behavioral strategies β.

Definition 4.1. A perfect Bayesian equilibrium in an extensive form game Γ is an assessment (µ, β)
such that
1. (Weak consistency) given strategy profile β, beliefs are determined by Bayes rule whenever pos-
sible;

10
2. (Sequential rationality) given belief µ and subsequent strategies, the action chosen at each infor-
mation set is optimal.
In the above example, let µ ∈ [0, 1] be the probability assigned to the node follow T . A PBE of this
game is (µ = 1, (T, L)). That is, player 2 believes that the probability of the node following T equal
to 1 if he is at the information set I, and player 1 always chooses T , and player 2 always chooses L.
Given the belief µ = 1 (in fact, for any belief µ ∈ [0, 1]), choosing L is optimal for player 2 in the
continuation game. Given that β1 (T ) = 1 and β1 (B) = β1 (O) = 0, by Bayes rule,

β1 (T )
= 1 = µ.
β1 (T ) + β1 (B)

5 Signaling games
5.1 The example

• The game starts with a “decision” by Nature, which determines whether player 1 is of type I or
II with a probability p = Pr(I) = 1/2 (in this example)
• Player 1 decides whether to play L or R conditioning on his type. Hence, his strategy is defined
a mapping from types to actions:

σ1 : I 7→ aL + (1 − a)R and II 7→ bL + (1 − b)R

• After player 1 moves, player 2 is able to see the action taken by player 1 but not the type of
player 1. Hence, her decision is conditional on the action observed. Hence, her strategy is defined
a mapping from player 1’s actions to her own actions:

σ1 : L 7→ xU + (1 − x)D and R 7→ yU + (1 − y)D

• There is no subgames but continuation games. Player 2 forms a belief about the type of player
1 given the action observed.
– In the example, her belief are specified by r = Pr(I|L) and q = Pr(I|R)
– In equilibrium, we require beliefs are consistent meaning that it satisfies Bayes rule given
strategies. In the example, let a = Pr(L|I) and b = Pr(L|II). By Bayes rule, if ap+b(1−p) >
0,
Pr(L|I) Pr(I) ap
r = Pr(I|L) = = (5)
Pr(L|I) Pr(I) + Pr(L|II) Pr(II) ap + b(1 − p)

11
and, if (1 − a)p + (1 − b)(1 − p) > 0,

Pr(R|I) Pr(I) (1 − a)p

q = Pr(I|R) = = (6)
Pr(L|R) Pr(R) + Pr(R|II) Pr(II) (1 − a)p + (1 − b)(1 − p)

– In the case that ap + b(1 − p) = 0 ((1 − a)p + (1 − b)(1 − p) = 0), r (q) can be anything in
[0, 1].

5.2 Finding equilibria

We first solve pure strategy equilibria (separating and pooling) and then mixed strategy equilibria
presented above. A PBE consists of strategy profile (σ1 , σ2 ) and belief (r, q).

5.2.1 Separating and pooling equilibria

In separating equilibria, player 1 of different types chooses different actions. To solve an equilibria,
there are four steps.
Step1: consider a candidate strategy of player 1,

σ1 : I 7→ L; II 7→ R

Step 2: compute the belief of player 2 with the candidate strategy specified in step 1. In this case,
simply, r = 1 and q = 0.
Step 3: find player 2’s best response given her beliefs. In this case,

σ2 : L 7→ U ; R 7→ U.

Step 4: given player 2’s strategy in step 3, verify that the candidate strategy in step 1 is optimal
for player 1. In practice, we verify whether player 1 has incentive to deviate. In this case, if player 1
of type I deviates from L to R, his payoff changes from 4 to 0, so the strategy is optimal for type I. If
type II deviates (from U to L), his payoff, again, changes from 4 to 0; so the strategy is also optimal
for type II. Then, we can conclude that there is a separating equilibrium that consists of strategy
profile (LR, U U ) and beliefs (1, 0).
We can also check whether there is another separating equilibrium with player plays

σ1 : I 7→ R; II 7→ L.

Then, player 2’s beliefs are simply r = 0 and q = 1, and her best response, given these beliefs, is

σ2 : L 7→ D; R 7→ D.

Finally, it is easy to check that player 1 has no incentive to deviate given player 2’s strategy. Hence,
the assessment (RL, DD) and (0, 1) is another separating equilibrium.
In pooling equilibria, player 1 of different types chooses the same action. We follow the same steps
as before.
Step1: consider a candidate strategy of player 1,

σ1 : I 7→ L; II 7→ L.

Step 2: Different from the case of separating equilibria, where we compute both r and q exactly,
we can only know the exact value of r, which is 1/2, but q can be any value in [0, 1].
Step 3: We know that player 2’s best response is to choose D if she observes action L given her
belief r = 1/2. When player 2 observes action R, her choice depends on her belief q. So we can consider
two cases: (case 1) if q ≤ 1/3, the player 2’s best response should be U ; otherwise, she should choose
D.

12
Step 4: Now we need to verify that, in two assessments ((LL, DU ), (1/2, q)) with q ≤ 1/3 and
((LL, DD), (1/2, q)) with q > 1/3, player 1’s strategy is optimal.
(4.1) In case 1, if player 1 of type I deviates from L to R, his payoff changes from 0 to 0, so the
strategy is optimal for type I; if type II deviates, his payoff changes from 8 to 4, so the strategy is
optimal for type II. Hence, ((LL, DU ), (1/2, q)) with q ≤ 1/3 is a PBE.
(4.2) In case 2, if type I deviates, his payoff increases from 0 to 8. Thus, he has incentives to
deviate and this case cannot be an equilibrium.
We can also check whether there is another pooling equilibrium with player 1’s strategy being
σ1 : I 7→ R; II 7→ R.
Following similar steps, the assessment ((RR, U D), (r, 1/2)) with r ≥ 2/3 is another pooling equilib-
rium.

5.3 Mixed strategy equilibria

Before we only consider the possibility that players play pure strategies. Here we consider the possibility
of mixed strategy in equilibrium.
Step 1: we consider a hybrid equilibrium where one of the types plays pure strategy and the other
randomizes. Consider that player 1 plays a strategy of the following form
σ1 : I 7→ L; II 7→ bL + (1 − b)R.
Step 2: we compute player 2’s belief. Since only type II could choose R, it is immediate that
q = 0. All type I and a fraction b of type II choose L, therefore the probability of L being chosen is
p + b(1 − p), among which fraction p is from type I. Hence, knowing that p = 1/2,
p 1
r= = .
p + b(1 − p) 1+b
A consequence of player 2’s belief is that, given q = 0, it is obvious to see that player 2’s best response
to R is U .
Step 3: To obtain player 2’s action when she observes L, i.e., L 7→ xU + (1 − x)D, we use that
fact the type II randomizes if and only if two actions yields the same expected payoff to him. If type
II chooses L, his payoff is 8(1 − x); if he chooses R, his payoff is 4 given that player 2 will choose U .
Solving 8(1 − x) = 4 yields that x = 1/2.
Step 4: from step 3, we know that player 2 randomizes if observing L, meaning that player 2 are
indifferent between actions at the information set following L. Given player 2’s belief, choosing U
yields 4r and choosing D 8(1 − r). Solving 4r = 8(1 − r) yields r = 2/3. Using the the result from
step 2, we can solve that b = 1/2.
Step 5: we verify that type I’s action is optimal as before. Given player 2’s strategy, if type I
deviates, his payoff changes from 2 to 0.
To conclude, (L. 21 L + 12 R, 12 U + 12 D.U ), (2/3, 0) is a hybrid

equilibrium. Without checking, an-
other hybrid equilibrium is ( 21 L + 12 R.R, U. 12 U + 12 D), (1, 1/3) .
Finally, we solve the fully mixed strategy equilibrium.
Given a general player 2’s strategy that characterized by two unknowns (x and y), that player 1
mixes meaning that he is indifferent between actions. For type I, it holds that 4x = 8(1 − y), and, for
type II, 8(1 − x) = 4y. Solving them yields that x = y = 2/3.
Thus, player 2 also mixes, which means that she is indifferent between action in any information
set. In the information set following L, 4r = 8(1 − r), so r = 2/3. Following R, 4(1 − q) = 8q, so
q = 1/3.
Using Bayes rule (or simply, equations (5) and (6)),
a 1−a
r= and q = .
a+b 2−a−b
Solving r = 2/3 and q = 1/3 yields that a = 2/3 and b = 1/3.

13
6 Repeated games
6.1 Preliminaries
• Let G = ⟨N, (Ai , ui )i∈N ⟩ be an n-player strategic form game, and call it a stage game.
– For example, G can be the following prisoner’s dilemma (PD):

• An infinitely repeated game Gδ is defined as follow:

– The stage game G is played at each discrete time period t = 1, 2, . . .
– At the end of each period, actions chosen by each player is revealed to everybody.
– Let at ∈ A be the action profile chosen by players at period t, the payoff of player i is given
by
∞
X
(1 − δ) δ t ui (at )
t=0

where δ ∈ (0, 1) is the discounting factor of the repeated game Gδ .

• A history, denoted by ht , at period t is a sequence of action profiles from period 1 through period
t − 1, in particular,

h1 = ∅
ht = (a1 , . . . , at−1 ), for t = 2, 3, . . .

– e.g. a possible fifth period history in PD is ((C, C), (C, C), (C, D), (D, D))
– The set of period t histories is given by

H 0 = {∅} and H t = At−1 , for t = 2, 3, . . .

• A pure strategy is a sequence of functions that assign an action in Ai to every history ht .

– A pure strategy for player i is given by

si = s1i , s2i , . . . , sti , . . .

where s1i ∈ Ai and sti : H t → Ai , and the set of strategies is denoted by Si for player i.
• For example, a grim trigger strategy in PD is as follow: for player i = 1, 2,

s1i = C
(
t t C aτ−i = C, ∀τ ≤ t − 1
si (h ) =
D otherwise

– start with playing C and switch to D if the opponent has played D in the past
– defection is triggered by the opponent’s defection
– grim as punishment lasts forever

P∞ then equilibrium outcome would be (C, C)

– If both players adopt the grim trigger strategy,
at every period and hence Ui (s) = (1 − δ) t=1 δ t−1 ui (C, C) = 2

14
• The continuation payoff, for some strategy profile s, after some history ht is
∞
X
Ui (s|ht ) = (1 − δ) δ τ −t ui (aτ )
τ =t

t t t τ τ τ −1 τ −1
where a = s (h ) and a = s (h ,a ) for all τ ≥ t + 1.

6.2 Equilibria
Definition 6.1. The strategy profile s is a Nash equilibrium of the repeated game Gδ if for all i ∈ N
Ui (si , s−i ) ≥ Ui (s′i , s−i ), ∀s′i ∈ Si .
We may want to refine Nash equilibrium to subgame perfect equilibrium in dynamic games.
Definition 6.2. The strategy profile s is a subgame perfect equilibrium of the repeated game Gδ if for
all i ∈ N and all ht ∈ H t
Ui (si , s−i |ht ) ≥ Ui (s′i , s−i |ht ), ∀s′i ∈ Si .
Claim 6.1. The grim trigger strategy profile is a Nash equilibrium of the repeated PD if δ ≥ 1/2.
• Suppose player 2 plays grim trigger strategy and check play 1 has no incentive to deviate.
• There are two classes of possible deviations:
– responding C every period, and then the expected payoff is 2
– responding D at some period, and show that such a deviation is not profitable.
• Let T + 1, T = 0, 1, . . ., be the first period that player 1 defects. Then, the best (in what sense?)
deviation must generates the sequence of action profile (why?)
(C, C) . . . (C, C), (D, C), (D, D), (D, D), . . .
| {z }
T times

– The corresponding expected payoff of this deviation is

∞
" T #
X X
t−1 T t−1
(1 − δ) δ 2+δ 3+ δ = 2 + δ T (1 − 2δ)
t=1 t=T +2

– If δ ≥ 1/2, the expected payoff of such a deviation is no greater than 2.

• However, the above grim strategy profile is not technically a SPE.

– Suppose that h2 = (C, D), which is off equilibrium path.
– If player 2 stick to grim trigger strategy, she get payoff δ and the following sequence of
action profiles realizes after h2
(D, C), (D, D), (D, D), . . .

– But, consider a shot-deviation: s22 ((C, D)) = D (recall that, in the grim trigger strategy
s22 ((C, D)) = C). Then, she will get payoff 1 and the following sequence of action profiles
realizes after h2
(D, D), (D, D), (D, D), . . .
• Modified grim trigger strategy profile is a SPE, which can be verified by one-deviation principle.
(
1 t t C ht = ((C, C), . . . , (C, C))
si = C and si (h ) =
D otherwise

15
6.3 Equilibrium payoffs
Definition 6.3. The set of feasible payoff profiles of a strategic game is the set of all weighted averages
of payoff profiles in the game.
• What are the possible avergae discounted payoff pairs in a NE beside (1, 1) and (2, 2)?
• Consider the outcome path (b1 , b2 , . . .) that consists of repetitions of the sequence (a1 , . . . , ak )
– Let the average payoff of the sequence (a1 , . . . , ak ) be x.
• Consider the following strategy profile, for i = 1, 2,
(
1 1 t t bti hr−i = br−i , ∀r = 1, . . . , t − 1
si = bi and si (h ) =
D otherwise
– If xi < 1 for some i, then player i has incentive to deviate by playing D.
– If xi > 1 for all i, then the strategy profile is a NE when δ is close to 1.

• Therefore, we have the following conclusions in IRPD:

– for any δ ∈ (0, 1), each player’s payoff in every discounted average payoff pair generated by
a NE is at least 1
– for every feasible pair (x1 , x2 ) of payoffs for which xi > 1, i = 1, 2, there is a pair (y1 , y2 )
close to (x1 , x2 ) such that for a discounted factor close enough to 1 and there is a NE in
which the pair of discounted average payoffs is (y1 , y2 ) .
• The (approximate) set of Nash equilibrium payoff pairs in IRPD:

• The SPE payoff is the same as NE payoffs:

– The argument is the same as NE payoff
– Consider the outcome path (b1 , b2 , . . .) that consists of repetitions of the sequence (a1 , . . . , ak )
and the profile of the following strategy
(
1 1 t t bti hr = br , ∀r = 1, . . . , t − 1
si = bi and si (h ) =
D otherwise
is a SPE.

16
6.4 General repeated games and folk theorems
• The basic idea behind IRPD is extended to general infinitely repeated games:
– If players cooperate, everyone gets a payoff higher than some “minimum” payoff
– A deviation triggers each player to begin an indefinite “punishment” of the deviant

• How can we find this minimum payoff in an arbitrary strategic game?

– The deviant seeks to maximize her payoff maxai ∈Ai ui (ai , a−i ).
– To punish player i, other players play a−i to minimize player i’s highest payoff, i.e. seek a
solution to
min max ui (ai , a−i ) ,
a−i ∈A−i ai ∈Ai

which is known as the minmax payoff of player i.

Theorem 6.1 (Nash folk theorem). Let G be a strategic form game and Gδ be an infinitely repeated
games with discount factor δ.

• For any discount factor δ the discounted average payoff of every player in any Nash equilibrium
of Gδ is at least her minmax payoff in G.
• Let w be a feasible payoff profile of G for which each player’s payoff exceeds her minmax payoff.
Then for all ϵ > 0 there exists δ̄ < 1 such that if the discount factor exceeds δ̄ then Gδ has a
Nash equilibrium whose discounted average payoff profile w′ satisfies |w − w′ | < ϵ.

Theorem 6.2 (Subgame perfect folk theorem for two-player games). Let G be a two-player strategic
form game and Gδ be an infinitely repeated games with discount factor δ.
• For any discount factor δ the discounted average payoff of every player in any subgame perfect
equilibrium of Gδ is at least her minmax payoff in G.

• Let w be a feasible payoff profile of G for which each player’s payoff exceeds her minmax payoff.
Then for all ϵ > 0 there exists δ̄ < 1 such that if the discount factor exceeds δ̄ then Gδ has a
subgame perfect equilibrium whose discounted average payoff profile w′ satisfies |w − w′ | < ϵ.

[FREE PDF sample] Student Solutions Manual Electrochemical Methods 2nd Edition Allen J. Bard ebooks
100% (5)
[FREE PDF sample] Student Solutions Manual Electrochemical Methods 2nd Edition Allen J. Bard ebooks
81 pages
Topic 2 Lecture Notes
No ratings yet
Topic 2 Lecture Notes
22 pages
ECON0027 Section 2
No ratings yet
ECON0027 Section 2
13 pages
Extensive Form Games: ECON2112
No ratings yet
Extensive Form Games: ECON2112
38 pages
Extensive Form Games: Sanjay Singh
No ratings yet
Extensive Form Games: Sanjay Singh
11 pages
4 Games in Extensive Form, Backward Induction and Subgame Perfection
No ratings yet
4 Games in Extensive Form, Backward Induction and Subgame Perfection
21 pages
Game Theory Extended Examples
No ratings yet
Game Theory Extended Examples
52 pages
Extensive Forms
No ratings yet
Extensive Forms
20 pages
2.1 Dynamic Games Introp
No ratings yet
2.1 Dynamic Games Introp
23 pages
Lec 3 - Game Theory and Economics
No ratings yet
Lec 3 - Game Theory and Economics
19 pages
lecture-3-SPE
No ratings yet
lecture-3-SPE
127 pages
SPNE
No ratings yet
SPNE
30 pages
2a Lecture ExtGames BI&SPE
No ratings yet
2a Lecture ExtGames BI&SPE
57 pages
Dynamic Games of Complete Information
No ratings yet
Dynamic Games of Complete Information
13 pages
2.2 Dynamic Imperfect Info
No ratings yet
2.2 Dynamic Imperfect Info
13 pages
Dynamic Games
No ratings yet
Dynamic Games
45 pages
03 Sequential Moves
No ratings yet
03 Sequential Moves
10 pages
GT Game 6 17
No ratings yet
GT Game 6 17
33 pages
Hold Up Game and Many More Extensive Form
No ratings yet
Hold Up Game and Many More Extensive Form
46 pages
EC220 - Slides 10
No ratings yet
EC220 - Slides 10
28 pages
Extensive Form Games
No ratings yet
Extensive Form Games
22 pages
Game Theory 3
No ratings yet
Game Theory 3
45 pages
Game Theory Lecture Notes Lectures 3-6
No ratings yet
Game Theory Lecture Notes Lectures 3-6
18 pages
Lecture 17
No ratings yet
Lecture 17
75 pages
Perfect Information Games 2015
No ratings yet
Perfect Information Games 2015
30 pages
Lecture 2-Game Theory
No ratings yet
Lecture 2-Game Theory
62 pages
Perfect-Information Games+exercises
No ratings yet
Perfect-Information Games+exercises
41 pages
Midterm Review 2: Introduction To Game Theory
No ratings yet
Midterm Review 2: Introduction To Game Theory
24 pages
Game Theory Slides Chapter 6-3
No ratings yet
Game Theory Slides Chapter 6-3
82 pages
EC941 - Game Theory: Prof. Francesco Squintani Email: F.squintani@warwick - Ac.uk
No ratings yet
EC941 - Game Theory: Prof. Francesco Squintani Email: F.squintani@warwick - Ac.uk
39 pages
MiA T3 DynamicGames
No ratings yet
MiA T3 DynamicGames
48 pages
Lec Slides
No ratings yet
Lec Slides
69 pages
Chapter A
No ratings yet
Chapter A
39 pages
ExtensiveFormGames_W
No ratings yet
ExtensiveFormGames_W
34 pages
3b Extensive-Form Games
100% (1)
3b Extensive-Form Games
17 pages
1 Extensive Games With Imperfect Information
No ratings yet
1 Extensive Games With Imperfect Information
17 pages
Ec2010a GT Sectionnotes All 2021
No ratings yet
Ec2010a GT Sectionnotes All 2021
65 pages
Game1a Normal Form Game-5
No ratings yet
Game1a Normal Form Game-5
5 pages
Extensive Games With Perfect Information1
No ratings yet
Extensive Games With Perfect Information1
31 pages
Topic 3 With Handwritten Notes
No ratings yet
Topic 3 With Handwritten Notes
17 pages
Extensive Form Games: Perfect Information: ECON2112
No ratings yet
Extensive Form Games: Perfect Information: ECON2112
69 pages
Game Theory Notes PP
No ratings yet
Game Theory Notes PP
36 pages
14.12 Game Theory Lecture Notes Lectures 7-9: 1 Backwards Induction
No ratings yet
14.12 Game Theory Lecture Notes Lectures 7-9: 1 Backwards Induction
12 pages
Sequential Games of Perfect Information UPF
No ratings yet
Sequential Games of Perfect Information UPF
58 pages
Microeconomics II Lecture 2: Game Theory: Mohammad Vesal
No ratings yet
Microeconomics II Lecture 2: Game Theory: Mohammad Vesal
62 pages
Lecture 8
No ratings yet
Lecture 8
44 pages
Game Theory Slides Chapter 1-2
No ratings yet
Game Theory Slides Chapter 1-2
20 pages
Introductory Microeconomics Economics 10004: Lecture 20: Semester 1, 2021
No ratings yet
Introductory Microeconomics Economics 10004: Lecture 20: Semester 1, 2021
30 pages
Lecture 3-Dynamic Games
No ratings yet
Lecture 3-Dynamic Games
25 pages
Systems Resource Management
No ratings yet
Systems Resource Management
48 pages
An Introduction To Dynamic Games
No ratings yet
An Introduction To Dynamic Games
9 pages
M4
No ratings yet
M4
3 pages
Examen Enero 2016 Soluciones
No ratings yet
Examen Enero 2016 Soluciones
5 pages
Lecture 1
No ratings yet
Lecture 1
24 pages
Basic Assumptions of The Game Theory: I N N I N
No ratings yet
Basic Assumptions of The Game Theory: I N N I N
6 pages
extensive games
No ratings yet
extensive games
36 pages
L3
No ratings yet
L3
53 pages
GameTheory Lecture 05 Extensive Form Games
No ratings yet
GameTheory Lecture 05 Extensive Form Games
26 pages
Basic Elements of Game Theory
No ratings yet
Basic Elements of Game Theory
28 pages
Topic 3
No ratings yet
Topic 3
17 pages
Differential Games
From Everand
Differential Games
Avner Friedman
No ratings yet
Parallel and Perpendicular Lines
No ratings yet
Parallel and Perpendicular Lines
11 pages
New Idea&Aes
No ratings yet
New Idea&Aes
222 pages
Mech Nonlin Mats 14.5 L01 Adv Plasticity
No ratings yet
Mech Nonlin Mats 14.5 L01 Adv Plasticity
28 pages
Practical Exercise 2
No ratings yet
Practical Exercise 2
2 pages
Pic 6-1-17 Co Pos Mapped
No ratings yet
Pic 6-1-17 Co Pos Mapped
6 pages
Problem Set 3a
No ratings yet
Problem Set 3a
2 pages
Program 1. WAP To Implement Bit Stuffing (Sender End)
No ratings yet
Program 1. WAP To Implement Bit Stuffing (Sender End)
16 pages
Building Verification Environment
No ratings yet
Building Verification Environment
23 pages
Integration Study Guide
No ratings yet
Integration Study Guide
11 pages
Detailed Lesson Plan in Math4 Co2
No ratings yet
Detailed Lesson Plan in Math4 Co2
7 pages
Worksheet Normal Distributions
100% (1)
Worksheet Normal Distributions
3 pages
Syllabus: 1. Polynomials A 2. Polynomials B
No ratings yet
Syllabus: 1. Polynomials A 2. Polynomials B
4 pages
In The History of Mathematical Science
No ratings yet
In The History of Mathematical Science
4 pages
What's The Best Monopoly Strategy
No ratings yet
What's The Best Monopoly Strategy
1 page
Average & Ages1
No ratings yet
Average & Ages1
125 pages
19 20Champs4Tests
No ratings yet
19 20Champs4Tests
40 pages
MSW USltr Format
No ratings yet
MSW USltr Format
4 pages
2nd Preboard Math Q
No ratings yet
2nd Preboard Math Q
5 pages
Crit C - Photosynthesis (1)
No ratings yet
Crit C - Photosynthesis (1)
3 pages
Wave Loading
No ratings yet
Wave Loading
10 pages
Create A Huffman Code Dictionary in MATLAB
No ratings yet
Create A Huffman Code Dictionary in MATLAB
10 pages
CVNG 3009 Exam Solutions 2014-2015
No ratings yet
CVNG 3009 Exam Solutions 2014-2015
9 pages
Memes As Speech Acts
100% (1)
Memes As Speech Acts
23 pages
CIE Review For : Probability and Statistics
No ratings yet
CIE Review For : Probability and Statistics
6 pages
Thermodynamics: Physics SF 016
No ratings yet
Thermodynamics: Physics SF 016
39 pages
Maths Curriculum Yr 8
No ratings yet
Maths Curriculum Yr 8
1 page
"Full Coverage": Histograms: (Edexcel IGCSE Nov-2010-4H Q17b Edited)
No ratings yet
"Full Coverage": Histograms: (Edexcel IGCSE Nov-2010-4H Q17b Edited)
15 pages
Quant
No ratings yet
Quant
100 pages
Mensuration Maths Formula in Hindi 42somp
100% (1)
Mensuration Maths Formula in Hindi 42somp
21 pages