Extensive Form Games: Backward Induction and Imperfect Information Games
Extensive Form Games: Backward Induction and Imperfect Information Games
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 1
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Lecture Overview
1 Recap
2 Backward Induction
4 Perfect Recall
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 2
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
p(a) ≥ 0 ∀a ∈ A
X
p(a) = 1
a∈A
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 3
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
p(a) ≥ 0 ∀a ∈ A
X
p(a) = 1
a∈A
Answer: yes, it was wrong. The version above fixes the problem,
changing the second sum so that it’s identical to the first.
Note that the constraint can equivalently be written as
X
[ui (a) − ui (a0i , a−i )]p(a) ≥ 0.
a∈A|ai ∈a
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 3
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Introduction
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 4
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Pure Strategies
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 5
q Aq q Aq q Aq
A
Recap
Backward A
Induction A
Imperfect-Information Extensive-Form Games Perfect Recall
o define a complete strategy for this game, each of the players must choose
each of his two choice nodes. Thus we can enumerate the pure strategies
rs as follows.
A, G), (A, H), (B, G), (B, H)}
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 6
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Subgame Perfection
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 7
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Lecture Overview
1 Recap
2 Backward Induction
4 Perfect Recall
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 8
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Centipede Game
1q A 2q A 1q A 2q A 1q A (3,5)
D D D D D
the procedure
The algorithm BACKWARD doesn’t return
I NDUCTION an equilibrium
is described strategy,
in Figure 5.6. but rather
The variable
labels
util at child is aeach
vector node
denotingwith a vector
the utility ofplayer
for each realatnumbers.
the child node; util at childρ(h)
denotes the element of this vector corresponding to the utility for player ρ(h) (the
This labeling can be seen as an extension of the game’s utility
player who gets to move at node h). Similarly best util is a vector giving utilities for
each player. function to the non-terminal nodes
Observe that The equilibrium
this procedure strategies:
does not take thestrategy
return an equilibrium best action
for each at each node.
of the
n players, but rather describes how to label each node with a vector of n real numbers.
This labeling can be seen as an extension of the game’s utility function to CPSC
Extensive Form Games: Backward Induction and Imperfect Information Games
the non-
532A Lecture 10, Slide 10
good news: not only are we guaranteed to find a subgame-perfect equilibrium (rather
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
than possibly finding a Nash equilibrium that involves non-credible threats) but also
this procedure is computationally simple. In particular, it can be implemented as a
Computing Subgame Perfect Equilibria
single depth-first traversal of the game tree, and thus requires time linear in the size
of the game representation. Recall in contrast that the best known methods for finding
Nash equilibria of general games require time exponential in the size of the normal
Idea:
form; Identify
remember thethatequilibria
as well the induced in theform
normal bottom-most trees,game
of an extensive-form and
is adopt
exponentially
these as larger
one than
movesthe original
up the representation.
tree
function BACKWARD I NDUCTION (node h) returns u(h)
if h ∈ Z then
return u(h) // h is a terminal node
best util ← −∞
forall a ∈ χ(h) do
util at child ←BACKWARD I NDUCTION(σ(h, a))
if util at childρ(h) > best utilρ(h) then
best util ← util at child
return best util
Figure 5.6: Procedure for finding the value of a sample (subgame-perfect) Nash equi-
librium of a perfect-information extensive-form game.
For zero-sum games, BackwardInduction has another name:
the minimax algorithm.
The algorithm BACKWARD I NDUCTION is described in Figure 5.6. The variable
util at child isHere
a vectorit’s
denoting
enoughthe utility for each
to store oneplayer at the child
number pernode; util at childρ(h)
node.
denotes the element of this vector corresponding to the utility for player ρ(h) (the
It’s possible to speed things up by pruning nodes
player who gets to move at node h). Similarly best util is a vector giving utilities for
that will
each player. never be reached in play: “alpha-beta pruning”.
Observe that this procedure does not return an equilibrium strategy for each of the
n players, but rather describes how to label each node with a vector of n real numbers.
Extensive
ThisForm Games:
labeling canBackward
be seenInduction and Imperfect
as an extension Information
of the game’s Games
utility function to CPSC 532A Lecture 10, Slide 10
the non-
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Backward Induction
118 5 Reasoning and Computing with the Extensive Form
1q A 2q A 1q A 2q A 1q A (3,5)
D D D D D
Lecture Overview
1 Recap
2 Backward Induction
4 Perfect Recall
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 12
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Intro
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 13
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Formal definition
Definition
An imperfect-information game (in extensive form) is a tuple
(N, A, H, Z, χ, ρ, σ, u, I), where
(N, A, H, Z, χ, ρ, σ, u) is a perfect-information extensive-form
game, and
I = (I1 , . . . , In ), where Ii = (Ii,1 , . . . , Ii,ki ) is an equivalence
relation on (that is, a partition of) {h ∈ H : ρ(h) = i} with
the property that χ(h) = χ(h0 ) and ρ(h) = ρ(h0 ) whenever
there exists a j for which h ∈ Ii,j and h0 ∈ Ii,j .
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 14
playerRecap
would beBackward
able toInduction
distinguish Imperfect-Information
the nodes). Thus, if I ∈ Games
Extensive-Form Ii is an equivalence
Perfect Recall clas
we can unambiguously use the notation χ(I) to denote the set of actions available
Example
player i at any node in information set I.
q1
"b
L" b R "
b
2 q" bq 2
" b
"b
A" " b (1,1)
bB
q" bq
" b
1
%e %e
ℓ % er ℓ % er
q
% eq q
% eq
(0,0) (2,4) (2,4) (0,0)
we can unambiguously use the notation χ(I) to denote the set of actions available
Example
player i at any node in information set I.
q1
"b
L" b R "
b
2 q" bq 2
" b
"b
A" " b (1,1)
bB
q" bq
" b
1
%e %e
ℓ % er ℓ % er
q
% eq q
% eq
(0,0) (2,4) (2,4) (0,0)
Normal-form games
5 Reasoning and Computing with the Extensive
q eq q eq
% e % e
% %
(-1,-1) (-4,0) (0,-4) (-3,-3)
Figure
Note5.11 The
that it Prisoner’s
would also beDilemma
the samegame
if we in
putextensive
player 2 form.
at the
root node.
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 17
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Randomized Strategies
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 18
Figure 5.1 The Sharing game.
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Notice that the definition contains a subtlety. An agent’s strategy requires a decision
Randomizedatthestrategies example
each choice node, regardless of whether or not it is possible to reach that node given
other choice nodes. In the Sharing game above the situation is straightforward—
player 1 has three pure strategies, and player 2 has eight (why?). But now consider the
game shown in Figure 5.2.
1
A B
2 2
C D E F
1
(3,8) (8,3) (5,5)
G H
(2,10) (1,0)
In order to define a complete strategy for this game, each of the players must choose
Give anan action
example oftwoachoice
at each of his behavioral
nodes. Thus westrategy:
can enumerate the pure strategies
of the players as follows.
S1 = {(A, G), (A, H), (B, G), (B, H)}
S2 = {(C, E), (C, F ), (D, E), (D, F )}
It is important to note that we have to include the strategies (A, G) and (A, H), even
though once A is chosen the G-versus-H choice is moot.
The definition of best response and Nash equilibria in this game are exactly as they
are in for normal form games. Indeed, this example illustrates how every perfect-
information game can be converted to an equivalent normal form game. For example,
the perfect-information game of Figure 5.2 can be converted into the normal form im-
age of the game, shown in Figure 5.3. Clearly, the strategy spaces of the two games are
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 19
Figure 5.1 The Sharing game.
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Notice that the definition contains a subtlety. An agent’s strategy requires a decision
Randomizedatthestrategies example
each choice node, regardless of whether or not it is possible to reach that node given
other choice nodes. In the Sharing game above the situation is straightforward—
player 1 has three pure strategies, and player 2 has eight (why?). But now consider the
game shown in Figure 5.2.
1
A B
2 2
C D E F
1
(3,8) (8,3) (5,5)
G H
(2,10) (1,0)
In order to define a complete strategy for this game, each of the players must choose
Give anan action
example oftwoachoice
at each of his behavioral
nodes. Thus westrategy:
can enumerate the pure strategies
of the players as follows.
A with probability .5 and G with probability .3
S1 = {(A, G), (A, H), (B, G), (B, H)}
Give an Sexample
= {(C, E), (C,of
2 a mixed
F ), (D, E), (D, F )} strategy that is not a behavioral
It is important to note that we have to include the strategies (A, G) and (A, H), even
strategy:
though once A is chosen the G-versus-H choice is moot.
The definition of best response and Nash equilibria in this game are exactly as they
are in for normal form games. Indeed, this example illustrates how every perfect-
information game can be converted to an equivalent normal form game. For example,
the perfect-information game of Figure 5.2 can be converted into the normal form im-
age of the game, shown in Figure 5.3. Clearly, the strategy spaces of the two games are
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 19
Figure 5.1 The Sharing game.
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Notice that the definition contains a subtlety. An agent’s strategy requires a decision
Randomizedatthestrategies example
each choice node, regardless of whether or not it is possible to reach that node given
other choice nodes. In the Sharing game above the situation is straightforward—
player 1 has three pure strategies, and player 2 has eight (why?). But now consider the
game shown in Figure 5.2.
1
A B
2 2
C D E F
1
(3,8) (8,3) (5,5)
G H
(2,10) (1,0)
In order to define a complete strategy for this game, each of the players must choose
Give anan action
example oftwoachoice
at each of his behavioral
nodes. Thus westrategy:
can enumerate the pure strategies
of the players as follows.
A with probability .5 and G with probability .3
S1 = {(A, G), (A, H), (B, G), (B, H)}
Give an Sexample
= {(C, E), (C,of
2 a mixed
F ), (D, E), (D, F )} strategy that is not a behavioral
It is important to note that we have to include the strategies (A, G) and (A, H), even
strategy:
though once A is chosen the G-versus-H choice is moot.
(.6(A, G), .4(B,
The definition H)) and
of best response (why not?) in this game are exactly as they
Nash equilibria
are in for normal form games. Indeed, this example illustrates how every perfect-
In thisinformation
game
the
game can be converted to an equivalent normal form game. For example,
every game
perfect-information behavioral
of Figure 5.2 canstrategy corresponds
be converted into the normal form im- to a mixed
age of the game, shown in Figure 5.3. Clearly, the strategy spaces of the two games are
strategy...
Multi Agent Systems, draft of September 19, 2006
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 19
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
1 sH
HH
L HH R
HH
s Hs 2
HH
T T
T U T D
L T R T
T T
T T
1,0 100,100 5,1 2,2
1 sH
HH
L HH R
HH
s Hs 2
HH
T T
T U T D
L T R T
T T
T T
1,0 100,100 5,1 2,2
1 sH
HH
L HH R
HH
s Hs 2
HH
T T
T U T D
L T R T
T T
T T
1,0 100,100 5,1 2,2
1 sH
HH
L HH R
HH
s Hs 2
HH
T T
T U T D
L T R T
T T
T T
1,0 100,100 5,1 2,2
1 sH
HH
L HH R
HH
s Hs 2
HH
T T
T U T D
L T R T
T T
T T
1,0 100,100 5,1 2,2
1 sH
HH
L HH R
HH
s Hs 2
HH
T T
T U T D
L T R T
T T
T T
1,0 100,100 5,1 2,2
1 sH
HH
L HH R
HH
s Hs 2
HH
T T
T U T D
L T R T
T T
T T
1,0 100,100 5,1 2,2
again,whether
strategy
to play L or R in his information set, but once he decides he plays that pure
D strongly
consistently. Thusdominant
the payoff of 100 isfor 2 in the context of mixed strate-
irrelevant
Lecture Overview
1 Recap
2 Backward Induction
4 Perfect Recall
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 22
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Perfect Recall
Corollary
In games of perfect recall the set of Nash equilibria does not
change if we restrict ourselves to behavioral strategies.
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 24
Recap Backward Induction Imperfect-Information Extensive-Form Games Perfect Recall
Extensive Form Games: Backward Induction and Imperfect Information Games CPSC 532A Lecture 10, Slide 25