Game Theory - Lecture 3
Game Theory - Lecture 3
Player 2
Heads Tails
Player 1 Heads 1, 1 1, 1
Tails 1, 1 1, 1
By looking at the best reply functions, we can see that the game does not have
any (pure) Nash equilibrium
A mixed strategy is a probability distribution over 2 or more pure strategies.
I The key strategic feature: each player would like to outguess the other
I Similar situations arise in poker, battles, competitive games (football,
tennis...), etc:
I in poker, an analogous question is how often to blu↵: suppose that
player i is known never to blu↵, then i’s opponent will fold whenever
i bids aggressively, but this will make it worthwhile for i to blu↵...
I on the battlefield, an analogous question is where to attack:
suppose that the attackers can choose between two locations (by
land or by sea) and that the defense can counter either attack if
(and only if) it is anticipated correctly...
I Recall our exercise on a kicker and the goalie – the same spirit
The common features in such strategic situations: there is no Nash equilibrium
(in the sense we used so far – ”pure strategy Nash Equilibrium”).
But it does not mean that we cannot say anything about agents’ eventual
behaviour.
Definition
I Consider the n-player normal-form game G = {S1 , ..., Sn ; u1 , ..., un }.
I Suppose that player i has K pure strategies: Si = {si1 , ..., siK }.
I A mixed strategy for player i is a probability distribution i = ( i1 , ..., iK )
where ik is the probability that player i will play strategy sik ,
for k = 1, ..., K .
I Since ik is a probability, it must be that 0 ik 1 for k = 1, ..., K and
K
X
ik =1
k=1
Lets extend the definition of Nash equilibrium to allow for mixed strategies
I the earlier definition that we have used so far requires that each player’s
pure strategy is a best response to the other players’ pure strategies
I the extended definition requires that each player’s mixed strategy is a
best response to the other players’ mixed strategies
I since any pure strategy can be represented as a mixed strategy that puts
zero probability on all of the player’s other pure strategies, the extended
definition is more general – it covers the earlier one
A mixed strategy will be given by a cumulative distribution function
A belief for player i is a probability distribution over the strategies of his opponents.
I suppose player 1 believes that player 2 will play Heads with probability q and
Tails with probability 1-q We refer to such a mixed strategy as a pure strategy.
I lets look for player 1’s pure-strategy best response to player 2’s mixed strategies
I given his belief, player 1’s expected payo↵s are: With these
equations in hand,
v1 (H,q) = 1 - 2q q( 1) + (1 q)1 = 1 2q if he chooses Heads we can calculate
the best response
of player 1 for any
v1 (T,q) = 2q - 1 q1 + (1 q)( 1) = 2q 1 if he chooses Tails choice q of player
2.
I 1 2q > 2q 1 if and only if q < 1/2, then player 1’s best response is
I the expected payo↵ is increasing in r if 2 4q > 0, i.e., q < 1/2 and decreasing
in r if q > 1/2 , then player 1’s best response is Maximise the linear response
Each player chooses r =1 if q < 1/2 w.r.t r
their strategy independently,
the probability of each outcome Player 1 will definitely
r 2 [0, 1] if q = 1/2 play heads, if he knows that
is the product of the probabilities
of the strategies that make up the player 2 will play heads with
outcome. r =0 if q > 1/2 a probability of < 0.5
I with the result we found when considering player 1’s mixed-strategy best
response to player 2’s mixed strategy
r =1 if q < 1/2
To find a Nash equilibrium we are
looking for a pair of choices (p, q) for
which the two best-response r 2 [0, 1] if q = 1/2
correspondences cross.
r =0 if q > 1/2
I this last statement is stronger because it means that
if q < 1/2 Heads is the best response (among all strategies, pure or mixed) and
if q > 1/2 Tails is the best response (among all strategies, pure or mixed)
Definition
The expected payo↵ of player i when he chooses the pure strategy si 2 Si
and his opponents play the mixed strategy i 2 (S i ) is
X
vi (si , i ) = i (s i ) · vi (si , s i )
s i 2S i
Similarly:
Definition
The expected payo↵ of player i when he chooses the mixed strategy
i 2 (Si ) and his opponents play the mixed strategy i 2 (S i ) is
0 1
X X X
vi ( i , i ) = i (si )·vi (si , i (s i )) = @ i (si ) · i (s i ) · vi (si , s i )A
si 2Si si 2Si s i 2S i
Definition
The expected payo↵ of player i when he chooses the mixed strategy
i 2 (Si ) and his opponents play the mixed strategy i 2 (S i ) is
0 1
X X X
vi ( i , i ) = i (si )·vi (si , i (s i )) = @ i (si ) · i (s i ) · vi (si , s i )A
si 2Si si 2Si s i 2S i
That is,
I player 1’s expected payo↵ from playing the mixed strategy ( 11 , ..., 1J ) is
the weighted sum of the expected payo↵ from playing each of the pure
strategies (s11 , ..., s1J ) where the weights are the probabilities ( 11 , ..., 1J )
A profile α* of mixed strategies is a mixed strategy Nash equilibrium if and only if every player’s
mixed strategy is a best response to the other players’ mixed strategies.
Definition
The mixed-strategy profile = ( 1⇤ , ..., n⇤ ) is a Nash Equilibrium if for each
player i⇤ is a best response to ⇤ i , that is for all i 2 N
⇤ ⇤ ⇤
vi ( i , i) vi ( i , i)
Player 2
Heads Tails
Player 1 Heads 1, 1 1, 1
Tails 1, 1 1, 1
I we’ve already computed the values of r denoted r ⇤ (q) such that player 1’s mixed
strategy (r , 1 r ) is a best response to player 2’s mixed strategy (q, 1 q)
I now lets compute the values of q, denoted q ⇤ (r ), such that player 2’s mixed
strategy (q, 1 q) is a best response to player 1’s mixed strategy (r , 1 r )
Heads 1
r*(q)
There is only one mixed
strategy Nash equilibria. 1/2
q*(r)
(0.5, 0.5) is a proper mixed
strategy.
Tails 0 q
0 1/2 1
Tails Heads
Again, we call r ⇤ (q) and q ⇤ (r ) correspondences instead of functions, since there exists a value of q such
that r ⇤ (q)ofhas
University St.more thanHS23
Gallen, one value, and there exists a value of r such
Alia that q ⇤ (r ) has more than one value. 132
Gizatulina
Mixed Nash Equilibrium
Observation: A player who randomizes between two actions (H,T) is indi↵erent
between playing H or T. This is the general feature of mixed NE:
Mixed strategies are not intuitive:
You randomize to make me indifferent.
Proposition
Let Si+ ⇢ Si denote the subset of pure strategies that player i plays with
⇤
positive probability in mixed strategy profile = ( 1 , .., I ). Strategy profile
is a Nash equilibrium in game G , if and only if for all i = 1, ..., n
If row player is
1. mixing on all of
⇤ her strategies in
vi (si , i) = vi (si0 , ⇤
i) = vi ( ⇤
i ,
⇤
i) for all si , si0 2 Si+ a NE, then each
must yield the
2. same expected
+ payoff.
vi (si , i) vi (si0 , i) for all si 2 Si+ and all si0 62 Si
Bill
Opera Basketball
Ann Opera 2, 1 0, 0
Basketball 0, 0 1, 2
I let (q, 1 q) be the mixed strategy in which Bill plays Opera with probability q
and Basketball with probability 1 q
I let (r , 1 r ) be the mixed strategy in which Ann plays Opera with probability r
and Basketball with probability 1 r
q 1-q
Bill
Opera Basketball
Opera 2, 1 0, 0 r
Ann
Basketball 0, 0 1, 2 1-r
Bill
Opera Basketball
Ann Opera 2, 1 0, 0
Basketball 0, 0 1, 2
r 1 + (1 r )0 = r if he chooses Opera
Opera 1
2/3
q*(r)
r*(q)
Basketball 0 q
0 1/3 1
Basketball Opera
I the two best-response functions intersect in three points – three Nash Equilibria:
I (q,1-q)=(1/3,2/3) and (r,1-r)=(2/3,1/3) Mixed Strategy
I (q,1-q)=(0,1) and (r,1-r)=(0,1) Pure Strategy
I (q,1-q)=(1,0) and (r,1-r)=(1,0) Pure Strategy
matrix
Find all (pure and mixed strategy) Nash Equilibria in the following
games. What are the players’ expected payo↵s in games I and II?:
Player 2
q 1-q
Left Right
H D
r
Up 1, 3 1, 2
Player 1
H 1, 1 10, 0
1-r
Down 2, 1 0, 0
D 0, 10 1, 1
Game II
Game I H/W Done:
Game 1 + 2 + 3
L M R
L M R
U 0, 1 1, 2 1, 0
U 6, 10 2, 8 0, 7
C 1, 0 0, 1 1, 2
B 8, 2 4, 4 1, 1
D 1, 2 1, 0 0, 1
Game III
Game IV
In each game, find payo↵s from the mixed strategy equilibrium for each
player
The rest of the lecture: will collect various dribs and drabs in static games of
complete information:
I We extend the notion of rationalizability to allow for dominance of mixed
strategies
I We will study the question of existence of a Nash Equilibrium (informally)
I We will discuss can one select an equilibrium when there are many of
them
Equipped with the mixed strategies, we now can extend the notion of
”dominated strategy” by allowing for mixed strategies
Definition
Let i 2 (Si ) and si0 2 (Si ) be possible strategies for player i, we say si0 is
strictly dominated by i if
Thus, player i can ignore strategy (σi).
ui ( i , s i ) > ui (si0 , s i )
for any s i 2S i
If player 2 is playing each of his strategies, L and C, with a
and relatedly probabilities p and 1 - p, respectively, then player 1 can calculate his
expected utility for each of his own strategies.
Definition
A strategy i is never a best response if there are no beliefs i 2 (S i ) for
player i for which i 2 BR( i )
By iterating on the knowledge of rationality, we iteratively delete strategies that are never best responses.
The set of strategies for a player that survives this iterated deletion of never best responses is called her set of
rationalizable strategies.
I When this is the case the assumption that a particular Nash equilibrium is
played relies on there being some mechanism or process that leads all
players to expect the same equilibrium.
I Weak-dominance: Choose the equilibrium that does not involve the play
of weakly dominated strategies.
I For example, the following game have two Nash Equilibria, one is in
dominated strategies.
1\2 L R
U 1,1 0, 0
D 0, 0 0, 0
1\2 L R
U 10, 0 5, 2
D 10, 11 2, 0
I The key idea is to look for Nash equilibria that remain to be Nash
equilibria even if with some very small probability, players make mistakes.
This concept, when used in a game of cards, can refer to a playing unintentionally playing
the wrong card through error (popularly known as tremble).
Nash equilibrium is trembling hand perfect if, for some small trembles by each
Proposition player, the Nash equilibrium strategies remain best responses.
⇤ ⇤
A Nash equilibrium = ( 1,..., I ) of a normal form game is trembling-hand
perfect if and only if
1. There is some sequence { k }1 k=1 of totally mixed strategies (mixed
strategies in which every pure strategy receives positive probability)
converging to the equilibrium strategies, i.e., limk!1 k = .
(L,R) is not a NE as there is another combination of strategies, (U, L) where both players obtain
greater payoffs.
By playing strategy U or L, each player guarantees a payoff at least as high (and sometimes a higher) than
that obtained by playing D.
I We have that:
Suppose
that the k k
column
1. limk!1 1 = (1, 0) = [U] limk!1 2 = (1, 0) = [L]
player plays
(1 − 1/k, 1/k),
then the row
2. And U and L are best responses since: For Player 1, playing U
is better than D
player’s
payoff is: ✓ ◆ ✓ ◆
k 1 1 1 k 1 1
u1 (U, 2) =1 1 +0 =1 u1 (D, 2)
=0 1 +0 =0
k k k k k
✓ ◆ ✓ ◆
1 1 1 1 1
u2 ( 1k , L) = 1 1 +0 =1 u2 ( 1k , R) = 0 1 +0 =0
k k k k k
Suppose that the row player plays (1 − 1/k, 1/k), then the column player’s payoff is:
University of St. Gallen, HS23 Alia Gizatulina 153
Equilibrium Selection: Trembling-hand perfection.
1/k 1 - 1/k
1\2 L R
1/k U 1,1 0, 0
1 - 1/k D 0, 0 0, 0
I We have that:
k k
1. limk!1 1 = (0, 1) = [D] limk!1 2 = (0, 1) = [R]
Proposition
If = ( 1 , . . . , I ) is a (normal form) trembling-hand perfect Nash equilibrium,
then i is not a weakly dominated strategy for any i = 1, . . . , I .
I The above suggest that strict Nash Equilibria cannot be ruled out by the
requirement of trembling-hand perfection.
I How can we select among strict Nash equilibria?
I Harsanyi and Selten suggested two refinements:
I Payo↵ (or Pareto) dominance
I Risk dominance
Payo↵ dominance is rather obvious: players will chose as a focal point payo↵
dominant equilibrium.
Risk dominance?
Proof
It involves a fixed-point theorem.
f(x)
f(x*)
45˚
0 x
0 x* 1
Your homework: