Lecture3
Lecture3
--
Lecture 3
Patrick Loiseau
EURECOM
Fall 2016
1
Lecture 2 recap
• Defined Pareto optimality
– Coordination games
• Studied games with continuous action space
– Always have a Nash equilibrium with some conditions
– Cournot duopoly example
2
Outline
1. Mixed strategies
– Best response and Nash equilibrium
2. Mixed strategies Nash equilibrium computation
3. Interpretations of mixed strategies
3
Outline
1. Mixed strategies
– Best response and Nash equilibrium
2. Mixed strategies Nash equilibrium computation
3. Interpretations of mixed strategies
4
Example: installing checkpoints
• Two road, Police choose on which to check,
Terrorists choose on which to pass
Terrorist
R1 R2 • Can you find a Nash
equilibrium?
R1 1 , -1 -1, 1
5
Matching pennies
• Similar examples:
Player 2 – Checkpoint placement
heads tails
– Intrusion detection
– Penalty kick
heads 1 , -1 -1, 1
– Tennis game
Player 1
6
Pure strategies/Mixed strategies
• Game ( N, ( Ai )i∈N , (ui )i∈N )
• Ai: set of actions of player i (what we called Si
before)
• Action = pure strategy
• Mixed strategy: distribution over pure strategies
si ∈ Si = Δ(Ai )
– Include pure strategy as special case
– Support: supp si = {ai ∈ Ai : si (ai ) > 0}
• Strategy profile: s = (s1,, sn ) ∈ S = S1 ×× Sn 7
Matching pennies: payoffs
• What is Player 1’s payoff if Player 2
plays s2 = (1/4, 3/4) and he plays:
Player 2
– Heads?
heads tails
heads 1 , -1 -1, 1
– Tails?
Player 1
8
Payoffs in mixed strategies: general
formula
• Game ( N, ( Ai )i∈N , (ui )i∈N ) , let A = × Ai
i∈N
• If players follow a mixed-strategy profile s, the
expected payoff of player i is:
ui (s) = ∑ ui (a)Pr(a | s) where Pr(a | s) = ∏ si (ai )
a∈A i∈N
heads 1 , -1 -1, 1
Player 1
tails -1, 1 1, -1
• Does that look like it could be a
Nash equilibrium?
10
Best response
• The definition for mixed strategies is
unchanged!
12
Important property
• If a mixed strategy is a best response then
each of the pure strategies in the mix must be
best responses
è They must yield the same expected payoff
Proposition:
For any (mixed) strategy s-i, if si ∈ BRi (s−i ), then
ai ∈ BRi (s−i ) for all ai such that si (ai ) > 0 .
14
Matching pennies again
• What is the best response
of Player 1 to s2 = (¼, ¾)?
Player 2
heads tails
heads 1 , -1 -1, 1
• What is the best response
of Player 1 to s2 = (½, ½)?
Player 1
tails -1, 1 1, -1
15
Nash equilibrium definition
Definition: Nash Equilibrium
A strategy profile (s1*, s2*,…, sN*) is a Nash
Equilibrium (NE) if, for each i, her choice si* is a
best response to the other players’ choices s-i*
16
Matching pennies again
• Nash equilibrium:
Player 2 ((½, ½), (½, ½))
heads tails
heads 1 , -1 -1, 1
Player 1
tails -1, 1 1, -1
17
Nash equilibrium existence theorem
• In mixed strategy!
– Not true in pure strategy
• Finite game: finite set of player and finite
action set for all players
– Both are necessary!
• Proof: reduction to Kakutani’s fixed-point thm 18
Outline
1. Mixed strategies
– Best response and Nash equilibrium
2. Mixed strategies Nash equilibrium computation
3. Interpretations of mixed strategies
19
Computation of mixed strategy NE
• Hard if the support is not known
• If you can guess the support, it becomes very
easy, using the property shown earlier:
Proposition:
For any (mixed) strategy s-i, if si ∈ BRi (s−i ), then
ai ∈ BRi (s−i ) for all ai such that si (ai ) > 0 .
à p = 1/3 22
Example: battle of the sexes (3)
Player 2
Opera Soccer
Opera 2,1 0,0
Player 1
Soccer 0,0 1,2
à q = 2/3 23
Example: battle of the sexes (4)
Player 2
Opera Soccer
Opera 2,1 0,0
Player 1
Soccer 0,0 1,2
24
Example: prisoner’s dilemma
Prisoner 2
• We know that (D, D) is NE D C
Prisoner 1
C -6, 0 -2, -2
• A NE in strictly dominant
strategies is unique! 25
General methods to compute Nash
equilibrium
• If you know the support, write the equations
translating indifference between strategies in
the support (works for any number of
actions!)
• Otherwise:
– The Lemke-Howson Algorithm (1964)
– Support enumeration method (Porter et al. 2004)
• Smart heuristic search through all sets of support
• Exponential time worst case complexity
26
Complexity of finding Nash equilibrium
• Is it NP-complete?
– No, we know there is a solution
– But many derived problems are (e.g., does there
exists a strictly Pareto optimal Nash equilibrium?)
• PPAD (“Polynomial Parity Arguments on
Directed graphs”) [Papadimitriou 1994]
• Theorem: Computing a Nash equilibrium is
PPAD-complete [Chen, Deng 2006]
27
Complexity of finding Nash equilibrium
(2)
NP-hard
NP-complete
NP
PPAD
28
Outline
1. Mixed strategies
– Best response and Nash equilibrium
2. Mixed strategies Nash equilibrium computation
3. Interpretations of mixed strategies
29
Mixed strategies interpretations
• Players randomize
• Belief of others’ actions (that make you
indifferent)
• Empirical frequency of play in repeated
interactions
• Fraction of a population
– Let’s see an example of this one
30
The Income Tax Game (1)
Tax payer
Honest Cheat
Auditor
A 2,0 4,-10 p
q 1-q
31
The Income Tax Game: NE
computation
• Mixed strategies NE:
33
The Income Tax Game (6)
• What could ever be done if one policy maker
(e.g. the government) would like to increase
the proportion of honest tax payers?
34
The Income Tax Game: Trying to make
people pay
Tax payer
Honest Cheat
Auditor
A 2,0 4,-20 p
q 1-q
37
The penalty kick game
• 2 players: kicker and goalkeeper
• 2 actions each
– Kicker: kick left, kick right
– Goalkeeper: jump left, jump right
• Payoff: probability to score for the kicker,
probability to stop it for the goalkeeper
• Scoring probabilities: Goal keeper
L R
L 58.30 94.97
Kicker
R
92.91 69.92 38
The penalty kick game: results
• Ignacio Palacios-Huerta. Professionals Play
Minimax. Review of Economics Studies (2003).
• Result:
Goal L Goal R Kicker L Kicker R
NE prediction 41.99 58.01 38.54 61.46
Observed freq. 42.31 57.69 39.98 60.02
39
Summary
• Mixed strategies: distribution over actions
– A Nash equilibrium in mixed strategies always
exists for finite games
– Computation is easy if the support is known
• All pure strategies in the support of a best response are
also best responses
• Makes other player indifferent in his support
– Computation is hard if the support is not known
– Several interpretations depending on the game at
stake
40