Game Theory
Game Theory
J. Zico Kolter
1
Outline
Computing equilibria
2
Prisoner’s dilemma
• Two prisoners being interrogated, can either stay silent or
implicate the other one
4
General ideas
• Both these games differ slightly from what we have seen so far
in class: in order to decide our action we need to account for
other agents that are also acting (and trying to account for our
actions, ad infinitum)
5
Outline
Computing equilibria
6
Games in normal form
7
• Example: Prisoner’s dilemma
Silent Implicate
– N =2
−1
if a = (Silent, Silent)
−1
−5
if a = (Silent, Implicate)
0
– u(a) =
0
if a = (Implicate, Silent)
−5
−3
if a = (Implicate, Implicate)
8
−3
• Example: Guess 2/3 the mean
– N arbitrary
– A = {1, 2, . . . , 10}N
( 1
2
if ai = 3 mean(a)
k=1 1{ak =[ 3 mean(a)]}
Pm 2
– ui (a) =
0 otherwise
9
• Battle of the sexes
WL LW
WL 2,1 0,0
LW 0,0 1,2
10
• Rock, paper, scissors
11
• Some special cases:
– Zero-sum game: two player game where u1 (a) = −u2 (a),
∀a ∈ A (e.g., rock paper scissors)
L R
L 1,1 -1,-1
R -1,-1 1,1
12
Pure and mixed strategies
13
• The probability of set of actions a under strategy profile s is
N
Y
s(a) = si (ai )
i=1
X N
Y
ui (s) = ui (a) sj (aj )
a∈A j=1
14
Best response
15
Nash equilibrium
• Key definition: a strategy profile s is a Nash equilibrium if si is
a best response to s−i for all i = 1, . . . , N
Silent Implicate
17
• Rock paper scissors
18
• Battle of sexes
WL LW
WL 2,1 0,0
LW 0,0 1,2
19
• In 1950, John Nash proved that
every game has at least one
equilibrium point (important,
requires mixed strategies)
20
Outline
Computing equilibria
21
Can we compute Nash equilibria?
22
Computing an equilibrium with known support
23
• Battle of sexes, let’s guess that the support for a mixed strategy
contains both WL and LW for husband and wife, and say that
husband chooses WL with probability p
WL LW
WL 2,1 0,0
LW 0,0 1,2
– Utilities for all actions in support must be equal for both players
25
• What happens when we try a support that does not produce a
NE?
26
Finding the support
• But, many times we will find a solution much faster (c.f. search,
mixed integer programming, etc)
27
• Lemke-Howson algorithm (stated very imprecisely)
3. Otherwise, continue
28
Outline
Computing equilibria
29
N -player games
• For more than two players, precise analogue of Lemke-Howson
algorithm doesn’t exist, but local search can still be effective
subject to 1T si = 1, s≥0
subject to s1 ≥ 0, 1T s1 = 1, s2 ≥ 0, 1T s2 = 1
subject to s1 ≥ 0, 1T s1 = 1
which is equivalent to linear program
minimize t
s1 ,t
subject to s1 ≥ 0, 1T s1 = 1, C T u ≤ t1
31
• Somewhat surprisingly, this is actually the optimal strategy for
player 1, even if player 2 can play mixed strategies (proof
involves an optimization concept called duality)
32