ECON2112 Study Notes
ECON2112 Study Notes
Strategy
Chapter 1 Static Games of Complete Information
1.1 Basic Theory: Normal-Form Games and Nash Equilibrium
1.1A A Normal-Form Representation of Games
Game Theory
Game theory studies multi-person decision problems and hence deals with strategic situations
A game (strategic situation or multi-person decision problem) is a situation in which players (two or
more individuals) interact and thereby jointly determine the outcome
o In a normal form game, players take decisions simultaneously
o In an extensive form game, players take decisions sequentially
(Typically) every player can modify the outcome but (typically) no player has full control (cannot
determine the entire outcome)
The aim of game theory is to provide at least one solution for every game
o A solution is a set of recommendations, one for each player, about how to play the game,
such that no player would have an incentive not to follow this recommendation
o Hence, the solution must be stable
In static or simultaneous-move games players simultaneously choose actions and then receive
payoffs that depend on the combination of actions chosen
In dynamic or sequential move games one player chooses their action before the other players
choose their action
o Can use normal-form or extensive-form representations
o Central issue in dynamic games is credibility
o In games of complete information, each player's payoff function is common knowledge
among all players
Definition (Admissibility)
The principle that players should not use either strictly or weakly dominated strategies (special NE)
¿ ¿ ¿ ¿ ¿
ui ( s1 ,⋯ , s ⅈ−1 , s i , s i+1 , … , sn )≥ ui ¿
The set of player i’s best responses against ( s1 , … , sn ) (denotes as Ri ( s 1 , … , s n ) ) is the set of player
Theorem
¿ ¿
The strategy profile ( s1 , … , s n) is a Nash equilibrium if and only if for every i=1, .. . , n we have :
¿ ¿ ¿
s i ∈ Ri (s 1 , … , s n)
1.2 Applications
1.2A A Cournot Model of Duopoly – 1838 Model of Imperfect Competition
Assumptions
There are two firms, 1 and 2 competing in a duopolistic market
These firms produce an identical (homogeneous) good
They to choose the quantities that they are going to produce simultaneously
The market clearing price when Q is the total amount in the market is:
{
p= a−Q if Q<a
0 if Q ≥ a
o P(Q)=a−Q is the market-clearing price when the aggregate quantity is Q=q 1 +q2
{
π i ( q1 , q2 )= p q i−c qi= ( a−q 1−q 2 ) qi−c q i
−c q i
if q 1+ q2 <a
if q 1 +q2 ≥ a
Face the same price as firms are competing in a market of undistinguishable goods
Solution
¿ ¿
A strategy profile (q 1 , q 2) is a Nash equilibrium if:
π 1 ( q¿1 , q¿2 ) ≥ π 1 ( q1 , q¿2 ) for every q 1 ∈ ¿
δπ1
=0=a−c−q2−2 q 1
δ q1
1
q1 =R1 ( q2 )= (a−c−q 2)
2
2 q¿2=(a−c−q ¿1)
o Ensures that both firm 1 and 2 are best responding to the other
¿ ¿ a−c
q1 =q2 =
3
a−c q2
q1 = −
2 2
a−c
Assume q ≥ 0 , ∴ R1 (0)= 2
a−c
2 becomes the maximum quantity ( q m ) as firms can never play a negative
quantity
a−c
o The monopoly quantity 2 strictly dominates any larger quantity
[
S 11= 0,
a−c
2 ] [
; S12= 0,
a−c
2 ]
R1 ( a−c2 )= a−c2 − 12 ( a−c2 )= a−c4
a−c a−c
4 becomes the minimum quantity as firms will never produce above 2
o (infinite iterations) … Process ends at Nash equilibrium
a−c a−c
S ∞1 ={ }; S∞2 ={ }
3 3
The Game
Players: Firms 1 and 2
Strategy: pi=[ 0, ∞ ) for i=1,2
Payoffs for firms:
{
( p1−c ) Q if p1 < p2 F 1 undercuts F 2( consumers buy all their products)
π 1 ( p1 , p 2) = 1 ( p 1−c ) Q if p1= p 2 Same price(firms receive half each)
2 F 1 sells nothing , price above competitors
if p1 > p2
0
Solution
¿ ¿
Nash equilibrium: Prices ( p1 , p2 ) such that:
π 1 ( p¿1 , p ¿2 ) ≥ π 1 ( p 1 , p¿2 ) for every p1 ∈¿
o b>0 reflects the extent to which firm i's product is a substitute for firm's j product
o Unrealistic demand function because demand for firm i's product is positive even when firm
i charges an arbitrarily high price, provided firm j also charges a high enough pric
o We assume b<2 to ensure the problem makes sense
The game is the same as the normal Bertrand Model of Duopoly
Solutions:
o Given p2 firm one chooses a price p1 so that:
max π p , p = p −c q =( p −c ) a− p +b p
( ) ( 1 ) 1 ( 2)
p1 1 1 2 1 1
1
p1=R1 ( p 2 )= (a+b p2 +c )
2
¿ ¿
o The profile ( p1 , p2 ) is a Nash equilibrium if it solves:
¿ 1 ¿ ¿ 1 ¿
p1= ( a+ b p 2+ c )∧ p2= (a+b p1 +c )
2 2
The Game
Players: n farmers
Strategy sets: gn= [ 0, ∞ ) for all n
Payofs: given a strategy profile (g1 ,. . . , gn ) the payoff to farmer i is:
A−( g1 +…+ gn ¿2 ) −c g i
π i ( g1 , … , gn ) =gi ¿
Solution
g
Take the first order condition using the chain rule ( g i part of (¿ ¿ 1 , … , gn ) to obtain:
¿
g1 +…+ gn ¿2−2 ( g1 +…+ g n ) g i−c=0
A−¿
Aggregate over every farmer:
2
g1 +…+ gn ¿ −nc=0
g1 +…+g n ¿ 2−2 ¿
nA−n ¿
Total number of goats in a Nash equilibrium:
¿
A−c ¿
¿
n
¿
2+n
G=( g1 + …+ g n )=¿
Optimality
The optimal number of goats is the amount that maximises total payoff:
G( A−G2)−cG
Taking the first order condition by chain rule:
2 2
A−G −2G −c=0
From where we can obtain a value for the socially optimal number of goats:
¿
A−c ¿
¿
n
¿
2+ n
1
1
[ ]
G= ( A−c ) 2 vs .Ge =¿
3
In a NE there are always more goats than the socially optimal (as long as n>1 )
e
As soon as there is more than one farmer, G will be greater than socially optimal
The Game
Strategies: S 1=S2= [ 0,1 ]
Payoffs:
1
o When I 1 < I 2 party 1 gets a share of the votes equal to (I + I )
2 1 2
1 1
u1=1∧u2=0 when ( I 1 + I 2 ) >
2 2
1
o If I 1 =I 2 then u1=u2 =
2
Solution
¿ ¿
Ideologies I 1 and I2 are a NE if no party can unilaterally increase their
chances of winning the election.
1
The unique Nash equilibrium is I 1 =I 2=
2
Furthermore, player i is indifferent between any two pure strategies used with strictly positive probability
¿
by pi (otherwise he would play a pure strategy)
E.g. under the NE= ( 12 H + 12 T , 12 H + 12 T ) player 1 and 2 are both indifferent between H and T
Information Sets
Definition (Information Sets)
Information sets tell us when players are moving.
An information set contains all the information that a player possesses when he must move.
When a player moves at some information set, this player cannot distinguish between decision
nodes (i.e. the histories leading to the decision nodes) contained in that information set
Every decision node inside the same information set must have the same set of choices available.
Perfect Recall
Definition (Perfect Recall)
When a player has to move he remembers all his previous moves and whatever he knew before. Not:
Once we know what the set of pure strategies is for each player, we can write down the normal form
representation of an extensive form game and solve for Nash Equilibrium
Mixed strategies
In an extensive form game, a player’s mixed strategy is a probability distribution on the set of that
player’s pure strategies
Theorem (Kuhn)
In any extensive form game with perfect recall, every probability distribution on ending nodes induced by
a mixed strategy profile can also be induced by some behavioural strategy profile
The consequence is that, although mixed strategies are more general, there is no loss of
generality in working only with behavioural strategies
Behavioural strategies
In an extensive form game, a player’s behavioural strategy specifies at:
1. Every decision node where a player must move (perfect information), or
2. Every information set where he must move (imperfect information)
a probability distribution over the choices that follow
Set of behavioural strategy profiles is the set of profiles that look like
this:
( xL+ ( 1−x ) R , ya+(1− y )b , zc +(1−z )d )
where 0 ≤ x ≤1, 0 ≤ y ≤ 1, 0≤ z ≤ 1
Example 2:
( xL+ ( 1−x ) R , ya+(1− y )b , zc +(1−z )d , wA +(1−w)B)
where: 0 ≤ x ≤1, 0 ≤ y ≤ 1, 0≤ z ≤ 1, 0 ≤ w≤ 1
where 0 ≤ x ≤1, 0 ≤ y ≤ 1, 0≤ z ≤ 1
Backwards Induction
Definition (Backwards Induction)
Players should optimise and make their choices in a way consistent with deductions about other players’
rational behaviour in the future. Implemented through use of the computational method.
Backwards Induction Computation in perfect information games
1. Find optimising behaviour at the last decision nodes in the game
2. Substitute those decision nodes by the expected payoff induced by such optimising behaviour
3. Repeat the same procedure in the resulting perfect information game
The result of this procedure is the behavioural strategy
Last stages
We first compute firm 2's reaction to an arbitrary quantity by firm 1
Suppose firm 1 chose q1 =q́1
o ( a−Q ) q2−c q 2
Whereas in the Cournot analysis, R2 (q 1) is firm 2's best response to a hypothesized quantity
to be simultaneously chosen by firm 1
Firm 1 will anticipate firm 2’s best response function
First stage
¿
Firm 1 knows that if he produces q1 firm 2 will produce according to q 2 ( q 1 )
Firm 1 will choose q1 to maximise:
π 1 ( q 1 )=( a−( q 1+ q¿2 ( q 1) ) ) q 1−c q1
( (
π 1 ( q 1 )= a− q1 +
a−q1−c
2 ))
q1−c q 1
d π 1 ( q1 ) a−c
=0 → q¿1 =
d q1 2
Player 2 either accepts the offer (in which case the game ends and the payoffs are immediately
received) or rejects the offer in which case play continues to the second period
At the beginning of the second period, player 2 proposes that player 1 take a share of s 2 of the
dollar, leaving 1−s2 for player 2 (convention that s t always goes to player 1)
Player 1 either accepts the offer or rejects the offer
At the beginning of the third period, player 1 receives a share s of the dollar, leaving 1−s for
player 2 where 0< s<1 (this is given exogenously)
o If player 1 offers 1−s1=δ (1−δs) , player 1 rejects and payoffs are (δ 2 s , δ(1−δs))
Therefore, player 1 proposes a split (1−δ(1−δs) , δ (1−δs)) In the subgame perfect equilibrium,
payoffs are (1−δ(1−δs), δ (1−δs))
If the subgame contains a decision node x then it also contains every choice, decision node,
and final node that comes after x
Note: Every extensive form is also a subgame of itself
Definition (Proper subgame)
A proper subgame is a subgame that is different from the entire game that contains it
A game with three subgames vs. a game without proper subgames:
¿
Similarly, if firm 2’s cost is low, q 2 ( c L ) will solve:
max [ ( a−q 1−q 2 )−c L ] q 2
¿
q2
Finally, firm 1 knows that firm 2’s cost is high with probability θ and should anticipate that firm
¿ ¿
2’s quantity choice will be q 2(c H ) or q 2(c L ) depending on firm 2’s cost.
¿
Thus, firm 1 chooses q1 to solve (so as to maximise expected profit):
[ ] [ ]
max θ ( a−q1−q ¿2 (c H ) ) −c q1 +(1−θ) ( a−q 1−q ¿2( c L ) ) −c q1
q1
q=¿ [ ] [
θ ( a−q1 −q2 (c H ) ) −c +(1−θ) ( a−q 1−q 2 (c L ) ) −c
¿ ¿
]
1
2
The solutions to the three-first order conditions are:
a−2c H + c 1−θ
q ¿2 ( c H )= + (c H −c L )
3 6
a−2 c L +c θ
q ¿2 ( c L ) = − (c H −c L )
3 6
a−2 c +θ c H + ( 1−θ ) c L
q1¿ =
3
describe i's uncertainty about the n-1 other players’ possible types, t −i given i's own type, t i . We
denote this game by G=( A , T , p ,u ) .
Following Harsanyi (1967), we will assume that the timing of a static Bayesian game is:
o Nature draws a type vector t=(t 1 , … ,t n) , where t i is drawn from the set of possible
types T i (assume this is done according to the prior probability distribution p(t)
When nature then reveals t i to player i, he or she can compute the belief
pi (t −i ∨t i ) using Bayes’ rule:
Frequently assume that the players’ types are independent, in which case pi (t −i )
does not depend on t i but is still derived from the prior distribution p(t)
o Nature reveals t i to player i but not to any other player
o The players simultaneously choose actions, player i choosing ai from the feasible set
Ai
o But if types are independent Pr ( t i|t −i ) = p(t i) and so pi ( t −i|t i )= p(t −i) : beliefs = prior
Beliefs p≡ p1 , … , pn
That is, each player-type is maximizing the expected payoff conditional on her own type
3.1 Applications
3.2A Mixed Strategies Revisited
Extending the Definition of Bayes Nash Equilibrium
Note that if T is a continuum, then p must be defined as a density and expectations are in this form
[ ] ∫ u ( s (t ) , … , s
E ui ( a i , s−i ( t −i )|t i )|t i ≡
t −i ∈ T −i
i 1 1 i−1 ( t i−1 ) , ai , s i+1 ( t i +1 ) , sn ( t n )|t i) pi ( t −i∨t i)
3.2A An Auction
First-price, sealed-bid auction
Two bidders, i=1,2
i’s valuation is v i
v i U (0,1) (iid draws)
Tie-breaking rule is fair coin
+¿ ≡ ¿
Actions: bi ϵ R¿
Types: T i =[0,1]
Beliefs: pi ( v i|v j )= p ( vi ) , i≠ j
Payoffs:
{
v i−b i
if b i>b j
v i−b i
ui ( bi , b j|bi )= if bi =b j
2 otherwise
0
Strategies: we can write bi ( v i) , i.e. the bid that player 1 would choose if type i is drawn
Since v j is uniformly distributed on (0, 1), the probability that it is exactly equal to any number is
0:
max ( v i−bi ) Pr ( bi >α + β v j )
bi
bi−α b i−α
Notice that (
Pr ( b i> α + β v j )=Pr v j <
β
=) β
bi−α
max ( v i−bi )
bi β
2 v i+ α
f .o.c bi =
β β
We get bi (vi) = vi2 2 + α
How do we find α?
Notice that no bidder should ever bid more than her valuation: bi (vi) ≤ vi
o Indeed, in a symmetric equilibrium there is always a chance of winning, hence any bid bi > vi
gives negative expected payoff
In particular bi (0) =α 2≤ 0 (together with bi ∈ R+) ⇒ α = 0
There is a symmetric (Bayes-Nash) equilibrium where bidders bid half their valuation
Appendix 3.2B shows that this is the unique symmetric equilibrium with strictly increasing,
differentiable strategies.
2 1 1 2
Nash Equilibria: ( Opera ,Opera ) , ( Fight , Fight ) ,( 3 Opera+ 3 Fight , 3 Opera+ 3 Fight)
A player mixed strategy might represent her opponent’s uncertainty about her choice of a pure
strategy (Harsanyi, 1973)
Formally, let Chris’s payoff if both go to the Opera be 2+t c
Similarly, Pat’s payoff if both go to the Fight is 2+t p
t c and t p are uniformly distributed over [0, x ] and we will look at x → 0 (hence, a slight
perturbation of the payoffs, not a big one)
Threshold Strategies
let us focus on (pure) strategies of the following type:
o Chris plays Opera if t c is greater than a threshold c
o Pat plays Fight if t p is greater than a threshold p
x−c x− p
notice that this implies that each play favourite option with probability x and x we
Expected Payoffs
Chris’ expected payoffs from playing Opera Fight are:
p p p
x
( ( )
2+t c ) + 1− ∙ 0= ( 2+t c )
x x
p p p
∙0+ (1− ) ∙ 1=1−
x x x
Playing Opera is optimal if and only if:
x
t c ≥ −3=c
p
Repeating the exercise for Pat gives:
x
p= −3
c