Or
Or
Game Theory
A mathematical theory designed to model: how rational individuals should behave when individual outcomes are determined by collective behavior strategic behavior Rational usually means selfish --- but not always Traditionally viewed as a subject of economics Subsequently applied by many fields evolutionary biology, social psychology
How should the monkeys each act so as to maximize their own calorie gain?
Choosing Strategies
In the simultaneous game, its harder to see what each monkey should do
Mixed strategy is optimal.
Trick: How can a monkey maximize its payoff, given that it knows the other monkeys will play a Nash strategy? Oftentimes, other techniques can be used to prune the number of possible actions.
5,3
0,0
5,3
0,0
5,3
0,0
Given that row will pick a, column will pick b. (a,b) is the unique Nash equilibrium.
Dominant Firm Game Two firms, one large and one small Either firm can announce an output level (lead) or else wait to see what the rival does and then produce an amount that does not saturate the market.
Follow
( 0.5,
4)
( 3, 2)
Follow
( 1, 8)
( 0.5, 1)
Subordin ate
Lead
Lead
Follow
( 0.5,
4)
( 3, 2)
Follow
( 1, 8)
( 0.5, 1)
Subordin ate
Lead
Lead
Follow
( 0.5,
4)
( 3, 2)
Follow
( 1, 8)
( 0.5, 1)
Follow
( 0.5,
4)
( 3, 2)
Follow
( 1, 8)
( 0.5, 1)
Follow
( 0.5,
4)
( 3, 2)
Follow
( 1, 8)
( 0.5, 1)
Follow
( 0.5,
4)
( 3, 2)
Follow
( 1, 8 )
( 0.5, 1)
A strategy ti 2 Si is a best response if there is no other strategy in Si that produces a higher payoff, given the opponents strategies.
(b,a) is a Nash equilibrium. To prove this: Given that column is playing a, rows best response is b. Given that row is playing b, columns best response is a.
Prisoners Dilemma
Each player can cooperate or defect
Column
cooperate defect
cooperate
Row defect
-1,-1
0,-10
-10,0
-8,-8
Prisoners Dilemma
Each player can cooperate or defect
Column
cooperate defect
cooperate
Row defect
-1,-1
0,-10
-10,0
-8,-8
Prisoners Dilemma
Each player can cooperate or defect
Column
cooperate defect
cooperate
Row defect
-1,-1
0,-10
-10,0
-8,-8
Prisoners Dilemma
Even though both players would be better off cooperating, mutual defection is the dominant strategy. What drives this?
One-shot game Inability to trust your opponent Perfect rationality
Prisoners Dilemma
Relevant to:
Arms negotiations Online Payment Product descriptions Workplace relations
Coordination Games
Consider the following problem:
A supplier and a buyer need to decide whether to adopt a new purchasing system.
Buyer
new
new Supplier old 0,0 20,20
old
0,0 5,5
No dominated strategies!
Coordination Games
Buyer new new Supplier old 20,20 0,0 old 0,0 5,5
This game has two Nash equilibria (new,new) and (old,old) Real-life examples: Beta vs VHS, Mac vs Windows vs Linux, others? Each player wants to do what the other does which may be different than what they say theyll do How to choose a strategy? Nothing is dominated.
Mixed strategies
Unfortunately, not every game has a pure strategy equilibrium.
Rock-paper-scissors
However, every game has a mixed strategy Nash equilibrium. Each action is assigned a probability of play. Player is indifferent between actions, given these probabilities.
Mixed Strategies
In many games (such as coordination games) a player might not have a pure strategy. Instead, optimizing payoff might require a randomized strategy (also called a mixed strategy)
Wife
football shopping
football
Husband shopping
2,1
0,0
0,0
1,2
Strategy Selection
Wife
shopping
2,1
0,0
0,0
1,2
If we limit to pure strategies: Husband: U(football) = 0.5 * 2 + 0.5 * 0 = 1 U(shopping) = 0.5 * 0 + 0.5 * 1 = Wife: U(shopping) = 1, U(football) = Problem: this wont lead to coordination!
Mixed strategy
Instead, each player selects a probability associated with each action
Goal: utility of each action is equal Players are indifferent to choices at this probability
a=probability husband chooses football b=probability wife chooses shopping Since payoffs must be equal, for husband:
b*1=(1-b)*2 b=2/3
For wife:
a*1=(1-a)*2 = 2/3
scissors
Setup
Player 1 plays rock with probability pr, scissors with probability ps, paper with probability 1-pr ps P2: Utility(rock) = 0*pr + 1*ps 1(1-pr ps) = 2 ps + pr -1 P2: Utility(scissors) = 0*ps + 1*(1 pr ps) 1pr = 1 2pr ps P2: Utility(paper) = 0*(1-pr ps)+ 1*pr 1ps = pr ps
Player 2 wants to choose a probability for each strategy so that the expected payoff for each strategy is the same.
Setup
qr(2 ps + pr 1) = qs(1 2pr ps) = (1-qr-qs) (pr ps) It turns out (after some algebra) that the optimal mixed strategy is to play each strategy of the time.
Intuition: What if you played rock half the time? Your opponent would then play paper half the time, and youd lose more often than you won. So youd decrease the fraction of times you played rock, until your opponent had no edge in guessing what youll do.
Repeated games
Many games get played repeatedly A common strategy for the husband-wife problem is to alternate
This leads to a payoff of 1, 2,1,2, 1.5 per week.
Requires initial synchronization, plus trust that partner will go along. Difference in formulation: we are now thinking of the game as a repeated set of interactions, rather than as a one-shot exchange.
Repeated games: Players repeatedly play a shorter game, receiving payoffs along the way.
Poker, blackjack, rock-paper-scissors, etc