1 Correlated Equilibria: Battle of Sexes. Recall, That This Game Has Two Players Who Want To Go Either To A Baseball Game
1 Correlated Equilibria: Battle of Sexes. Recall, That This Game Has Two Players Who Want To Go Either To A Baseball Game
1 Correlated Equilibria: Battle of Sexes. Recall, That This Game Has Two Players Who Want To Go Either To A Baseball Game
1 Correlated equilibria
In this lecture we will look at correlated equilibria. We start with an example. Consider the game
Battle of Sexes. Recall, that this game has two players who want to go either to a baseball game
(B) or a softball game (S). The payoff matrix is as given below. The first entry in a cell denotes
the player I’s reward and the second entry denotes player II’s reward. We can see that (S,S) and
(B,B) constitute deterministic Nashes, with rewards (2,1) and (1,2), respectively. A randomized
Nash consists of player I choosing S with probability 2/3 and B with probability 1/3 and player II
choosing S with probability 1/3 and B with probability 2/3. The expected reward in this case is
(2/3, 2/3), which is fair but lower than the worst outcomes in the deterministic Nashes.
S B
S 2,1 0,0
B 0,0 1,2
Now, consider a situation where a “trusted” authority flips a fair coin and based on the outcome
of the coin toss, tells the players what they should do. So, for example, if the coin shows heads,
player I is told to choose baseball and player II is told to choose baseball. Similarly, both players are
told to choose softball when the outcome is tails. (??) It is important to note that no individual
party has an incentive to deviate from what they are told to do. In this case, when player I is told
to choose B, he knows that player II is told to choose B as well. So, player I has no incentive to
deviate and switch to S as the payoff would be lower (0 compared to 1). The advantage of following
such a procedure is that the expected rewards are now higher – (3/2, 3/2) compared to that of
(2/3, 2/3) from the mixed NE.
Outcome Probability
(B,B) 1/2
(S,S) 1/2
A more interesting example of correlated equilibria is the game of Chicken. This is again a
two player game with the payoff matrix as shown below. In this case, the worst outcome occurs
when both players dare (D,D). The deterministic Nashes are (D,C) and (C,D) with rewards (7,2)
and (2,7), respectively. (??) A randomized Nash has players I and II choosing C and D with
probabilities 2/3 and 1/3, respectively. This randomized Nash has an expected reward of 4 23 for
each player.
C D
C 6,6 2,7
D 7,2 0,0
Now, let’s look what happens in the case of a correlated equilibrium. As before, a trusted party
tells each player what to do based on the outcome of the following experiment:
Outcome Probability
(C,D) 1/3
(D,C) 1/3
(D,D) 1/3
We note once again that the trusted party only tells each player what he/she is supposed to
do. The trusted party does not reveal what the other player is supposed to do. It is a correlated
equilibrium if no player wants to deviate from the trusted party’s instruction.
(??) So, in the Chicken example, if the trusted party tells player II to dare, then II has no
incentive to deviate. This is because II knows that the outcome must have been (C,D) and that
player I will obey the instruction to chicken. Next, let us consider the case when player II is told
to chicken. Then player II knows that the outcome must have been either (D,C) or (C,C), each
happening with equal probability. II’s expected payoff on playing C conditioned on the fact that
II is told to chicken is 12 ∗ 6 + 12 ∗ 2 = 4. In the above expression, 6 is the payoff from I also
playing C, i.e., the outcome was (C,C) and 2 is the payoff II gets when I plays D, i.e., outcome was
(D,C). If player II decides to deviate, i.e., play D when told to play C, then the expected payoff is
1 1
2 ∗ 7 + 2 ∗ 0 = 3.5 < 4. So, the expected payoff on deviating is lower than the payoff on obeying the
instruction of the trusted party. Therefore player II doesn’t deviate. Since the game is symmetric,
player I also has no incentive to deviate from the instruction of the trusted party. Note that in the
case of the correlated equilibrium, the expected reward for each player is = 13 ∗ 7 + 23 ∗ 4 = 5. This
is higher than the expected reward of 4 23 in the randomized Nash. Therefore, rewards can be made
better by correlation.
(??) Finally , we consider the general case of k player matrix games, where player i has ni
pure strategies. Then, we can find a correlated equilibrium in time polynomial in n1 n2 . . . nk using
linear programming.
Remarks
• We contrast this with the problem of finding a Nash equilibrium for a general game, for which
no polynomial time algorithm is known.
(??)For a correlated equilibrium, we need to find a probability distribution on the set of all
possible strategies. Let si be an element of the set of pure strategies of player i and let s =
(s1 , . . . , sk ). Also let payoffi (s) be the payoff to player i when strategy s is followed by the players.
p(s1 , . . . , sk ) denotes the probability with which the trusted party observes the event (s1 , . . . , sk ),
in which case players 1, . . . , k are told to play strategies s1 , . . . , sk , respectively. To ensure that a
correlated equilibrium results, no player should have an incentive to deviate from the instruction.
So, if player i is told to play si , then then all other strategies for that player should have no better
outcome. Thus we want the expected payoff of strategy si to be at least as great as the expected
0
payoff when player i alone switches to some other strategy si .
X X 0 0
p(s)payoffi (s) ≥ p(s)payoffi (s1 , .., si , ..sk ) ∀si (1)
s:si =si 0
s:si =si
P
In addition, since p(s) is a probability distribution, p(s) ≥ 0 and p(s) = 1. The above
inequalities define a linear program which can be solved to get a correlated equilibrium. We note: