Game-Theoretic Analysis Tools: Tuomas Sandholm
Game-Theoretic Analysis Tools: Tuomas Sandholm
tools
Tuomas Sandholm
Professor
Computer Science Department
Carnegie Mellon University
Terminology
• In a 1-agent setting, agent’s expected utility maximizing strategy is well-
defined
• But in a multiagent system, the outcome may depend on others’ strategies
also
• Game theory analyzes stable points in the space of strategy profiles =>
allows one to build robust, nonmanipulable multiagent systems
• Agent = player
• Action = move = choice that agent can make at a point in the game
• Strategy si = mapping from history (to the extent that the agent i can
distinguish) to actions
• Strategy set Si = strategies available to the agent
• Strategy profile (s1, s2, ..., s|A|) = one strategy for each agent
• Agent’s utility is determined after each agent (including nature that is
used to model uncertainty) has chosen its strategy, and game has been
played: ui = ui(s1, s2, ..., s|A|)
Game representations
Matrix form
Extensive form (aka normal form
aka strategic form)
Down player 2
Right 7, 8
Woman
boxing ballet
boxing 2, 1 0, 0
Man
ballet 0, 0 1, 2
Criticisms of Nash equilibrium
• Not unique in all games, e.g. Battle of the Sexes
– Approaches for addressing this problem
• Refinements (= strengthenings) of the equilibrium concept
– Eliminate weakly dominated strategies first
– Choose the Nash equilibrium with highest welfare
– Subgame perfection
– …
• Focal points
• Mediation
• Communication
• Convention
• Learning 1, 0 0, 1
• Thrm.
– Any finite game,
– where each action node is alone in its information set
• (i.e. at every point in the game, the agent whose turn it is to
move knows what moves have been played so far)
– is dominance solvable by backward induction (at
least as long as ties are ruled out)
• Constructive proof: Multi-player minimax search
Rock-scissors-paper game
Sequential moves
Rock-scissors-paper game
Simultaneous moves
Mixed strategy Nash equilibrium
Mixed strategy = agent’s chosen probability
distribution over pure strategies from its strategy set
rock 0, 0
move of Each agent has a
agent 2 best response strategy
scissors and beliefs
1, -1
paper (consistent with each
rock other)
-1, 1
rock -1, 1 Symmetric mixed
strategy Nash eq:
move of scissors scissors Each player
0, 0 plays each pure
agent 1 paper
paper strategy with
probability 1/3
1, -1
- 100, - 100
Nuke
Kennedy
Arm
• Pure strategy Nash equilibria: (Arm,Fold), (Retract,Nuke)
Fold 10, -10
• Pure strategyKhrushchev
subgame perfect equilibria: (Arm,Fold)
• Conclusion: Kennedy’s Nuke threat was not credible
Retract -1, 1
Ultimatum game, again
Thoughts on credible threats
• Could use software as a commitment device
– If one can credibly convince others that one cannot change
one’s software agent, then revealing the agent’s code acts as
a credible commitment to one’s strategy
– E.g. nuke in the missile crisis
– E.g. accept no less than 60% as the second mover in the
ultimatum game
• Restricting one’s strategy set can increase one’s utility
– This cannot occur in single agent settings
• Social welfare can increase or decrease
Conclusions on game-theoretic
analysis tools
• Tools for building robust, nonmanipulable systems for
self-interested agents
• Different solution concepts
– For existence, use strongest equilibrium concept
– For uniqueness, use weakest equilibrium concept
Strong Nash eq
Strength
against
collusion Coalition-Proof Nash eq
Strength