Lesson 35 Game Theory and Linear Programming

This document summarizes a lesson on game theory and linear programming. It begins with announcements about homework and office hours. It then outlines topics to be covered, including recapping previous material, defining zero-sum games and strategies, and discussing how to solve games using the fundamental theorem and by formulating them as linear programming problems. It provides examples of solving the rock-paper-scissors game as a linear program. The document derives how to formulate the problem from either the row or column player's perspective as an LP to find optimal strategies.

Uploaded by

TheGreatCthulhu

Lesson 35

Game Theory and Linear Programming

Math 20

December 14, 2007

Announcements

- Pset 12 due December 17 (last day of class)
- Lecture notes and K&H on website
- Next OH Monday 12 (SC 323)

Outline
Recap
Definitions
Examples
Fundamental Theorem
Games we can solve so far
GT problems as LP problems
From the continuous to the discrete
Standardization
Rock/Paper/Scissors again
The row player's LP problem

Definition
A zero-sum game is defined by a payoff matrix $A$, where $a_{ij}$ represents the payoff to the row player if R chooses option $i$ and C chooses option $j$.

- The row player chooses from the rows of the matrix, and the column player from the columns.
- The payoff could be a negative number, representing a net gain for the column player.

Definition
A strategy for a player consists of a probability vector representing the portion of time each option is employed.

- We use a row vector $p$ for the row player's strategy, and a column vector $q$ for the column player's strategy.
- A pure strategy (select the same option every time) is represented by a standard basis vector $e_j$ or $e_j'$. For instance, if R has three choices and C has five:
  $$e_2' = \begin{pmatrix} 0 & 1 & 0 \end{pmatrix}, \qquad e_4 = \begin{pmatrix} 0 \\ 0 \\ 0 \\ 1 \\ 0 \end{pmatrix}.$$
- A non-pure strategy is called mixed.

Definition
The expected value of row and column strategies $p$ and $q$ is the scalar
$$E(p, q) = \sum_{i=1}^{m} \sum_{j=1}^{n} p_i a_{ij} q_j = pAq,$$
where $A$ is an $m \times n$ matrix.

Probabilistically, this is the amount the row player receives (or the column player, if it's negative) if players employ these strategies.
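As a small illustration of the formula, the sketch below evaluates $E(p, q) = pAq$ with exact fractions. The function name `expected_payoff` and the 2 × 3 matrix are hypothetical, chosen only for this example; they do not come from the lecture.

```python
from fractions import Fraction as F

def expected_payoff(p, A, q):
    """Return E(p, q) = sum over i, j of p[i] * A[i][j] * q[j]."""
    return sum(p[i] * A[i][j] * q[j]
               for i in range(len(p))
               for j in range(len(q)))

# Hypothetical 2x3 payoff matrix (an assumption for illustration).
A = [[2, -1, 0],
     [-3, 4, 1]]
p = [F(1, 2), F(1, 2)]            # row player's mixed strategy
q = [F(1, 4), F(1, 4), F(1, 2)]   # column player's mixed strategy

value = expected_payoff(p, A, q)
print(value)  # 1/2
```

With pure strategies the same function simply picks out one entry of the matrix: `expected_payoff([1, 0], A, [0, 1, 0])` returns `A[0][1]`.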

Rock/Paper/Scissors

Example
What is the payoff matrix for Rock/Paper/Scissors?

Solution
The payoff matrix is
$$A = \begin{pmatrix} 0 & -1 & 1 \\ 1 & 0 & -1 \\ -1 & 1 & 0 \end{pmatrix}.$$

Example
Consider a new game: players R and C each choose a number 1,
2, or 3. If they choose the same thing, C pays R that amount. If
they choose differently, R pays C the amount that C has chosen.
What is the payoff matrix?

Solution
$$A = \begin{pmatrix} 1 & -2 & -3 \\ -1 & 2 & -3 \\ -1 & -2 & 3 \end{pmatrix}$$
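The rule of this game translates directly into a formula for the entries: $a_{ij} = i$ when $i = j$ and $a_{ij} = -j$ otherwise. A minimal sketch that builds the matrix from that rule (the function name `number_game_matrix` is made up for this example):

```python
def number_game_matrix(n=3):
    """Payoff to R when R picks i and C picks j (choices numbered 1..n)."""
    return [[i if i == j else -j for j in range(1, n + 1)]
            for i in range(1, n + 1)]

A = number_game_matrix()
for row in A:
    print(row)
# [1, -2, -3]
# [-1, 2, -3]
# [-1, -2, 3]
```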

Theorem (Fundamental Theorem of Matrix Games)

There exist optimal strategies $p^*$ for R and $q^*$ for C such that for all strategies $p$ and $q$:
$$E(p^*, q) \ge E(p^*, q^*) \ge E(p, q^*)$$
$E(p^*, q^*)$ is called the value $v$ of the game.

Reflect on the inequality

$$E(p^*, q) \ge E(p^*, q^*) \ge E(p, q^*)$$
In other words,

- $E(p^*, q) \ge E(p^*, q^*)$: R can guarantee a lower bound on his/her payoff
- $E(p^*, q^*) \ge E(p, q^*)$: C can guarantee an upper bound on how much he/she loses
- This value could be negative, in which case C has the advantage

Fundamental problem of zero-sum games

- Find the $p^*$ and $q^*$!
- Last time we did these:
  - Strictly-determined games
  - $2 \times 2$ non-strictly-determined games
- The general case we'll look at next.

Pure Strategies are optimal in Strictly-Determined Games

Theorem
Let $A$ be a payoff matrix. If $a_{rs}$ is a saddle point, then $e_r'$ is an optimal strategy for R and $e_s$ is an optimal strategy for C. Also $v = E(e_r', e_s) = a_{rs}$.
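To make the theorem concrete, here is a minimal sketch that searches a payoff matrix for saddle points. It assumes the usual definition (an entry that is the smallest in its row and the largest in its column); the helper name `saddle_points` and the 3 × 3 matrix are hypothetical.

```python
def saddle_points(A):
    """Return every (r, s, value) where A[r][s] is a row minimum and a
    column maximum. Indices are 0-based."""
    points = []
    for r, row in enumerate(A):
        for s, entry in enumerate(row):
            column = [A[i][s] for i in range(len(A))]
            if entry == min(row) and entry == max(column):
                points.append((r, s, entry))
    return points

# Hypothetical strictly determined game (not from the lecture).
A = [[3, 1, 4],
     [2, 0, 1],
     [5, 2, 3]]
print(saddle_points(A))  # [(2, 1, 2)]
```

Here R's third option and C's second option are optimal pure strategies and the value is 2. Rock/Paper/Scissors has no saddle point, so the same call returns an empty list for its matrix.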

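Optimal strategies in 2 × 2 non-Strictly-Determined Games

Let $A$ be a $2 \times 2$ matrix with no saddle points. Then the optimal strategies are
$$p = \begin{pmatrix} \dfrac{a_{22} - a_{21}}{\Delta} & \dfrac{a_{11} - a_{12}}{\Delta} \end{pmatrix}, \qquad q = \begin{pmatrix} \dfrac{a_{22} - a_{12}}{\Delta} \\[2ex] \dfrac{a_{11} - a_{21}}{\Delta} \end{pmatrix},$$
where $\Delta = a_{11} + a_{22} - a_{12} - a_{21}$. Also $v = \dfrac{|A|}{\Delta}$.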
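The formulas above can be sketched in a few lines. The function name `solve_2x2` and the sample matrix are assumptions made for illustration, and the code does not check that the matrix is free of saddle points.

```python
from fractions import Fraction as F

def solve_2x2(A):
    """Optimal p, q and value v for a 2x2 game with no saddle point."""
    (a11, a12), (a21, a22) = A
    delta = a11 + a22 - a12 - a21
    p = [F(a22 - a21, delta), F(a11 - a12, delta)]   # row player's strategy
    q = [F(a22 - a12, delta), F(a11 - a21, delta)]   # column player's strategy
    v = F(a11 * a22 - a12 * a21, delta)              # |A| / delta
    return p, q, v

# Hypothetical game with no saddle point (not from the lecture).
A = [[3, -1],
     [-2, 4]]
p, q, v = solve_2x2(A)
print(p, q, v)  # p = (3/5, 2/5), q = (1/2, 1/2), v = 1
```

As a sanity check, $pA = (1, 1)$ and $Aq = (1, 1)'$ for this matrix, so neither player can improve on the value 1 by switching to a pure strategy.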

This could get a little weird

This derivation is not something that needs to be memorized, but
should be understood at least once.

Objectifying the problem

Let's think about the problem from the column player's perspective. If she chooses strategy $q$, and R knew it, he would choose $p$ to maximize the payoff $pAq$. Thus the column player wants to minimize that quantity. That is, C's objective is realized when the payoff is
$$E = \min_q \max_p pAq.$$

This seems hard! Luckily, linearity saves us.

From the continuous to the discrete

Lemma
Regardless of $q$, we have
$$\max_p pAq = \max_{1 \le i \le m} e_i' Aq.$$
Here $e_i'$ is the probability vector representing the pure strategy of going only with choice $i$.

The idea is that a weighted average of things is no bigger than the largest of them. (Think about grades.)
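The lemma can be spot-checked numerically. The sketch below fixes an arbitrary column strategy $q$ for Rock/Paper/Scissors (the particular $q$ is an assumption for illustration), computes the payoff of each pure row strategy, and confirms that randomly generated mixed strategies never do better than the best pure one. This is a check on samples, not a proof.

```python
import random
from fractions import Fraction as F

A = [[0, -1, 1],
     [1, 0, -1],
     [-1, 1, 0]]                     # Rock/Paper/Scissors payoff matrix
q = [F(1, 2), F(1, 4), F(1, 4)]      # arbitrary column strategy (assumed)

# Entry i of Aq is e_i' A q, the payoff of pure row strategy i against q.
Aq = [sum(a * qj for a, qj in zip(row, q)) for row in A]
best_pure = max(Aq)

random.seed(0)
for _trial in range(1000):
    weights = [random.randint(0, 10) for _row in A]
    if sum(weights) == 0:
        continue
    p = [F(w, sum(weights)) for w in weights]        # random mixed strategy
    mixed_payoff = sum(pi * ai for pi, ai in zip(p, Aq))
    assert mixed_payoff <= best_pure                 # weighted average <= max

print(best_pure)  # 1/4
```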

Proof of the lemma

Proof.
We must have
$$\max_p pAq \ge \max_{1 \le i \le m} e_i' Aq$$
(the maximum over a larger set must be at least as big). On the other hand, let $q$ be C's strategy. Let the quantity on the right be maximized when $i = i_0$. Let $p$ be any strategy for R. Notice that $p = \sum_i p_i e_i'$. So
$$E(p, q) = pAq = \sum_{i=1}^{m} p_i e_i' Aq \le \sum_{i=1}^{m} p_i e_{i_0}' Aq = \left( \sum_{i=1}^{m} p_i \right) e_{i_0}' Aq = e_{i_0}' Aq.$$
Thus
$$\max_p pAq \le e_{i_0}' Aq.$$

The next step is to introduce a new variable $v$ representing the value of this inner maximization. Our objective is to minimize it. Saying it's the maximum of all payoffs from pure strategies is the same as saying
$$v \ge e_i' Aq$$
for all $i$. So we finally have something that looks like an LP problem! We want to choose $q$ and $v$ which minimize $v$ subject to the constraints
$$\begin{aligned}
v &\ge e_i' Aq, & i &= 1, 2, \ldots, m \\
q_j &\ge 0, & j &= 1, 2, \ldots, n \\
\sum_{j=1}^{n} q_j &= 1
\end{aligned}$$

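Trouble with this formulation

- Simplex method with equalities?
- Not in standard form

Resolution:

- We may assume all $a_{ij} > 0$ (adding a constant to every entry shifts the value of the game by that constant but leaves the optimal strategies unchanged), so $v > 0$.
- Let $x_j = \dfrac{q_j}{v}$.

Since we know $v > 0$, we still have $x \ge 0$. Now
$$\sum_{j=1}^{n} x_j = \frac{1}{v} \sum_{j=1}^{n} q_j = \frac{1}{v}.$$
So our problem is now to choose $x \ge 0$ which maximizes $\sum_j x_j$.

The constraints now take the form
$$v \ge e_i' Aq \iff 1 \ge e_i' Ax$$
for all $i$. Another way to write this is
$$Ax \le \mathbf{1},$$
where $\mathbf{1}$ is the vector consisting of all ones.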
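The change of variables can be traced on a concrete case. The sketch below takes the Rock/Paper/Scissors matrix with 2 added to every entry (so all entries are positive) and an arbitrary, not necessarily optimal, column strategy $q$ chosen for illustration. It computes the smallest feasible $v$ for that $q$, rescales to $x = q / v$, and confirms the two facts used in the standardization.

```python
from fractions import Fraction as F

A = [[2, 1, 3],
     [3, 2, 1],
     [1, 3, 2]]                     # all entries positive
q = [F(1, 2), F(1, 4), F(1, 4)]     # arbitrary column strategy (assumed)

def mat_vec(M, x):
    """Matrix-vector product M x."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in M]

v = max(mat_vec(A, q))              # smallest v with v >= e_i' A q for all i
x = [qj / v for qj in q]            # rescaled variables x_j = q_j / v
Ax = mat_vec(A, x)

print(v)        # 9/4
print(sum(x))   # 4/9, which equals 1/v
print(Ax)       # every entry is at most 1
```

A smaller $v$ corresponds to a larger $x_1 + \cdots + x_n$, which is why minimizing $v$ becomes maximizing the sum of the $x_j$.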

Upshot

Theorem
Consider a game with payoff matrix $A$, where each entry of $A$ is positive. The column player's optimal strategy $q$ is
$$\frac{x}{x_1 + \cdots + x_n},$$
where $x \ge 0$ solves the LP problem of maximizing $x_1 + \cdots + x_n$ subject to the constraints $Ax \le \mathbf{1}$.

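Rock/Paper/Scissors

The payoff matrix is
$$A = \begin{pmatrix} 0 & -1 & 1 \\ 1 & 0 & -1 \\ -1 & 1 & 0 \end{pmatrix}.$$
We can add 2 to everything to make
$$\tilde{A} = \begin{pmatrix} 2 & 1 & 3 \\ 3 & 2 & 1 \\ 1 & 3 & 2 \end{pmatrix}.$$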

Convert to LP
The problem is to maximize $x_1 + x_2 + x_3$ subject to the constraints
$$\begin{aligned}
2x_1 + x_2 + 3x_3 &\le 1 \\
3x_1 + 2x_2 + x_3 &\le 1 \\
x_1 + 3x_2 + 2x_3 &\le 1.
\end{aligned}$$
We introduce slack variables $y_1$, $y_2$, and $y_3$, so the constraints now become
$$\begin{aligned}
2x_1 + x_2 + 3x_3 + y_1 &= 1 \\
3x_1 + 2x_2 + x_3 + y_2 &= 1 \\
x_1 + 3x_2 + 2x_3 + y_3 &= 1.
\end{aligned}$$

An easy initial basic solution is to let $x = 0$ and $y = \mathbf{1}$. The initial tableau is therefore

        x1     x2     x3     y1     y2     y3     z     value
y1       2      1      3      1      0      0     0       1
y2       3      2      1      0      1      0     0       1
y3       1      3      2      0      0      1     0       1
z       -1     -1     -1      0      0      0     1       0

Which should be the entering variable? The coefficients in the bottom row are all the same, so let's just pick one, $x_1$. To find the departing variable, we look at the ratios $\frac{1}{2}$, $\frac{1}{3}$, and $\frac{1}{1}$. The smallest is $\frac{1}{3}$, so $y_2$ is the departing variable.
We scale row 2 by $\frac{1}{3}$:

        x1     x2     x3     y1     y2     y3     z     value
y1       2      1      3      1      0      0     0       1
y2       1     2/3    1/3     0     1/3     0     0      1/3
y3       1      3      2      0      0      1     0       1
z       -1     -1     -1      0      0      0     1       0

Then we use row operations to zero out the rest of column one:

        x1     x2     x3     y1     y2     y3     z     value
y1       0    -1/3    7/3     1    -2/3     0     0      1/3
x1       1     2/3    1/3     0     1/3     0     0      1/3
y3       0     7/3    5/3     0    -1/3     1     0      2/3
z        0    -1/3   -2/3     0     1/3     0     1      1/3
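The scaling and row operations of this pivot can be reproduced with exact fractions. The helper name `pivot` is hypothetical; the tableau entries are the ones from the initial tableau, with columns in the order x1, x2, x3, y1, y2, y3, z, value.

```python
from fractions import Fraction as F

def pivot(T, r, c):
    """Return a copy of tableau T in which column c has a 1 in row r
    and 0 in every other row."""
    T = [row[:] for row in T]
    pivot_entry = T[r][c]
    T[r] = [entry / pivot_entry for entry in T[r]]    # scale the pivot row
    for i in range(len(T)):
        if i != r:
            factor = T[i][c]
            T[i] = [a - factor * b for a, b in zip(T[i], T[r])]
    return T

# Initial tableau: rows y1, y2, y3, z.
T0 = [[F(n) for n in row] for row in [
    [2, 1, 3, 1, 0, 0, 0, 1],
    [3, 2, 1, 0, 1, 0, 0, 1],
    [1, 3, 2, 0, 0, 1, 0, 1],
    [-1, -1, -1, 0, 0, 0, 1, 0],
]]

# x1 (column 0) enters, y2 (row 1) departs.
T1 = pivot(T0, r=1, c=0)
for row in T1:
    print([str(entry) for entry in row])
```

The printed rows agree with the tableau after the first pivot, for example the objective row becomes 0, -1/3, -2/3, 0, 1/3, 0, 1 with value 1/3.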

We can still improve this: $x_3$ is the entering variable and $y_1$ is the departing variable. The new tableau is

        x1     x2     x3     y1     y2     y3     z     value
x3       0    -1/7     1     3/7   -2/7     0     0      1/7
x1       1     5/7     0    -1/7    3/7     0     0      2/7
y3       0    18/7     0    -5/7    1/7     1     0      3/7
z        0    -3/7     0     2/7    1/7     0     1      3/7

Finally, entering $x_2$ and departing $y_3$ gives

        x1     x2     x3     y1      y2      y3     z     value
x3       0      0      1     7/18   -5/18    1/18   0      1/6
x1       1      0      0     1/18    7/18   -5/18   0      1/6
x2       0      1      0    -5/18    1/18    7/18   0      1/6
z        0      0      0     1/6     1/6     1/6    1      1/2
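So the $x$ variables have values $x_1 = 1/6$, $x_2 = 1/6$, $x_3 = 1/6$.

Furthermore $z = x_1 + x_2 + x_3 = 1/2$, so $v = 1/z = 2$ for the shifted matrix (the original game, before 2 was added to every entry, has value $2 - 2 = 0$). Since $q_j = v x_j$, this also means that $q_1 = 1/3$, $q_2 = 1/3$, and $q_3 = 1/3$. So the optimal strategy is to do each thing the same number of times.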
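The whole computation can be automated. The following is a minimal sketch of a tableau simplex loop for this specific LP (maximize $x_1 + \cdots + x_n$ subject to $Ax \le \mathbf{1}$, $x \ge 0$), assuming every entry of the matrix is positive so the problem is bounded. The function name `solve_column_lp` is made up, the $z$ column is omitted because it never changes, and the entering variable is the most negative objective entry (first one on ties). It is an illustration, not a general-purpose solver.

```python
from fractions import Fraction as F

def solve_column_lp(A):
    """Maximize x1+...+xn subject to Ax <= 1, x >= 0 (entries of A positive).
    Returns (x, z) with z the optimal objective value."""
    m, n = len(A), len(A[0])
    # Constraint rows: coefficients of x, identity block for slacks, value 1.
    T = [[F(a) for a in A[i]] + [F(int(i == k)) for k in range(m)] + [F(1)]
         for i in range(m)]
    # Objective row: -1 under each x, 0 elsewhere.
    T.append([F(-1)] * n + [F(0)] * m + [F(0)])
    basis = [n + i for i in range(m)]          # slacks start in the basis
    while True:
        objective = T[-1][:-1]
        c = min(range(n + m), key=lambda j: objective[j])
        if objective[c] >= 0:
            break                               # no improving column left
        candidates = [i for i in range(m) if T[i][c] > 0]
        r = min(candidates, key=lambda i: T[i][-1] / T[i][c])   # ratio test
        pivot_entry = T[r][c]
        T[r] = [entry / pivot_entry for entry in T[r]]
        for i in range(m + 1):
            if i != r:
                factor = T[i][c]
                T[i] = [a - factor * b for a, b in zip(T[i], T[r])]
        basis[r] = c
    x = [F(0)] * n
    for i, b in enumerate(basis):
        if b < n:
            x[b] = T[i][-1]
    return x, T[-1][-1]

A_shifted = [[2, 1, 3], [3, 2, 1], [1, 3, 2]]
x, z = solve_column_lp(A_shifted)
v = 1 / z                      # value of the shifted game
q = [xj * v for xj in x]       # column player's strategy
print(x, z, v, q)              # x = (1/6, 1/6, 1/6), z = 1/2, v = 2, q = (1/3, 1/3, 1/3)
```

On this matrix the loop makes the same three pivots as the hand computation: $x_1$ for $y_2$, then $x_3$ for $y_1$, then $x_2$ for $y_3$.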

Now let's think about the problem from the row player's perspective. If he chooses strategy $p$, and C knew it, she would choose $q$ to minimize the payoff $pAq$. Thus the row player wants to maximize that quantity. That is, R's objective is realized when the payoff is
$$E = \max_p \min_q pAq.$$

Lemma
Regardless of $p$, we have
$$\min_q pAq = \min_{1 \le j \le n} pAe_j.$$

The next step is to introduce a new variable $v$ representing the value of this inner minimization. Our objective is to maximize it. Saying it's the minimum of all payoffs from pure strategies is the same as saying
$$v \le pAe_j$$
for all $j$. Again, we have something that looks like an LP problem! We want to choose $p$ and $v$ which maximize $v$ subject to the constraints
$$\begin{aligned}
v &\le pAe_j, & j &= 1, 2, \ldots, n \\
p_i &\ge 0, & i &= 1, 2, \ldots, m \\
\sum_{i=1}^{m} p_i &= 1
\end{aligned}$$

As before, we can standardize this by renaming
$$y = \frac{1}{v} p'$$
(this makes $y$ a column vector). Then
$$\sum_{i=1}^{m} y_i = \frac{1}{v},$$
so maximizing $v$ is the same as minimizing $\mathbf{1}'y$. Likewise, the equations of constraint become $v \le (v y') A e_j$ for all $j$, or $y'A \ge \mathbf{1}'$, or (taking transposes) $A'y \ge \mathbf{1}$. If all the entries of $A$ are positive, we may assume that $v$ is positive, so the constraints $p \ge 0$ are satisfied if and only if $y \ge 0$.

Upshot

Theorem
Consider a game with payoff matrix $A$, where each entry of $A$ is positive. The row player's optimal strategy $p$ is
$$\frac{y'}{y_1 + \cdots + y_m},$$
where $y \ge 0$ solves the LP problem of minimizing $y_1 + \cdots + y_m = \mathbf{1}'y$ subject to the constraints $A'y \ge \mathbf{1}$.

The big idea

The big observation is this:

Theorem
The row player's LP problem is the dual of the column player's LP problem.
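Written side by side, the two problems from the "Upshot" theorems show the pattern of a standard primal and dual pair: the same matrix appears transposed, the inequalities are reversed, and maximization becomes minimization. The layout below is only a restatement of those two theorems, assuming all entries of $A$ are positive.

```latex
\begin{aligned}
\text{(column player)} \quad & \max\ \mathbf{1}'x
  && \text{subject to } Ax \le \mathbf{1},\ x \ge 0, \\
\text{(row player)} \quad & \min\ \mathbf{1}'y
  && \text{subject to } A'y \ge \mathbf{1},\ y \ge 0.
\end{aligned}
```

Both optimal objective values equal $1/v$, which is the LP counterpart of the statement that the two players agree on the value of the game.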

The final tableau in the Rock/Paper/Scissors LP problem was this:

        x1     x2     x3     y1      y2      y3     z     value
x3       0      0      1     7/18   -5/18    1/18   0      1/6
x1       1      0      0     1/18    7/18   -5/18   0      1/6
x2       0      1      0    -5/18    1/18    7/18   0      1/6
z        0      0      0     1/6     1/6     1/6    1      1/2

The entries in the objective row below the slack variables are the solutions to the dual problem! In this case, we have the same values, which means R has the same strategy as C. This reflects the symmetry of the original game.
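The read-off can be verified directly. The sketch below takes the objective-row entries under $y_1$, $y_2$, $y_3$ as a candidate dual solution, checks the row player's constraints $A'y \ge \mathbf{1}$ for the shifted matrix, and recovers the row player's strategy. Variable names are chosen for this example only.

```python
from fractions import Fraction as F

A_shifted = [[2, 1, 3],
             [3, 2, 1],
             [1, 3, 2]]
# Objective-row entries under y1, y2, y3 in the final tableau.
y = [F(1, 6), F(1, 6), F(1, 6)]

# Entry j of A'y is the sum over i of a_ij * y_i.
At_y = [sum(A_shifted[i][j] * y[i] for i in range(3)) for j in range(3)]
total = sum(y)                  # dual objective value, equal to 1/v
p = [yi / total for yi in y]    # row player's strategy y' / (y1 + y2 + y3)
v = 1 / total

print(At_y)   # each entry equals 1, so every constraint holds with equality
print(p, v)   # p = (1/3, 1/3, 1/3), v = 2
```

The dual objective value 1/2 matches the primal value $z = 1/2$, as duality requires.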

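Example
Consider the game: players R and C each choose a number 1, 2, or 3. If they choose the same thing, C pays R that amount. If they choose differently, R pays C the amount that C has chosen. What should each do?

Answer.

Choice       R         C
1          22.7%     54.5%
2          36.4%     27.3%
3          40.9%     18.2%

These are $p = (5/22,\ 4/11,\ 9/22)$ and $q = (6/11,\ 3/11,\ 2/11)'$. Against either strategy, every pure reply by the opponent gives R a payoff of $-6/11$, so the expected payoff is $6/11 \approx 0.55$ to the column player.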
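A proposed answer to a game like this can be checked without rerunning the simplex method. For any strategies, R's guaranteed payoff $\min_j (pA)_j$ is at most the value of the game, which is at most C's guaranteed ceiling $\max_i (Aq)_i$, so if the two numbers coincide, both strategies are optimal. The sketch below applies this test to the candidate strategies $p = (5/22, 4/11, 9/22)$ and $q = (6/11, 3/11, 2/11)$; the function names are hypothetical.

```python
from fractions import Fraction as F

# Payoff to R: +i if both choose i, otherwise -j where j is C's choice.
A = [[1, -2, -3],
     [-1, 2, -3],
     [-1, -2, 3]]

def row_guarantee(p, A):
    """Least expected payoff to R when R plays p: min over j of (pA)_j."""
    return min(sum(p[i] * A[i][j] for i in range(len(A)))
               for j in range(len(A[0])))

def column_guarantee(q, A):
    """Most C can lose on average when C plays q: max over i of (Aq)_i."""
    return max(sum(A[i][j] * q[j] for j in range(len(q)))
               for i in range(len(A)))

p = [F(5, 22), F(4, 11), F(9, 22)]   # candidate strategy for R
q = [F(6, 11), F(3, 11), F(2, 11)]   # candidate strategy for C

print(row_guarantee(p, A))      # -6/11
print(column_guarantee(q, A))   # -6/11
```

Because the two guarantees agree, the value of the game is $-6/11$: on average R pays C about 0.55 per round.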
