Equilibrium in a Stochastic n-Person Game: 1 Ή the game is eJ (Γ) - This choice

This document summarizes a research paper on equilibrium in stochastic n-person games. It contains the following key points: - The paper considers a game with a finite set of states where the next state depends stochastically on the current state and players' strategy choices. Players choose strategies to minimize their expected long-term costs, which are discounted over time. - It defines the concept of an equilibrium strategy set where no player can improve their cost by changing their individual strategy. - The main results are that an equilibrium strategy set exists and is unique, proven using properties of contraction mappings on the strategy space.

Uploaded by

Srinivasan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

126 views6 pages

Equilibrium in a Stochastic n-Person Game: 1 Ή the game is eJ (Γ) - This choice

Uploaded by

Srinivasan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

J . SGI. HIROSHIMA UNI V. SER.

A I
28 (1964), 8993
Equilibrium in a Stochastic n Person Game
A. M. Fink
(Received November 20, 1963)
Heuristically, a stochastic game is described by a sequence of states
which are determined stochastically. The stochastic element arises from a
set of transition probability measures. The determination of the particular
transition probability measure to be used at a move of the game is controlled
in part by each of the n players and it is this determination scheme which
gives rise to the strategies.
One might consider the following economics problem. We have n firms
competing for a market. Each will make a strategy decision periodically.
During each period the n firms play an ?zperson game. Across the infinite
horizon then we have a sequence of games. The economic situation behaves
in such a way that the game played in a given period depends on the game
played in the previous period and the strategies used in this period. The
dependence is not deterministic but is stochastic. A player's strategy will
reflect a concern both for the game now being played and for the situation
that will be probably confronted in the next period. The relative strength
of these two concerns will be in a geometric ratio, the socalled discounting
over the infinite horizon. We shall call the outcome of each game a cost to
each player. Negative cost is thus a gain.
More precisely we are considering a game which is described by a finite
set of states / . A play is a sequence of states {/}=
0
> &'
6
1 the game is
in the state /, each player h may choose an alternative j
h
eJ
h
( ). This choice
is made with knowledge of the state . Once each player has made his choice,
the game proceeds to the state fe with probability PH^.J^ Here F

y
1
...
;w
^>0
and ^Pij
r
..j
n
k = 1. The cost to player h of being in state and having the
vector ; = (/ , , /

) chosen is C
h j
. Each player furthermore chooses to
discount his projected cost by a factor

, where 0 <

< l . Thus if a
sequence of states {
n
} and alternative choices {]
n
} have been made, the cost
to player h is given by
(1)
The analysis of this game is simplified by a slight change in the outlook
and notation. Let g(h, ) be the cost to player h given that the game started
at the state . Then equation (1) can be rewritten
90 A. M. FINK
(2) g(h, o) = C
hio
j
Q
+ a
h
g(h, ).
Since is not strictly determined, we are interested in the expected value of
g(k, ). If the selection of the vector ;
0
is a function of the state &
5
i.e. a pure
strategy, one would get for every such strategy a relation
Of course, we may expect that the players will not use pure strategies.
Mixed strategies thus must be introduced. For every state of the game,
player h will give a probability distribution x
h
() on the alternative set J
h
(\
i.e. a; (0 = (&(0> > *m(0) where Xj()~>0, * j() = l and m is the cardinality
3
of J
h
(). If such a set of probability distributions x is given for every alter
native set, then letting e
hi
(x)=E\^g(h, OJ in (3) we have
(4) e
hi
(x) =
J w &
= 1 , , 7Z .
LEMMA 1. F or even/ ;= {^
m
(Ol^
w
G) is probability vector}, there exist
numbers e
hi
(x) satisfying relation (4). This set furthermore is unique.
Proof: The relation (4) for fixed h is a linear system whose coefficient
matrix A has the following properties.
a
k
= * *7, . ( )P, *, ( fc).
J m
By Hadamard's theorem all eigenvalues of A have absolute value at least
| f* I = l tf*> 0. Thus zero is not an eigenvalue of A and the linear
k+i
system has a unique solution.
Now each player h will use a strategy that tends to minimize his expected
cost. That is, if v
hi
mine
z
(X), then from (4) we get
(5) V
M
= min ^COCC o H tf

l]P
o
^], i = 1, ..., .
x
h
U~) ~J m k
h = 1, , n.
We will say that the vector x is an equilibrium point if and only if no player
can improve his cost by changing his strategy, i.e., if relation (5) is considered
Equilibrium in a Stochastic ^Person Game 91
for every player h and every state , the minimization of the right hand side
leads back to the same vector x. We will show t hat such equilibrium points
do in fact exist.
Let X^ {(VOL), , ^( ) , *
2
(1), ..., J( \ ..., *(!), ..., *"(<r)); **(0 is a
probability distribution on / * ()} , with euclidean metric and
R={(vn
9
, t7
n
); Vij real}, d(u, )=max|M/ y ;/y|. X is compact and R is
*./
complete. In the proofs t hat follow one notes t hat one can confine the
discussion to any closed convex subset of X and results are not altered. Thus
we simultaneously establish the existence of equilibrium points in constrained
games.
To ease the notational bulk we introduce the vector function / . Define
(6) / (*, y, I,)* =
J m=f=h k
where x e X and y\) is a probability vector. Thus (5) can be written
(7) v
hi
= mi n / 0, y, )
M
.
y
h
(.D
We note t hat f(x
9
y, v) has t he following properties :
(a) f(x, y, v) is continuous
(b) f(x, y, v)
hj
f(x, y, u)
hj
< a max \V
M
U
M
\, a = max a
h
k,l h
(c) f(x, y, v) is linear in y.
Let x 6 X and define a mapping T
x
of R into itself by the equation
(8) ( ,fO

y = min/ (*,
y
, fO

y
y
h
D
THEOREM 1 : For every x e X, T
x
is a contraction mapping of R.
Proof : Let u and be two elements of R and let x be a fixed element of
X. Let T
x
u=f(x,y,u) and T
x
v = f(x, z, )
9
then T
x
u<f(x,z,u) and 7 > <
/ (*,y,O Thus
(T
x
u)
h
j (T
x
v)
hj
<f(x, z, u)
hj
f( , z, v)
h
j<ad(v, u) by (b) and
(T
x
v)
hj
(T
x
u)
h
j<f(x, y, v)
h
j f(x, y, u)
h
j<ad(v, u).
Hence
x
u, T
x
v) = max | (T
x
v)
hj
(T
x
u)
hj
\ <ad(v, u).
Corollary 1. For every % e X, T
x
has a unique fixed point v.
Corollary 2. The set {T
x
\xtX} is equicontinuous.
Let x 6 X and define mappings and /? by
92 A. M. FINK
(9) 0*0 = {v I = min f(x, y, v)
y
(10) ( )
We note t hat (x) is a single valued function by Corollary 1. By (c) it is
clear t hat (x) is convex and closed for every xtX.
LEMMA 2. The range of (x) is bounded.
Proof. By Theorem 1 the sequence v
0
= Q, v
n+
= T
x
v
n
converges to (x).
Furthermore, we have d(v
m9
v
m
)<ad(v
m
_, v
m
2)< <oc
m
'
l
d(vi, VQ) so that
d(v
n9
*^d(v
l9
vo). Thus d( ( \ 0) < d(T
x
0
9
0) ^

1
max | ( , 0)
|
| <
J. OC 1 O J. O h,i
max|C
/
v| . Hence (x) is bounded as x takes on all values in X.
1 O hi j
Define S
v
(x) T
x
v. S
v
is a mapping of X into R.
LEMMAS. S
v
is continuous on X. Furthermore, {S
v
\v is bounded} is
equicontinuous.
Proof: Let S

(x) = T
x
v = f(x, 7, v)<Lf(x, 2, v) and S
v
(x')=T
x
,v = f(x', z, }<
f(x
f
, y, v}', then S
v
(x'} S
v
(x)<f(oc', y, ) f(x, y, v) and S
v
(x) S
v
(x'^f(x, z, v)
f(x
f
, z
9
v).
If v is restrained to be in a bounded region, then the right hand sides
can be made uniformly small because of the uniform continuity of / on
compact sets.
LEMMA 4. / / x
n
+x and (x^~^v^ then (x) = v
Q
.
Proof: d(
9
T
x
vo)<d(v
0
, (x
n
}} + d( (x
n
\ T
x
(x
n
}} + d(T
x
(x
n
\ T
x
v
0
) =
d(v
Q9
( n)) + d(S
(Xn}
(x
n
\ S
(Xn)
(x)) + d(T
x
(x
n
\ T
x
v
0
) *Q as ?i>oo because (x
n
^v^
and { (
n
)} is bounded by Lemma 2 so that Lemma 3 applies to the second
term.
LEMMAS. / / x
n
^x, j
n
~^j and y
n
(x
n
\ then y e (x).
Proof: By taking subsequences we can consider # GO >t?
0
. By Lemma 4
(x) = v
0
. Now d(f(x
9
y, v
0
\ vo)<d(f(x, y, V
Q
\ f(x
n
, y
n
> /?(*))) + <* (/ (*, Jn, (*n)\
VQ) = d(f(x
9
y, V
Q
\ f(x
n9
y
n9
/?GO)) + d( (x
n
\ VQ) + 0 as n > oo . Thus v
0
=f(x
9
y, V
Q
\
and by Lemma 4, v
0
is the fixed point so t hat f(x, y, VQ) =V
Q
= min/ (# , 2, V
Q
\
thus y (x).
THEOREM 2. There exist x 6 X, v e R such that v = f(x
9
x, v) = mmf(x, , v).
Equilibrium in a Stochastic rc Person Game 93
i.e. x 6 (x).
Proof: This is a consequence of the Kakutani fixed point theorem, i.e.
the mapping takes points into convex closed sets, by Lemma 5 the set
\J (x) is closed, hence sequentially compact and by Lemma 5 it is an upper
X
semi continuous set function.
I t is also interesting to note t hat /?(#) is continuous (Lemmas 2 and 4)
and thus its range is compact and connected. In case the set of states / is
denumerable, one replaces R by the space of bounded sequences and requires
t hat I Chj i < M for some M, and the results are still valid for X the appropriate
cartesian product of probability spaces. One also notes t hat Theorem 1 and
its corollaries are valid if the cardinality of J
h
() is arbitrary and min is
replaced by inf. In this case the techniques of this paper yield effective
strategies.
If n = 2 the results of this paper are those given by Shapely [2 \ and for
n = l the problem is a well known dynamic program problem. See Takahashi
[ J] for a different generalization of Shapely's results.
References
1. S. Kakut ani, A Generalisation of Brouwefs Fixed Point Theorem, Duke Math. Jour n al, vol. 8 (1941),
pp. 457 459.
2. L. S. Shapely, Stochastic games, Proc. Nat. Acad. Sc., vol. 39 (1953), pp. 10951100.
3. M. Takahashi, Stochastic Games with Infinitely Many Strategies, Jour n al of Science of the Hiroshima
University, Series A I, vol. 26 (1963), pp. 123134.
University of Nebraska

An Introduction To Stochastic Control
No ratings yet
An Introduction To Stochastic Control
134 pages
John Nash - Non-Cooperative Games
No ratings yet
John Nash - Non-Cooperative Games
31 pages
2014 - Lectures Notes On Game Theory - WIlliam H Sandholm
No ratings yet
2014 - Lectures Notes On Game Theory - WIlliam H Sandholm
167 pages
게임이론 강의
100% (1)
게임이론 강의
88 pages
Continuous-Time Limit of Dynam
No ratings yet
Continuous-Time Limit of Dynam
33 pages
Nash NonCooperativeGames 1951
No ratings yet
Nash NonCooperativeGames 1951
11 pages
Complexity Results For Some Classes of Strategy Games Fischer - Felix
No ratings yet
Complexity Results For Some Classes of Strategy Games Fischer - Felix
177 pages
Nash - PHD - Thesis - Original Version - Latex
No ratings yet
Nash - PHD - Thesis - Original Version - Latex
32 pages
MIT6 254S10 Lec05
No ratings yet
MIT6 254S10 Lec05
28 pages
Health Economics 3
No ratings yet
Health Economics 3
40 pages
Ergodic Properties of Markov Processes
No ratings yet
Ergodic Properties of Markov Processes
39 pages
Two Different Approaches To Nonzero-Sum Stochastic Differential Games
No ratings yet
Two Different Approaches To Nonzero-Sum Stochastic Differential Games
14 pages
DMW Theorem
No ratings yet
DMW Theorem
13 pages
Detailed Lesson Plan 2
100% (1)
Detailed Lesson Plan 2
7 pages
Fujimoto T Karunathilake N Ranade R TM2124 Bimatrix Games Have A Quasi-Strict Equilibrium 2018 Published Version
No ratings yet
Fujimoto T Karunathilake N Ranade R TM2124 Bimatrix Games Have A Quasi-Strict Equilibrium 2018 Published Version
12 pages
Mechanical Toy
100% (1)
Mechanical Toy
10 pages
Lapidot 1983
No ratings yet
Lapidot 1983
8 pages
EC744 Lecture Note 7 Stochastic Dynamic Programming: Prof. Jianjun Miao
No ratings yet
EC744 Lecture Note 7 Stochastic Dynamic Programming: Prof. Jianjun Miao
24 pages
Stochastic Approximations and Differential Inclusions, Part II: Applications
No ratings yet
Stochastic Approximations and Differential Inclusions, Part II: Applications
23 pages
Stochastic Games with Infinitely Many Strategies: N, k, Γ m n k, ί-th j-pV >0 I pyt. g (k. k
No ratings yet
Stochastic Games with Infinitely Many Strategies: N, k, Γ m n k, ί-th j-pV >0 I pyt. g (k. k
12 pages
On The Constrained Equilibrium Problems With "Nite Families of Players
No ratings yet
On The Constrained Equilibrium Problems With "Nite Families of Players
19 pages
L.STETTNER (Warszawa) : Applicationes Mathematicae 22,1 (1993), Pp. 25-38
No ratings yet
L.STETTNER (Warszawa) : Applicationes Mathematicae 22,1 (1993), Pp. 25-38
14 pages
1951 Nash
No ratings yet
1951 Nash
10 pages
Game 02
No ratings yet
Game 02
8 pages
Fitjee 2015
0% (2)
Fitjee 2015
32 pages
ECON4211 PS1 Sol 2024F
No ratings yet
ECON4211 PS1 Sol 2024F
7 pages
336 Lecture4 2007
No ratings yet
336 Lecture4 2007
5 pages
8 Extremal Principle Nirjhar Nath Ganit Bikash
No ratings yet
8 Extremal Principle Nirjhar Nath Ganit Bikash
11 pages
Nash Proof
No ratings yet
Nash Proof
6 pages
Lecture VI: Existence of Nash Equilibrium
No ratings yet
Lecture VI: Existence of Nash Equilibrium
12 pages
ArXiv 9806022
No ratings yet
ArXiv 9806022
15 pages
Minimax Theorem and Nash Equilibrium
No ratings yet
Minimax Theorem and Nash Equilibrium
5 pages
Imo 2012
No ratings yet
Imo 2012
11 pages
Book Reviews
No ratings yet
Book Reviews
7 pages
ErgodicTheory LecNotes PDF
No ratings yet
ErgodicTheory LecNotes PDF
20 pages
Kakutani's Fixed Point Theorem: A New Proof: Economics 8103 Microeconomic Theory Spring 2005
No ratings yet
Kakutani's Fixed Point Theorem: A New Proof: Economics 8103 Microeconomic Theory Spring 2005
10 pages
Solutions To The 84th William Lowell Putnam Mathematical Competition Saturday, December 2, 2023
No ratings yet
Solutions To The 84th William Lowell Putnam Mathematical Competition Saturday, December 2, 2023
5 pages
Meszaros A. R. Math 167 - Mathematical Game Theory (2017, Midterm 2, March 3)
No ratings yet
Meszaros A. R. Math 167 - Mathematical Game Theory (2017, Midterm 2, March 3)
4 pages
Putnam 1997 (Problems and Solutions)
No ratings yet
Putnam 1997 (Problems and Solutions)
18 pages
Lecture VI: Existence of Nash Equilibrium
No ratings yet
Lecture VI: Existence of Nash Equilibrium
12 pages
Alibaba 2024
No ratings yet
Alibaba 2024
6 pages
Two-Person Cooperative Games (NASH) PDF
No ratings yet
Two-Person Cooperative Games (NASH) PDF
13 pages
EC744 Lecture Note 9 Convergence of Markov Processes: Prof. Jianjun Miao
No ratings yet
EC744 Lecture Note 9 Convergence of Markov Processes: Prof. Jianjun Miao
22 pages
Nonlinear Programming Exam Note 1
No ratings yet
Nonlinear Programming Exam Note 1
3 pages
PRE CALCULUS 2ndQ SLM
No ratings yet
PRE CALCULUS 2ndQ SLM
45 pages
Teaching Notes Updated 24 25
No ratings yet
Teaching Notes Updated 24 25
2 pages
n dP d (P +Q) P f f +1 dP dQ Σ 3 1 2 n f f +1 n g 1−g g 1−g 1
No ratings yet
n dP d (P +Q) P f f +1 dP dQ Σ 3 1 2 n f f +1 n g 1−g g 1−g 1
2 pages
Linear Differential Games of Pursuit With Integral Block of Control in Its Dynamics
No ratings yet
Linear Differential Games of Pursuit With Integral Block of Control in Its Dynamics
4 pages
Notes On Non-Cooperative Game Theory Econ 8103, Spring 2009, Aldo Rustichini
No ratings yet
Notes On Non-Cooperative Game Theory Econ 8103, Spring 2009, Aldo Rustichini
30 pages
EE364a Homework 3 Solutions: 0 N 0 1 N N 1 1 N N 0 0
No ratings yet
EE364a Homework 3 Solutions: 0 N 0 1 N N 1 1 N N 0 0
19 pages
Tutorial Ocaml
100% (1)
Tutorial Ocaml
22 pages
1control de Lect.l1
No ratings yet
1control de Lect.l1
11 pages
Course in Game Theory
No ratings yet
Course in Game Theory
17 pages
McKelvey (1976) - Intransitivities in Multidimensional Voting Models and Some Implications For Agenda Control
No ratings yet
McKelvey (1976) - Intransitivities in Multidimensional Voting Models and Some Implications For Agenda Control
11 pages
Math 8 2019-2020 (3rd Quarter)
No ratings yet
Math 8 2019-2020 (3rd Quarter)
9 pages
Exponential Dichotomy: Zhisheng Shuai Department of Mathematical and Statistical Sciences University of Alberta
No ratings yet
Exponential Dichotomy: Zhisheng Shuai Department of Mathematical and Statistical Sciences University of Alberta
7 pages
OR1
No ratings yet
OR1
135 pages
Review of Basic Concepts: Normal Form: 14.126 Game Theory Muhamet Yildiz
No ratings yet
Review of Basic Concepts: Normal Form: 14.126 Game Theory Muhamet Yildiz
11 pages
Mathematics Pedagogy in Hindi
No ratings yet
Mathematics Pedagogy in Hindi
13 pages
נוסחאות ואי שיוויונים
No ratings yet
נוסחאות ואי שיוויונים
12 pages
EC744 Lecture Note 3 Dynamic Programming Under Certainty: Prof. Jianjun Miao
No ratings yet
EC744 Lecture Note 3 Dynamic Programming Under Certainty: Prof. Jianjun Miao
17 pages
IMC 2009 Problems With Solutions (Day 1)
No ratings yet
IMC 2009 Problems With Solutions (Day 1)
4 pages
Math 101 - Revise - Syllabus
No ratings yet
Math 101 - Revise - Syllabus
10 pages
Class Xii CS Practical File 2
No ratings yet
Class Xii CS Practical File 2
63 pages
Descartes Fermat Analytic Geometry
No ratings yet
Descartes Fermat Analytic Geometry
58 pages
Zainab Shehzadi
No ratings yet
Zainab Shehzadi
87 pages
AUGUST2015
No ratings yet
AUGUST2015
2 pages
Lecture#03,4
No ratings yet
Lecture#03,4
27 pages
Math 10 - M7 - Q4
No ratings yet
Math 10 - M7 - Q4
4 pages
SQP 8 MATHS 12 - Solution
No ratings yet
SQP 8 MATHS 12 - Solution
25 pages
Lab 08: Fourier Transform
No ratings yet
Lab 08: Fourier Transform
3 pages
6062af0725e0171bcd2e6c7b - SS AP Calculus AB Unit 4
No ratings yet
6062af0725e0171bcd2e6c7b - SS AP Calculus AB Unit 4
8 pages
Excel Stat Tables
No ratings yet
Excel Stat Tables
14 pages
ch08 SamplingDist
No ratings yet
ch08 SamplingDist
43 pages
Math 151 Homework 04 PDF
No ratings yet
Math 151 Homework 04 PDF
6 pages
Tutor-Marked Assignment E: Mathematics IGCSE Module Six: Further Geometry and Trigonometry Probability
No ratings yet
Tutor-Marked Assignment E: Mathematics IGCSE Module Six: Further Geometry and Trigonometry Probability
5 pages
Arc Length and Radian Measure: Reteach
No ratings yet
Arc Length and Radian Measure: Reteach
10 pages
3.decision Making and Looping
No ratings yet
3.decision Making and Looping
3 pages
Post Test: Data On Pre-Test (Copy From "Pre-Test" File "Raw For Report" Sheet, Row 9)
No ratings yet
Post Test: Data On Pre-Test (Copy From "Pre-Test" File "Raw For Report" Sheet, Row 9)
24 pages
Quiz 5
No ratings yet
Quiz 5
10 pages
Matrices 1 Marks Answers
No ratings yet
Matrices 1 Marks Answers
6 pages
AP Question Bank Advanced
No ratings yet
AP Question Bank Advanced
5 pages
Us2 M 92 Improper Fractions Activity Sheet English United States Ver 7
No ratings yet
Us2 M 92 Improper Fractions Activity Sheet English United States Ver 7
4 pages

Equilibrium in a Stochastic n-Person Game: 1 Ή the game is eJ (Γ) - This choice

Uploaded by

Equilibrium in a Stochastic n-Person Game: 1 Ή the game is eJ (Γ) - This choice

Uploaded by

J . SGI. HIROSHIMA UNI V. SER.

You might also like