Micro Notes August 14 2016
Micro Notes August 14 2016
Geoffrey Heal
August 14, 2016
Contents
I
1 Consumer Choice
1.1 Comparative Statics . . . . . . . . . . . . . . . . . . . . . . .
1.2 Preferences . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6
7
8
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
15
19
20
21
21
24
26
5 Preference Aggregation
28
30
II
Set
. .
. .
. .
. .
.
.
.
.
.
34
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
35
36
37
38
42
43
III
44
44
Paradoxes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
10 Risk Aversion
10.1 Risk Management . . . . . . . . . . . . . . .
10.1.1 Mean-Variance . . . . . . . . . . . .
10.2 Comparison of payoffs in terms of return and
10.3 A Geometric Approach to Insurance . . . . .
. . .
. . .
risk
. . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
50
51
56
57
59
60
12 State-Dependent Preferences
61
13 Subjective Probabilities
63
13.1 Savages Axioms . . . . . . . . . . . . . . . . . . . . . . . . . . 64
14 Non-Expected Utility Approaches
66
14.1 MinMax Approaches . . . . . . . . . . . . . . . . . . . . . . . 67
14.2 MaxMin Expected Utility . . . . . . . . . . . . . . . . . . . . 68
14.3 Smooth Ambiguity Aversion . . . . . . . . . . . . . . . . . . . 69
2
14.4 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
15 Third Problem Set
75
IV
77
General Equilibrium
16 Edgeworth Box
78
81
18 Problem Set 4
86
87
20 Public Goods
88
21 External Effects
93
94
95
Part I
Consumer Choice
Commodity space is RN . Commodity bundles are vectors in RN . Consumption set X RN . Reflects physiological constraints - how much effort
can be supplied, how much food is needed to survive, dependence of effort
on food consumption, maximum of 24 hours in a day, etc. Generally closed,
N
for simplicity take to be R+
.
Budget is constraint on amount that can be spent, not to exceed income
W . p RN is a pricevector, and p.x is the
cost of bundle x at prices p. The
N
budget set B is B = x R : p.x W .
Consumer problem is to choose a best point in B according to the ordering &, i.e. to choose in
{x NB : y B, x & y}. The demand correspondence is x (p, W ) = x R+ : x B, y B, x & y . This is a correspondence (set-valued map) as it is not necessarily single-valued.
Note that the price vector p is orthogonal to the budget hyperplane H =
{x : p.x = W }, i.e. p.z = 0 for any vector z lying in the hyperplane H. To
see this let y be such that p.y = W, and also let x H, then (x y) is a
vector in p.x = W , and p.x = p.y = W so p. [x y] = 0 and p is orthogonal
to (x y).
Definition 3. Demand correspondence x (p, W ) is homogeneous of degree zero if x (p, W ) = x (p, W ) for any p, W and any > 0.
6
Also needed is
Definition 4. Demand correspondence satisfies Walras Law if for every
p > 0 and W > 0 we have p.x = W for all x x (p, W ).
Assume that the demand correspondence is single-valued, i.e. a function.
Proposition 2. The demand function is homogeneous of degree zero.
Proof. {x : p.x W } = {x : p.x W }
1.1
Comparative Statics
..
..
...
Dp (p, Y ) =
.
.
xN (p,W )
p1
.......
xN (p,W )
pN
xn (p, W )
pk
pk
xn (p, W )
n,W =
xn (p, W )
W
W
xn (p, W )
pk
pk +
xn (p, W )
W = 0 n = 1, ..., N
W
7
n
But as x (p, W ) = x (p, W ) > 0, x
=0
n
+
pn x
pk
1.2
Preferences
A strengthening of this is
Definition 8. The preference relation & on X is strictly convex if for every
x X the upper contour set is strictly convex or if we have if y & x & z &
x & y 6= z then y + (1 ) z x for any (0, 1).
Homothetic preferences are often used because they have some neat properties:
Definition 9. A monotone preference relation on X is homothetic if x
y x y = 0
Intuitively, with homothetic preferences every indifference curve can be
obtained from one by scaling it up or down. Another property often used is
that of being quasi-linear:
Definition 10. A preference relation & is called quasi-linear with respect to
commodity 1 (called the numeraire commodity) if
(1) all indifference sets are parallel displacements of each other along
the axis of commodity 1, that is if x y then x + e1 y + e1 for
e1 = (1, 0, 0, 0, ..., 0) and any scalar > 0, and
(2) good 1 is desirable, that is x + e1 x for all x and > 0.
Next continuity of preferences:
Definition 11. The preference relation & on X is continuous if its upper
and lower contour sets are closed.
Rational continuous preferences can be represented by a real-valued function which we call a utility function. This means there is a function
U : X R such that U (y) = U (x) if f y & x.
Proposition 5. If the preference ordering & on X is rational and continuous
then there exists a continuous utility function that represents this ordering.
This means that the utility function numbers indifference sets in such a
way that the numbering is higher for more preferred sets. Note that there
are in general very many ways of doing this so that the utility representation
is not unique. Preferences are in some sense more fundamental than the
utility representation. And this is not always possible: continuity of the
upper contour sets matters, as the next example shows.
9
2.1
(2.1)
1
is a Lagrange multiplier. We assume the function U to be
where R+
concave: in this case
11
2.2
UMP solutions
Necessary conditions for x to be a solution to the UMP are that there exist
a Lagrange multiplier such that:
U (x )
5 pn , with equality if xn > 0
xn
So if we are at an interior optimum then
U (x )
= pn
xn
In matrix notation, letting U (x) = (U/x1 , ......., U/xN ), we have
U (x ) 5 p, x . [U (x) p] = 0
This tells us that at an interior solution the gradient vector of the utility function must be proportional to the price vector, or more intuitively marginal
rates of substitution between commodities must be equal to their price ratios:
U (x )/xk/U (x )/xj
= pk/pj
Note that the value of the Lagrange multiplier gives us the marginal utility
of wealth, i.e. the rate at which utility increases with wealth if the budget
constraint is relaxed slightly. To see this note that if W changes by 4W
then the resulting change in utility is
4U =
X
X U
4xn =
pn 4xn = 4W
xn
Proof. Point 6:
P
xj
j pj pn = 0.
V
pn
U xj
j xj pn
pj pnj . But
xj p j = W xn +
p1 (1 )
=W
p2
, x2 = W
p1
p2
From these we can work out the income and price elasticities:
P ED1,1 = P ED2,2 = 1, P ED1,2 = P ED2,1 = 0, IED1 = IED2 = 1
Note that the cross PEDs are zero, which is obvious when you rewrite the
utility function as lnx1 + (1 ) lnx2 . We can work with the log of the
utility function as it is an increasing transformation and so preserves the
underlying ordering, i.e. the upper contour sets are the same as before. It
is often interesting to look at expenditure shares as a function of price and
income. In this case we have
S1 =
p 1 x1
p 2 x2
= , S2 =
=1
W
W
13
so the expenditure shares are just the exponents of the consumption levels
in the Cobb-Douglas case.
Indirect utility function: by definition V (p, W ) = U (x ). From above
1
1
so
V
(p,
W
)
=
W
x1 = W p1 , x2 = W 1
p2
p1
p2
Example 2: Linear Utility
M axx {ax1 + bx2 } s.t. p1 x1 + p2 x2 = W
There exists such that
U
5 pi , = if xi > 0, i = 1, 2
xi
So
a 5 p1 , = if x1 > 0; b 5 p2 , = if x2 > 0
So if x1 , x2 > 0 then a/b = p1/p2 . Otherwise either x1 > 0 and x2 = 0 or vice
versa.
Example 3: Fixed Coefficients
Utility is U (x) = min {ax1 , bx2 } so the problem is
M axx min {ax1 , bx2 } , p1 x1 + p2 x2 = W
We know that to avoid wasting either good we need ax1 = bx2 so we want
to max x1 with x2 = x1 a/b. So p1 x1 + p2 x1 ab = W and
x1 =
bW
aW
, x2 =
p1 a + p2 b
p 1 a + p2 b
abW
p1 a + p2 b
x2
14
p1
p2
1
1
pr1 pr
2
+ 1 = x2 p 2
pr1 + pr2
pr2
=W
so finally
pr1
pr1
2
x2 = W r
, x1 = W r 1 r
r
p1 + p 2
p1 + p2
Indirect utility is
r
1/
1/
1
W pr1
W pr1
p1 + pr2
1
2
r
r r
V (p1 , p2 , W ) =
+
=
W
=
W
(p
+
p
)
1
2
pr1 + pr2
pr1 + pr2
(pr1 + pr2 )
15
1
p2
, p2 =
, x 1 = x2
x1
x2
p1 1
p2
p1 1
+ (1 ) ln x2 = U
p2
ln x2 = U ln
p1 1
1
p1
p2
1
U
U
x1 = e
, x2 = e
p1
1
p2
p
1
p2
p2
2
1
U
U
e
p1
=e
+ p2
p1
1
p2
1
p1
1
16
p p 1
2
1
=e
1
Example: Fixed Coefficients
2b
We know that V ((p1 , p2 , W ) = p1abW
W = Expenditure = U p1 a+p
b+p2 a
ab
Example: CES
1
We know that U = V (p1 , p2 , W ) = W (pr1 + pr2 ) r W = Expenditure =
1
U (pr1 + pr2 ) r
U
Definition 14. The optimal commodity vector in the EMP, denoted h (p, U ) :
RN RN , is known as the Hicksian or compensated demand function.
The adjective compensated indicates that this is demand as a function
of prices with the utility level constant, which means that as prices change
wealth must be altered too to allow the consumer to maintain the same utility
level.
Proposition 14. Suppose U is continuous and represents a locally nonN
satiated preference & on R+
. Then for any p 0 the Hicksian demand
function h (p, U ) is
1. homogeneous of degree zero in p
2. U (h (p, U )) = U for any p
Next we establish a connection between the expenditure function and the
compensated (Hicksian) demand function. Note that e (p, U ) = h (p, U ) .p
Proposition 15. Let U be a continuous utility function representing a locally
non-satiated strictly convex preference relation & on X. For all p and U the
Hicksian demand h (p, U ) is the derivative vector of the expenditure function
e (p, U ) with respect to prices:
h (p, U ) = p e (p, U ) or hn (p, U ) = e (p, U ) /pn n
Proof. Assume for simplicity that h (p, U ) 0 and is differentiable. Using
the chain rule we can write
X hi
X
e (p, U )
=
hi (p, U ) pi =
pi
+ hn (p, U )
pn
pn i
pn
i
Using the FOCs this is
X hi U
e (p, U )
=
+ hn (p, U )
pn
p
h
n
i
i
17
hi U
pn hi
So
the compensated demand function is the regular demand function evaluated at the same prices and at the wealth level required to reach the
specified utility level, and
the regular demand function is the compensated demand function evaluated at the same prices and at the indirect utility function evaluated
at the same prices and wealth.
The next proposition states that the effect of a price change on demand
can be decomposed into two parts, one due to the price change alone at
constant welfare level and the other due to the change in real income resulting
from a price change. These are called the substitution and income effects
respectively.
Proposition 16. (The Slutsky equation) Assume U is a continuous utility
function representing a locally non-satiated strictly convex preference relation
& defined on X. Then for all (p, W ) and U = V (p, W ) we have
hn xn (p, W )
xn (p, W )
=
xk (p, W ) n, k
pk
pk
W
or equivalently in matrix notation
Dp x (p, W ) = Dp h (p, U ) DW x (p, W ) x (p, W )T
and attaining
Proof. Consider a consumer facing the price-wealth pair p, W
= e p, U . For all (p, U ) we
utility level U . Her wealth level must satisfy W
have hn (p, U ) = xn (p, e (p, U )). Differentiating this with respect to pk and
evaluating at p, U gives
hn p,
U
xn p,e
p,
U
xn p, e p, U e p,
U
=
+
(3.1)
pk
pk
W
pk
This yields from proposition 15
hn p,
U
xn p,e
p,
U
xn p, e p, U
=
+
hk p, U
pk
pk
W
18
= e p,
we
Finally since W
U and hk p, U = xk p, e p, U = xk p, W
have
xn p, W
xn p, W
hn p,
U
=
+
xk p, W
pk
pk
W
Simple rearrangement gives
xn p, W
hn p,
U
xn p, W
xk p, W
pk
pk
W
Now let k = n so that we are looking at own price effects:
xn p, W
hn p,
xn p, W
U
xn p, W
pn
pn
W
,
W
n
W
can also express this in terms of elasticities by multiplying through by pn /xn :
W p n xn
pn
hn p,
xn p, W
U pn xn p, W
=
pn
xn
pn
xn
W
xn W
which states that the elasticity of regular (Marshallian) demand equals that
of compensated (Hicksian) demand minus the income elasticity of demand
times the expenditure share.
3.1
= F (x, y, p) = 0
U2 (x | y) p2
We can use the implicit function theorem to get the impact of a change in
the parameter y on the demand for say x1 .
x1
Fy
U2 U1y U1 U2y
=
=
y
F x1
U2 U11 U1 U21
and for a particular functional forms we can determine the sign of this.
3.2
Another example is the supply curve for labor as a function of the wage
rate. Let utility be U (y, L) where y is consumption and L is leisure, so U
is increasing in both. Income is given by y = (K L) w + A where w is the
wage rate and K is the number of hours available for work and leisure. A is
the agents non-labor income. The consumer problem is
M axL U (y, L) , y + wL = wK + A
and we let S = wK +A be the total income the agent could earn if he devoted
all his time to work. This is his wealth: unlike in previous cases it depends
on the price w. The FOCs are
UL w = 0, Uy = 0
From this we get the Marshallian demand function xl (p, w, S) for leisure
and by solving the expenditure minimization problem we get the Hicksian or
compensated demands hl (p, w, U ) for leisure. Using the Slutsky equation we
can write
hl
xl
xl
=
l
w
w
S
Note that xl (p, w, S) = xl (p, w, wK + A) from which
dxl
xl
xl
=
+K
dw
w
S
20
Note that xl depends on w via two of its arguments so we need to take this
into account when differentiating with respect to w. I have used dxl /dw to
stand for the derivative taking this into account. The standard derivation of
the Slutsk equation does not take this into account.
Substituting this into the Slutsky equation we have
dx
xl
hl
xl
xl
= l K
=
l
w
dw
S
w
S
so
hl
dxl
=
+
dw
w
hl
S
(K l)
Here the first term is negative as it is the own price effect: an increase in the
wage rate makes leisure more expensive and reduces consumption of leisure
(substitution effect), while if leisure is a normal good the second term is
positive representing the income effect of a wage change.
3.2.1
Example
so the ratio of leisure to work equals the ratio of the exponents of leisure
and work. Labor supply is independent of the wage rate: income effects
exactly offset substitution effects. In the first problem set you will investigate
whether this is true for a CES utility function.
Now we switch to an application in the cost-benefit area:
3.3
change? Note that e (p1 , U 0 ) tells us how much it costs to attain the initial
welfare level at the new prices p1 . So the CV is the change in wealth needed
to compensate for the new prices and get back to the initial state. We can
write the CV as
V p1 , W CV = U 0
In each case we are trying to restore the consumer to the original welfare
level but in the EV case using the initial prices and in the CV case using the
final prices. These two clearly in general give different answers, but they will
nevertheless give the same ranking of alternatives - the consumer is better off
under p1 if and only if these measures are both positive. For normal goods
(IED > 0) we have that EV = CV and in the case of quasi-linear preferences
we have EV = CV . To see this consider the two-good case with just the
price of good 1 changing so that p01 6= p11 , p02 = p12 . We can express the EV in
terms of the Hicksian or compensated demand.
Recall that w = e (p0 , U 0 ) = e (p1 , U 1 ) and that (by proposition 15)
h1 (p, U ) = e (p, U ) /p1 . Hence we can write
EV p , p , W = e p , U W = e p0 , U 1 e p1 , U 1 =
p01
h1 p1 , p2 , U 1 dp1
p11
so the EV can be represented by the area between p11 and p01 to the left of the
Hicksian demand curve for good 1 associated with utility level U 1 . Similarly
the CV can be expressed as
p01
0 1
CV p , p , W =
h1 p1 , p2 , U 0
p11
which is the area between the two prices to the left of the Hicksian demand
curve corresponding to utility level U 0 . Clearly
p01
0 1
0 1
EV p , p , W CV p , p , W =
h1 p1 , p2 .U 1 h1 p1 , p2 , U 0
p11
h
i
This reduces to M inx1 p1 x1 + p2 U f (x1 ) and the solution to this is x1 =
0 1
p1
f
, X2 = U f (x1 ). Hence the expenditure function is
p2
0 1 p
p1
0 1
1
e p1 , p2 , U = p1 f
+ p2 U f
f
p2
p2
and the derivative of this with respect to p1 is independent of the utility level.
In this case the EV and CV are equal and are both equal to the conventional
Marshallian consumer surplus, the area between the two prices to the left of
the regular demand curve.
3.4
24
p01 +t
p01
p01 +t
=
p01
h1 p1 , p2 , U 1 dp1 th1 p01 + t, p2 , U 1
h1 p1 , p2 , U 1 dp1 h1 p01 + t, p2 , U 1 dp1
25
products of each income elasticity with its budget share must equal
one, i.e.
X
k E k = 1
k
where Ek =
xk y
y xk
and k = pk xk /Y .
27
Preference Aggregation
The first issue to look at here is that of social indifference curves. Recall that
for a rational preference relation & on RN the preferred or indifferent set to
the point x is P I (x) = {y : y & x}. Let there be I individuals indexed by
i, each consuming xi . The for each person we have P Ii (xi ) = {y : y &i xi }
where &i denotes the i-th individuals preference ordering. Consider
(
)
X
X
P I (x1 , ..., xI ) =
P Ii (xi ) = Y : Y =
yi , yi P Ii (xi ) i
i
So this is the set of points that can be divided between agents so that each
is in the preferred or indifferent set to xi . The boundary of this set is the
social indifference curve SIC associated with (x1 , ...., xI ), SI (x1 , ..., xI ), and
is the set of points that can be divided between agents so that each is on
the indifference curve containing xi . An important question is: do these
indifference curves define a rational preference - complete and transitive?
Complete is not an issue: transitive is.
Proposition 17. A sufficient condition for the SICs SI (x1 , ..., xI ) for all
possible allocations x1 , ....., xI to form a transitive preference is that all agents
have identical and homothetic preferences. (Note: a necessary and sufficient
condition is a bit weaker but not a lot: it is that each agent have a preference
that is an affine translation of a given homothetic preference. See paper by
me and Chichilnisky Jour Math Econ 1983 )
Proof. Recall that a preference is homothetic if and only if x y x
y > 0. This implies that x & y x & y > 0. So if agent i0 s
preferences are homothetic then all preferred or indifferent sets are scalar
multiples of each other: > 0 : P Ii (xi ) = P Ii (yi ) f or any xi , yi . If all
preferences are homothetic and identical there exists an individual k and an
allocation xk and scalars j (xj ) such that
P Ij (xj ) = j (xj ) P Ik (xk )
Hence
X
P Ij (xj ) = P Ik (xk )
j (xj ) = P Ik (xk )
Wi
Wi = 0 l
0 1 p
Wi p2 0 1 p2
2
= f
, xi,1 =
f
p1
p1
p1
p1
29
x : Ui (x) = Ui .
o
n
All forms of voting are social choice rules - single non-transferable votes,
transferable votes, proportional representation, etc. They all map a diverse set of individual preferences into a single social preference. Voting
systems and other social choice systems run into the Condorcet Paradox.
Consider a set of three people {A, B, C} with preferences over three alternatives {, , }. Let their rankings of these alternatives be as follows:
A : , B : , C : . Let them vote between
the three options. Two prefer to , and two prefer to . We expect,
then that they will vote for over . But in fact two prefer to , so we
have . This is called a voting cycle or Condorcet cycle.
Transitive individual preferences are aggregated via voting to an intransitive
social preference in this case.
More generally consider a social choice rule as defined above satisfying
the following properties:
1. Unrestricted Domain - it works for all possible N-tuples of preferences
in N .
2. Pareto Principle: if all individual preferences prefer alternative to
alternative then the social preference also prefers to .
3. Independence of Irrelevant Alternatives: the social preference between
any two alternatives {, } depends only on individuals preferences
over and and not on their preferences about other alternatives.
Formally, for any pair of alternatives , , and for any two prefer0
0
ence n-tuples &i and &i , if & and &i agree on {, } then the social
preference between and is the same for both preference N-tuples.
4. Non-Dictatorship: there is no individual such that { &i & }
where & denotes the social preference.
Theorem 1. (Arrows Impossibility Theorem) There is no social choice rule
satisfying the above four conditions.
Another way of thinking about this: any rule satisfying the first three conditions is dictatorial.
This is a very influential result - it has been taken as saying that perfect
democracy is impossible. But there are social choice rules that satisfy points
2, 3 and 4 if we place some restrictions on the N-tuples of preferences admitted
and drop the unrestricted domain condition. There are many results showing
31
7
7.1
7.2
Endogenous Preferences
33
Part II
The boundary of this set is called the y1 isoquant: {y2 , .., yN : f (y2 , ..., yN ) = y1 }.
The slope of this isoquant is the marginal rate of technical substitution. A
common example of a production function is the Cobb-Douglas:
y = x1 x2
Here the marginal rate of technical substitution [MRTS] between x1 and x2
is
x2
M RT S1,2 =
x1
7.3
1. Y is non-empty
2. Y is closed
N
{0}
3. No free lunch: Y R+
35
7.4
Problem is
M axy {p.y} , y Y
or
M axy {p.y} , F (y) 5 0
and given that the profit maximizing plan will generally be in the boundary
we can work with
M axy {p.y} , F (y) = 0
Definition 19. Profit function (p) = M axy {p.y} , y Y
f
pl , = if xl > 0
xl
These FOCs are generally necessary conditions for profit maximization: they
are also sufficient if the production possibility set is convex. From the FOCs
it follows that the MRTS will equal the price ratio:
F/xl
pl
=
F/xj
pj
and of course the FOC for the single output case states that the price of an
input is the value of its marginal product.
Some facts about the profit function and the supply correspondence:
36
Proposition 20. (1) The profit function (p) is homogeneous of degree one.
(2) The supply correspondence y (p) is homogeneous of degree zero.
(3) If y (p) consists of a single point then (p) is differentiable at p and
(p) = y (p) (Hotellings Lemma)
(4) If y (p) is a function and differentiable at p, then Dy (p) = D2 (p) is
a symmetric and positive semi-definite matrix with Dy (p) p = 0.
Point (4) here is sometimes called the Law of Supply: it says that
quantities respond to a price change in the same direction as the price change
- if a price increases then the supply of that good increases too.
7.5
c
, = if y > 0
y
37
Examples:
pL1 K w = 0
pL K 1 r = 0
which imply
w
K
=
L
r
This yields
L =
1
w
K =
w
1
o 1
r
1
r
1
o 1
(
w
1
) 1
p+
y
y
= 0, r = 0, y L K = 0
L
K
38
Dividing the first by the second and substituting into third yields factor
demand functions given y:
L =
r
w
+
1
+
, K =
r
w
/(+)
y 1/(+)
1 +
r + y +
w
k
c (w, r, y) = wL + rK =
+
k
w + r + y +
1
1
c
= w + r + y + =
y
k
c (w, r, y)
AC =
= Gy + , G =
w + r +
y
k
and check whether this is increasing or decreasing with output:
1
AC
1
=
Gy + < 0 + > 1
y
+
so average costs are decreasing with output if and only if we have increasing
returns to scale. With diminishing returns to scale average costs are rising
with output level.
Leontief technology
The production function is
f (x1 , x2 ) = min [ax1 , bx2 ]
As we know that the firm will not waste inputs, it must operate where y =
ax1 = bx2 . So the input demands are
y y
(x1 , x2 ) =
,
a b
39
w
w2
+
a
b
1
y
w1
w2
,
0
if
<
a
a
b
y
w2
w1
>
0,
if
(x1 , x2 ) =
b
a
b
(x1 , x2 : ax1 + bx2 = y, x1 , x2 0) if wa1 = wb2
and the cost function is just
c (w1 , w2 , y) = min
40
w2 i
,
y
a b
hw
41
Figure 7.1:
7.6
Aggregation of Firms
We noted that it is hard to aggregate consumers. It is much easier to aggregate firms. In this context a useful result is the following, which tells us
that maximizing profits over the sum of all firms production possibility sets
is equivalent to maximizing over each set individually and then adding up
these maxima:
Proposition
22. Let Yi , i = 1, . . . , I be production possibility sets and Y =
P
N
vector. Let y = argmaxyY {p.y} and
i Yi be their sum. p R is a priceP
But
Pp.yi p.yi yi Yi , i. So i p.yi = p. i yi p. i yi yi Yi
p. i yi p.y y Y .
42
43
Part III
are closed.
So the set of combinations of l, l0 that are at least as good as l00 is a closed
set, as is the set that of combinations that are no better than l00 . As in
the deterministic case this is ruling out lexicographic preferences where the
agent places all emphasis on the probability of one particular outcome - for
example on the risk of death being zero. Next comes another assumption, a
very crucial one.
Definition 24. The preference relation on the space of simple lotteries L
satisfies the independence axiom if for all l, l0 , l00 L and a (0, 1) we have
l l0 if f al + (1 a) l00 al0 + (1 a) l00 .
In words, if we mix two lotteries with a third one, the the preference
ordering of the two resulting mixtures does not depend on (is independent
of) the particular third lottery used. l is preferred to l0 iff l in any mixture
with a third lottery l00 is preferred to l0 in the same mixture with l00 . Heres
an example of what this axiom means.
Suppose l l0 and a = 0.5. 0.5l + 0.5l00 is the lottery resulting from a
coin toss between l, l00 , say heads giving l and tails l00 . Likewise 0.5l0 + 0.5l00
is the lottery generated by a coin toss between l0 , l00 , again heads giving l0 .
Conditional on heads 0.5l + 0.5l00 is at least as good as 0.5l0 + 0.5l00 , and
conditional on tails they give the same outcome. So it is reasonable that
0.5l + 0.5l00 0.5l0 + 0.5l00 , which is what the axiom implies.
Definition 25. The utility function U : L R has an expected utility form
if there is an assignment of numbers u1 , ...., uN to the N outcomes such that
for every simple lottery l = (p1 , ...., pN ) L we have
X
un pn
U (l) =
n
A utility function with the expected utility form is called a von NeumannMorgenstern (vN-M) expected utility function.
Note that if we let ln denote the degenerate lottery yielding P
outcome n
with probability one then U (ln ) = un . The expression U (l) = n un pn is
linear in probabilities, suggesting
45
k=1
P
for any K lotteries lk and probabilities a1 , ..., aK 0, k ak = 1. [In words
the utility of a mixture of simple lotteries equals the expectation of the utilities
of the individual simple lotteries.]
Proof. Suppose U has the linearity property. We can write any lottery l =
(p
degenerate
lotteriesPl1 , . . . . . . , lN : l =
P
P1 , .., pnN ) as a combinationPof the
n
n
n pn un . So U has
n pn U (l ) =
n pn l ) =
n pn l . Then U (l) = U (
the expected utility form.
Now suppose U has the expected utility form, and
consider a compound
k
k
lottery
(l
,
...,
l
;
a
,
...,
a
)
where
l
=
p
,
....,
p
K
1
K
k
1
N . Its reduced lottery is
P 1
0
l = k ak lk . Then we have, remembering that the probability of outcome n
in the reduced lottery is
X
ak pkn
pn = a1 p1n + a2 p2n + .... + aK pK
n =
k
!
U
X
k
ak lk
"
=
un
#
X
ak pkn =
"
X
k
ak
#
X
n
un pkn =
ak U (lk )
46
ak [U (lk ) + ] =
ak U (lk )
U (l) U (l)
U l U (l)
Since l U l +(1 l ) U (l) = U l l + (1 l ) l and U represents the preference it follows that l l l + (1 l ) l. In this case since U is also linear
and represents the same preferences
h
i
9.1
Paradoxes
Lots of people feel uneasy about the independence axiom and there are various paradoxes that seem to violate it. One of the best known is the Allais
Paradox. Number of outcomes is 3, N = 3.
First prize Second prize Third prize
$2,500,000
$500,000
$0
First choice is between two lotteries l1 , l10 :
l1 = 0, 1, 0 : l10 = 0.10, 0.89, 0.01
Second is between l2 , l20
l2 = 0, 0.11, 0.89 : l20 = 0.10, 0, 0.90
48
10
Risk Aversion
Consider a lottery over monetary amounts, non-negative numbers, with probabilities given by the density function f (t) , t [0, ]. The cumulative disx
tribution function is F (x) = 0 f (t) dt. Note that the final distribution for
a compound lottery is a weighted average of the distributions of each of the
component lotteries: if l1 , . . . . . . , lK : aP
1 , ..., aK is a compound lottery then
the cumulative distribution is F (x) = k ak Fk (x).
We now take the set of all lotteries to be the set of all cumulative distributions over non-negative amounts of money. We can apply the vN-M
theorem to show that there is a utility function of the form
U (F ) = u (x) dF (x)
Note that U is defined on lotteries and u is defined on amounts of money. U is
generally called the VN-M utility and u the Bernoulli utility. The axioms of
expected utility theory place restrictions on U but not on u, which could be
any increasing continuous function.
Note: it is sometimes argued that u should be bounded above, because
of the St Petersburg Paradox. Assume that u is unbounded and let xm be
an amount of money such that u (xm ) > 2m . Consider the following lottery:
Toss a coin repeatedly until heads comes up. If this happens on the mth
toss the payoff is xm . The expected utility from this lottery is
m
m X
X
1
1
m
>
= +
2
u (xm )
2
2
1
1
so you ought in principle to be willing to pay any amount to play this lottery.
Clearly most people are not, so this is an argument for u being bounded
above.
Definition 26. A decision maker is risk averse if for
any lottery F (.) the
degenerate lottery that yields the expected amount xdF (x) with certainty
50
u (x) dF (x) u
xdF (x) F
In words, the expected utility of the outcome does not exceed the utility of
the expected outcome. This is called Jensens Inequality, and is the inequality
used to define a concave function, so risk aversion is equivalent to the function
u being concave - diminishing marginal utility of income or wealth.
Definition 27. The certainty equivalent of a lottery F , denoted c (F, u), is
the amount of money that the individual regards as indifferent to the gamble
represented by the lottery:
10.1
Risk Management
zdF (z) > 1, so that its mean return exceeds that of the safe asset.
Wealth W can be invested in any way between the two assets, with a, b
the amounts invested in the risky and safe assets respectively, with a+b = W .
For any realization of z the portfolio pays az + b. The choice problem is
Note that zdF (z) > 1 (0) > 0. So a = 0 cannot satisfy this equation
and the optimal portfolio has a > 0. Conclusion: if a risk is actuarially
favorable then a risk averter will always accept at least a small amount of it.
52
Definition 28. Given a Bernoulli utility function u for money, the ArrowPratt coefficient of absolute risk aversion at x is defined as rA (x) = u00 (x) /u0 (x).
Note the negative sign in front: for concave functions this is always nonnegative.
We know that risk neutrality is equivalent to linearity, and that risk
aversion seems to increase with the curvature of u. The utility function can be
recovered from rA by integrating twice, up to two integration constants. The
integration constants dont matter as u is unique only up to two constants
anyway.
Example: Consider the utility function u (x) = eax for a > 0. Then
we have u0 (x) = aeax and u00 (x) = a2 eax , so that rA (x, u) = ax.
Given two utility functions u1 (x) , u2 (x), when can we say that one is
more risk averse than the other?
Proposition 27. The following are all equivalent:
1. rA (x, u2 ) rA (x, u1 ) x
2. There exists an increasing concave function (.) such that u2 (x) =
(u1 (x)) x: that is u2 is a concave transformation of u1 .
3. c (F, u2 ) c (F, u1 ) F (.)
4. Whenever u2 (.) finds a lottery F (.) at least as good as a riskless outcome x, then
u1 (.) also finds F (.) at least as good as x. Or u2 (x) dF (x)
u2 (
x) u1 (x) dF (x) u1 (
x) F (.) , x.
Proof. Show that 1 and 2 are equivalent. Note that for some increasing
function we always have u2 (x) = (u1 (x)) because the two represent the
same (increasing) ordering on R1 . Differentiating
u02 (x) = 0 (u1 (x)) u01 (x)
and again
2
u002 (x) = 0 (u1 (x)) u001 (x) + 00 (u1 (x)) (u01 (x))
Dividing both sides of u002 by u02 and using the first line we get
rA (x, u2 ) = rA (x, u1 )
00 (u1 (x)) 0
u (x)
0 (u1 (x)) 1
53
has a distribution function F (z) which we assume satisfies zdF (z) > 1, so
that its mean return exceeds that of the safe asset.
Wealth W can be invested in any way between the two assets, with ai , bi
the amounts invested in the risky and safe assets respectively, with ai + bi =
Wi . For any realization of z the portfolio pays ai z + bi . The choice problem
is
54
3. Given any risk F (t) on t > 0, the certainty equivalent cx = u (tx) dF (t)
satisfies x/
cx is decreasing in x.
Proof. We will show that 1. implies 3. Pick a distribution F (t) over t, and
for any x define ux (t) = u (tx). Let c (x) be the usual certainty equivalent
from definition 27: ux (c (x)) = ux (t) dF (t). Note that as u0x (t) = xu0 (tx)
1 u00 (tx)
u00x (t)
=
tx
u0x (t)
t u0 (tx)
55
for any x. Hence if 1. holds then ux0 is less risk averse than ux whenever
x0 > x. It follows from proposition 27 that c (x0 ) > c (x) and so c is increasing.
By the definition of ux , ux (c (x)) = u (xc (x)). In addition
Mean-Variance
Next a simple illustration of the role that the index of RRA can play. Let
y be a random variable distributedas F (y) with mean y . The expected
utility associated with this is Eu = u (y) dF (y) and define x so that
u (y x) = Eu = u (y) dF (y)
Here x is the cost of risk bearing, the difference between the certainty equivalent and mean y of the prospect F . Clearly
u00 (y )
2
0
so that
u (y x) u (y ) =
u00 (y ) 2
u00 (y )
2
(y y ) dF (y) =
2
2
u00 (y ) 2
1 y u00 2
1 2
x=
=
2
2 u0 y
2 y
So the cost of risk bearing is one half of the variance of the risk over the
mean outcome times the IRRA. It is also the index of absolute risk aversion
times half the variance.
We can write
1
Eu = u (y x) = u (y ) + u00 (y ) 2
2
56
which is a function of the mean outcome y and its variance 2 , and from
this we can use the implicit function theorem to get
u00
y
= 0
u + 0.5u000
Assuming u000 0 so the utility is almost quadratic we have
y
u00
= 0 >0
u
as the slope of an indifference curve in y space. So this is linear if
the index of absolute risk aversion (IARA) is constant and defines convex
preferred-or-indifferent sets if the IARA is increasing.
10.2
In comparing risky choices, we can ask two different questions: is one more
rewarding than the other, in terms of offering better outcomes, and is one
more risky than the other?
First we formalize the idea that distribution F yields unambiguously
higher returns than distribution G. We assume distributions satisfy F (0) = 0
and F (x) = 1 for some x. Two possible approaches: one to ask whether every
expected utility maximizer whose utility is increasing in income will prefer
one to the other, and the second is to ask if for every amount of money x the
probability of getting at least x is greater under one than under the other.
Both approaches lead to the same concept.
Definition 31. The distribution F first order stochastically dominates G if,
for every non-decreasing function u : R R,
57
which reduces to
b
0
u0 (x) F (x) dx
u (b) 1 u (a) 0
10.3
Figure 10.1:
Figure 10.1 gives a geometric way of thinking about insurance. There are
two states, 1 & 2. It is not certain which will occur, and their respective
probabilities are p1 , p2 . The consumers initial endowment is at the point z1
giving z11 in state 1 and z12 in state 2. The 45 degree line shows situations
where income is the same in each state, and these are therefore fully-insured
positions. The consumers expected utility is given by
u (z11 ) p1 + u (z12 ) p2
and the slope of an indifference curve is therefore
p1 u0 (z11 )
p2 u0 (z12 )
59
On the 45 degree line, z11 = z12 so this slope is just p1 /p2 , the ratio of the
probabilities.
Now consider the move from the initial position z1 to the fully insured
position z2 . This involves selling z1 z0 of income in state 1 and buying
z2 z0 of income in state 2. This transaction will move the consumer to a
fully insured position. What is the expected value of this transaction? The
probability of state 1 is p1 so the probability of giving up z1 z0 is p1 , and the
probability of state 2 and so of acquiring z2 z0 is p2 . So the expected value
0
, so
of this transaction is p1 (z1 z0 )+p2 (z2 z0 ) which is zero if pp21 = zz21 z
z0
the transaction is actuarially fair if the slope of the budget line, which is the
right hand side here, equals the price ratio. As the slope of an indifference
curve is always equal to the price ratio on the 45 degree line, the offer of
actuarially fair insurance will always be accepted and lead to full insurance.
Note that in this context convexity of the preferred-or-indifferent sets is
equivalent to risk aversion: it implies a preference for moving towards the 45
degree line.
11
We have discussed the index of relative risk aversion, xu00 (x) /u0 (x). This
parameter is also important in other areas of economics, where it is known as
the elasticity of the marginal utility of consumption and generally denoted
(c). To see why it is called this note that
cu00
du0 (c) c
=
dc u0 (c)
u0
Note also that the proportional rate of change of the present value of marginal
utility u0 (ct ) et (where ct is a function of time t) is given by
1 dc
dln u0 (ct ) et /dt = (c)
= g
c dt
where g = 1c dc
.
dt
Now consider the problem
dk
M axc
u (ct ) et dt, ct +
= f (k)
dt
0
60
where we are maximizing the integral of the utility of consumption, discounted at rate 0, subject to the constraint that consumption plus
investment dk/dt adds up to output f (k) where k is the capital stock and
f a strictly concave production function. This is called the Ramsey problem
or the optimal growth problem.
To solve this problem we use a Hamiltonian:
H = u (ct ) et + t et [f (kt ) ct ]
where t et is a time-varying shadow price, ct is called the control variable
and kt the state variable, and the expression multiplied by the shadow price
is the rate of change of the state variable.
First order conditions - necessary conditions - for a path of ct , kt to solve
this problem are that
d t et
H
H
= 0t,
=
t
ct
dt
kt
If all functions are concave these conditions are not only necessary but also
sufficient, plus one additional technical condition known as a transversality
condition.
Applying these conditions gives
u0 (ct ) = t
and
dt
t = t f 0 (kt ) ,
dt
dt
= 0 = f 0 (k)
dt
,from which
g + = f 0 (kt )
So the return on capital - f 0 (k) - has to equal the elasticity of MU times the
growth rate of consumption plus the discount rate. And the LHS here is the
rate of change of the marginal value of consumption.
12
State-Dependent Preferences
So far we have assumed that lotteries deliver money and that preferences are
over amounts of money, with no other characteristics mattering. It may be
61
however that the circumstance under which money is delivered matter: for
example, money if you have lost your house in an earthquake or hurricane
may be more valuable than money if you just won the lottery. An umbrella
is much more useful if it is about to rain than on a hot dry day. We will call
circumstances such as whether it is dry or raining, or whether there is an
earthquake or hurricane, states of nature. S is the set of all possible states
of nature (earthquakes, hurricanes, dry, wet, .... ), assumed to be finite, nonintersecting and exhaustive, and s S is a particular state (earthquake etc).
The probability of s occurring is s .
Definition 34. A random variable is a function g : S R+ that maps
states into monetary outcomes.
Every random variable g (.) gives rise toPa money lottery described by the
distribution function F (.) where F (x) = {s:g(s)x} s for all x. A random
variable can now be represented by a vector of the monetary payoffs it gives
in each of the states in S, denoted (x1 , ...., xS ) where we are using S also to
S
denote the number of states in S. The set of all random variable is now R+
.
In this framework the primitive concept is a preference ordering over the
S
. Such a preference can be represented by an
set of all random variables, R+
expected utility function as before, with one difference: the utility function
can now depend on the state of nature. So we have
S
Definition 35. The preference relation on R+
has an expected utility representation if for every s S there is a function us : R+ R suchPthat for any
0
0
S
0
0
(x
s s us (xs )
P1 , ..., xS ) 0and (x1 , ...., xS ) R+ , (x1 , ..., xS ) (x1 , ..., xS ) if f
).
u
(x
s
s s s
13
Subjective Probabilities
13.1
Savages Axioms
Axiom P6. For every f, g, h F with f g there exists a partition of S [a collection of pairwise disjoint events whose union is S] denoted
{A1 , A2 , .., An } such that for every i
fAhi g & f gAh i
This is roughly like a continuity assumption, but it is hard to state continuity
in Savages framework.
Axiom P7. Consider acts f, g F and an event A S. If for every
s S, f A g (s) then f A g, and if for every s A, g (s) A f , then
g A f .
Proposition 32. [Savage] Assume that X is finite. Then satisfies P1
to P6 if and only if there exists a probability measure on states S and a
non-constant utility function u : X R such that for every f, g F ,
f g
u (f (s)) d (s)
u (g (s)) d (s)
S
f g
u (f (s)) d (s)
u (g (s)) d (s)
S
1. If horse A wins you get a trip to Paris, and otherwise you get trip to
Rome
2. If horse A wins you get a trip to London and otherwise a trip to Rome
3. If horse A wins you get a trip to Paris and otherwise a trip to Los
Angeles
4. If horse A wins you get a trip to London and otherwise a trip to Los
Angeles
Clearly 1 and 2 are the same if A loses. Generally your choice will depend
on preferences and beliefs or probabilities, but presumably the chance of A
winning is the same in each case, so the choice depends on your preferences
between Paris and London. The same is true for 3 and 4, and axiom P2
requires 1 2 3 4. If two acts are equal on a given event, it does not
matter what they are equal to. So it doesnt matter if when the horse loses
you get Rome or LA.
14
chance from urn 1 is 0.5 and so it is less than this from urn 2. So the chances
of both red and black from urn 2 are less than 0.5. But they have to sum to
1.
Here is a related example. You have to bet on the toss of a coin. There
are two coins and you can choose which to toss. One has been tossed many
times and came down heads 50% of them. The other has never been tested.
Which would you rather bet on? In one case you know that the odds are
50/50: in the other case this is a reasonable assumption but you have no
evidence. Most people would sooner bet on the tested coin.
In both of these examples people cannot quantify probabilities and stay
away from bets involving unquantified risks.
14.1
MinMax Approaches
14.2
14.3
In this case we again work with many probability distributions that are consistent with what we know. But rather than focussing only on the worst
of them, in the sense of lowest expected utility, we give them all weights and
take note of them all according to these weights. The weight attached to
a distribution can be thought of as the subjective assessment of the chance
of that probability distribution being the correct one. (Klibanoff, Marinacci
and Mukerji, Decision-Making under Ambiguity, Econometrica, 2005, 73(6),
1848-1892)
The first assumption is that for any probability over states S there is a
utility such that acts are ranked by the expectation of that utility:
Axiom 1. Let p be a probability
over states S. Then there exists u :
utility Ep f = S u (f (s)) dp
In words this states that we prefer f to g if and only if the expectation of the
function of the expected utilities according to the weights is greater for
f than for g. We can think of as a second order utility function - defined
on expected utilities - and the weights as second order probabilities.
14.4
Examples
First look at the two-urn version of the Ellsberg paradox. Urn 1 has 100 balls,
50 red and 50 yellow. Urn 2 has also 100 red and yellow balls in unknown
proportions. You are asked if you are interested in betting $10 on a red ball
being drawn from urn 1 or urn 2. We consider the value of this bet firstly
with linear utilities and then with concave utilities.
Linear utilities: the value of the bet on urn 1 is clearly 0.510+0.50 = 5.
With urn 2 all possible distributions of 100 balls between red and yellow
are possible, and we can take either the mmu approach or the smooth am69
n pn 10 =
X n
1 X
10n =
nn
101
10 n
n
1
)
100
n
u (10)
100
If is linear this is just 0.5u (10), the same as the value of a bet on urn 1.
This is the case of no ambiguity aversion.
Let u (10) = 100 and consider instead the case of (x) = x0.5 , a strictly
concave function. Then the value of the bet is 1
100
100
100
100
X
1 n
1 X
1 X
1 X
(n) =
n
100 =
n<
101
100
101 n=0
101 n=0
101 n=0
n=0
Eu (f | mi ) = u (x) dp (x | f, mi )
The mmu approach is to value each act f according to the model that
gives the worst outcome:
minmi u (x) dp (x | f, mi )
and then maximize this value across acts:
i 0 (Eu (f | mi )) Eu0 (f | mi ) = 0
and divide the FOC through by the denominator of this expression to get a
new way of stating the FOC:
X
i0 Eu0 (f | m) = 0
i
So the expected sum of the marginal expected payoffs from a change in the
act f must be zero, where the expectation is calculated at the ambiguityadjusted probabilities. Because is concave and so 0 is decreasing, these
adjusted probabilities give more weight to bad outcomes and less to good
outcomes than the original second-order probabilities i .
Problem
A university has an endowment W that it may invest in bonds B or equity E.
Each type of security may go up 10% or go down 10%. The distributions are
not independent. The university has two financial advisers X and Y who give
different estimates of the probabilities of the possible cases, and the university
cannot tell which if either is correct. For adviser X these probabilities are
xij , and for Y they are yij . The university evaluates outcomes according to a
concave utility function U (P ) where P is the financial payoff. Formulate the
universitys investment problem according to the MaxMin Expected Utility
approach and the Smooth Ambiguity approach.
Answer
Here is the table of possible outcomes and the probabilities that advisor X
assigns to them: for advisor Y replace xij by yij .
B/E +10% -10%
+10%
x11
x12
-10%
x21
x22
The investment in equities is eW and that in bonds is (1 e) W . So the
expected utility according to advisor X is
EUx =
72
n
o
EUx
0
0
= 0.2 x21 U (C21 ) x12 U (C12 )
e
where Cij is income in state i, j, and this derivative is positive when e = 0
and negative when e = 1.
For the MaxMin Expected Utility approach we need to find EUx (e)
and EUy (e) for each value of e, pick the min,
V (e) = M in {EUx (e) , EUy (e)}
e
Note that we can divide both sides of this equation by x (EUx (e)) +
0
y (EUy (e)) giving
0
x (EUx (e))
y (EUy (e))
0
0
EUx (e)+
EUy (e) = 0
0
0
0
0
x (EUx (e)) + y (EUy (e))
x (EUx (e)) + y (EUy (e))
which we can write as
0
x0
x (EUx (e))
y (EUy (e))
=
y0 =
0
0
0
x (EUx (e)) + y (EUy (e))
x (EUx (e)) + y 0 (EUy (e))
So the solution involves setting the expected marginal gain from shifting the
portfolio equal to zero, where the expectation is taken via these ambiguityadjusted second order probabilities. Note that if is strictly concave then
the ambiguity adjustment involves placing more weight on the bad outcome
than with the initial second order probabilities. If is linear there is no
change.
74
15
75
76
Part IV
General Equilibrium
Next we study the interactions of firms and consumers through markets.
Firms as before are characterized by production possibility sets: firm i has
production possibility set Yi RN , and yi Yi is production plan. There are
I firms. The sign convention as before is that inputs areP
negative and outputs
N
with i pi = 1, pi 0 i.
positive. A price vector p is an element of R+
Clearly profits are given by i = p.yi . Firms seek to:
M axyi Yi {p.yi } = i
N
and a consumption
Consumers as before have preferences j on Xj R+
vector is xj Xj . There are J consumers. Preferences are represented by an
ordinal utility function uj : RN R. Consumers have endowments wj RN :
wj is the vector of goods that individual j owns and can either consume or
sell. Typically it contains labor, which may be consumed as leisure or sold
as work, and any other items that belong to the individual. Firms are owned
by individuals: individual j owns a fraction ji of firm i, entitling her to this
fraction of its profits. So the consumer choice problem is
X
ji i
M axxj u (xj ) , p.xj p.wj +
i
Here total spending power is the value of endowments plus the income from
shareholdings. A set of consumption and production plans, one for each
consumer and producer, is called an allocation.
Definition 36. Let yi , xj be an allocation. We say this is feasible if
X
X
X
xj
wj +
yi
j
i ji i
There are several questions one can ask about this concept. One is - is
it Pareto efficient? Another is - does such an equilibrium exist? We will
investigate the first question extensively. Before tackling the general cases,
we will look at a simple 2X2 case that can be studied geometrically, the case
of two consumers trading two goods, with no production - a 2X2 exchange
economy.
16
Edgeworth Box
is of length w1 and whose vertical side is w2 long. The lower left corner is
the origin for consumer a0 s preferences and the top right corner is that for
consumer b0 s preferences, increasing to the south west. The budget line for a
consumer is a line whose slope equals the price ratio, and which goes through
that consumers endowment vector [because she can afford her endowment
whatever the prices are]. Any point in this rectangle represents an allocation
of the two goods between the two consumers: its coordinates relative to the
normal lower left origin are the amounts allocated to consumer a, and the
remaining amounts, which are the coordinates relative to the upper right
origin, are the amounts allocated to consumer b. So in particular the initial
endowments of the two consumers form a point in this box. A line through
this point with slope equal to the price ratio gives the budget lines of both
consumers.
Figure 16.2:
The equilibrium prices associated with any allocation are given by the
slope of the budget line that goes through that allocation and is simultaneously tangent to indifference curves of a and b: this is shown in figure 16.1.
[Note - there may be more than one set of equilibrium prices associated with
an initial allocation.] This is a point where demand and supply are equal
and note that it is on the contract curve and so is Pareto efficient. So this
geometric approach suggests that a competitive equilibrium is Pareto efficient. Figure 16.2 shows a configuration where demand and supply are not
equated. The initial endowment is E, and at the prices shown A wants to
move to the point A where her indifference curve is tangent to the budget
line, and likewise B wants to move to B. So A wants to sell DE of good 1 and
80
17
81
and hence
X
p .x0j > p .
wj +
p .yi0
so that the allocation x0j , yi0 cannot be feasible. Feasibility requires that
X
j
x0j =
wj +
yi
which contradicts the previous inequality, taking the inner product with p
on both sides.
So the take-away here is that if consumers are maximizing utility and
firms are maximizing profits, all facing the same prices, and markets clear
[the allocation is feasible] then the allocation is Pareto efficient. All facing
the same prices is crucial: it means that marginal rates of substitution in
production and consumption are all the same.
82
W
=
p
.
w
+
j
j
j
i p .yi such that
1. i, yi max p .y, y Yi
2. j, if xj xj p .xj Wj
P P
P
3.
j xj =
j wj +
i yi
Note that the second condition here is different from that in definition 40: we
are not asking for utility maximization, but that any preferred choice costs
no less than the wealth level. Under some extra conditions this does imply
utility maximization - as we will see.
Proposition 36. Second theorem of welfare economics. Assume that all
production sets Yi are convex and that all preferences are also convex
and
locally non-satiated. Then for every Pareto efficient allocation xj , yi there
is a price vector p 6= 0 such that p, xj , yi form a quasi-equilibrium with
transfers.
P
Proof. 1. Let Vj = x RN : uj (x) > uj xj and define V = j Vj . This
is the set of aggregate
P allocations that can make everyone better off than at
xj . Also let Y
P= i Yi . Clearly V, Vj , Y are all convex sets.
2. Call
j wj = w, the aggregate endowment. The set Y + w, the
aggregate production set translated by the aggregate endowment, is the set
of all aggregate bundles available for consumption given the technology and
endowments.
3. Note that V (Y + w) = . This is an implication of the Pareto
efficiency of the allocation - if this intersection were non-empty then there
would be a vector that is feasible (in Y + w) and can
P be used to give every
83
P
P
uj (xj ) > uj (xj ) and so xj Vj j xj V , so p.
j r, and taking
jx
P
the limit as xj xj we have that j p.xj r.
P
P
P
6. p.
= p. (w + i yi ) = r . By 5 p.
r, but in
j xj
j xj
P
P P
r,
addition we know that j xj = i yi + w Y + w and so p.
j xj
P
P
P
p. w + yi +
yh r = p. w + yi +
yh
h6=i
h6=i
xk
xk r = p. xj +
p. xj +
k6=j
k6=j
so p.xj p.xj .
9. The wealth levels p.xj = Wj support p, xj , yi as a quasi-equilibrium
with transfers. Conditions 1 and 2 of definition 41 follow from 7 and 8 above.
Condition 3 follows from the feasibility of a Pareto efficient allocation.
What is the distinction between a quasi-equilibrium and an equilibrium,
and when are they the same? In a quasi-equilibrium, anything that is better,
costs no less: in an equilibrium, it costs more. When might something that
is better not cost more? When the price of a good in which you are not
satiated is zero. In this case in R2 the budget line is horizontal or vertical
and a consumer may wish to go infinitely far along it but be limited by the
amount of the good available - the total endowment.
Something similar could happen if the consumer has endowments only of
goods that have zero prices, so effectively has zero income.
If every consumer has positive wealth at a quasi-equilibrium with transfers then it is a price equilibrium with transfers and so any Pareto efficient
allocation can be supported as a price equilibrium with transfers.
84
85
18
Problem Set 4
Problem 1.
Consider an Edgeworth box economy in which consumers have the Cobb1
1
Douglas utility functions u1 (x11 , x21 ) = x11 x21
and u2 (x21 , x22 ) = x21 x22
.
Consumer i0 s endowments are (1i , 2i ) > 0. Solve for the equilibrium price
ratio and allocation. How do these change with a small change in 11 ?
Problem 2
Give the mathematical formula for two preferences which lead to an Edgeworth box in which prices are independent of the initial allocation of endowments amongst the agents. Prove that in this case the prices are independent
of the initial allocation. Assume that the total endowments of the two goods
are equal.
Problem 3
Construct, and give the mathematical formulae for, an Edgeworth box
in which there is more than one competitive equilibrium from some initial
allocations. Hint: you might work with linear preferences.
86
19
We now know something about the welfare properties of a general equilibrium. But we dont actually know if such an equilibrium exists. This is not
a trivial question - it is easy to construct examples of economies where there
is clearly no price vector at which all markets clear simultaneously. So this
question needs some work. The main concept in working on this is the excess
demand function. Recall definition 39: a competitive equilibrium is a set of
consumption plans xj , production plans yi and prices p such that
1. i, yi max p .y, y Yi
2. j, xj max uj (xj ) , p .xj Wj
P P
P
3.
j xj =
j wj +
i yi
So firms are maximizing profits, consumers utility, and demand and supply
balance. Now define for any price vector z (p), the excess demand associated
with that price:
X
X
X
z (p) =
xj (p)
wj
yi (p)
where
yi (p) = ArgM ax p.y, y Yi , xj = argmax Uj (x) , p.x p.wj +
P
i ji i . So z (p) is the difference between demand and supply at prices p,
and for an equilibrium we need this to be non-positive for all goods. It can
be negative - supply greater than demand - for goods whose price is zero. So
the question now is: does there exist a price p such that z (p ) 5 0?
Note that z (p) is a map from prices to commodity
space. Prices
P can be
N
N
N
considered as points in the simplex in R , S = p R : pl 0, l pl = 1 .
We know that demand and supply functions are homogeneous of degree zero
in prices so we can always scale prices to be in the simplex without changing
excess demand.
Next we modify z (p) to z 0 (p) which is a map from the simplex to itself.
Define the following function z + (p) on S N : zl+ (p) = M ax {zl (p) , 0}. Note
that z + is continuous and that
X
z + (p) .z (p) =
M ax {zl , 0} zl = 0 z (p) 5 0
l
Now construct
a (p) =
X
pl + zl+ (p)
l
87
f (p) =
1
+
p
+
z
(p)
.z (p ) =
a (p )
1
1
.z
(p
)
+
z
(p)
.z
(p
)
=
z + (p ) .z (p )
a (p )
a (p )
Therefore z + (p ) .z (p ) = 0, which from above means that z (p ) 5 0 as
required. So we have proved that there is a price at which all markets clear,
provided that all demand and supply functions are continuous functions,
which requires that all production sets be strictly convex and all utilities be
strictly quasi concave. We have:
Proposition 38. If all production sets in the economy are strictly convex
and all utilities strictly quasi-concave then there exists a price vector p at
which z (p ) 5 0, that is, at which all markets clear.
Now we move on to consider some situations where competitive equilibria
are not efficient.
20
Public Goods
First a result on characterizing Pareto efficient allocations. The basic proposition is that we can characterize PE allocations as allocations that maximize
a weighted sum of utilities.
Proposition 39. Suppose that the consumption vectors xj maximize the
P
P
P
P
weighted utility sum j j Uj (xj ) ,
where j
j xj
i Yi +
j wj
88
Proof. Suppose the xj are not PE. Then there exists an alternative set
P
P 0
P
0
of consumption vectors xj which are feasible ( j xj
Yi + j w j )
i
0
and such that Uj xj Uj xj j & j : Uj xj > Uj xj . In this case
P
P
0
j U j xj >
j Uj xj a contradiction.
A public good is one that, if provided for one person, is provided for
all, or for local public goods, for all in a group or location. The traditional
textbook examples are law and order, public health, and defense. A more
contemporary examples is air quality, which if improved for one person in
a region is necessarily improved for all. Public goods are said to be nonexcludable and non-rivalrous.
The traditional market mechanism does not work well for public goods,
as the provider cannot ensure that everyone who benefits from these goods,
pays for them. For example, a group in New York city may decide to incur
costs to make the air in NYC cleaner and less dangerous, and ask people to
pay for this. But they have no way of ensuring that every who benefits, pays.
People can free ride, enjoy the benefits without paying. This is why we refer
to public goods, reflecting the fact that these are normally provided by the
government, which has the ability to force people to pay via taxes.
Let cj R be consumption of a normal, private, good, and g R be
the level of provision of a public good. For each individual j utility depends
on the consumption of both: Uj (cj , g) where g being the consumption of the
public good is the same for all, but people can of course choose different levels
of the private goods. We assume Uj to be concave. We suppose that each
person has a budget that can be divided between the regular consumption
goods and contributing to the provision of the public good. The total amount
of the public good provided
P is a function of the total amount contributed by
0
all individuals: g = f
j gj where gj is j s contribution to the public good
and f is concave. Hence the individual optimization problem is (setting the
price of the private good equal to one)
!
X
M axcj ,gj Uj (cj , g) , g = f
gk , cj = Wj gj
k
Uj /cj
Uj
Uj 1
Uj /g
1
= f 0 or
=
or
= 0
0
Uj /g
g
cj,l f
Uj /cj
f
This last expression says that the marginal rate of substitution between the
public and private good should equal the marginal rate of transformation
between them.
Now look at the socially efficient allocation between public and private
goods. Consider the problem
!
X
X
X
X
X
M ax
Uj (cj , g) , g = f
gk ,
cj =
Wj
gk
j
which leads to a Pareto efficient allocation with the public good. The Lagrangean is
)
! !
(
X
X
X
X
X
+
gk
gj
gk
Wj
L=
Uj cj , f
j
Uj 1
Uj
Uj 1 X Ul
or
that
=
cj f 0
g
cj f 0
g
l6=j
X Uj /g
1
= 0
Uj /cj
f
j
90
This is known as the Bowen-Lindahl-Samuelson formula for the optimal provision of a public good. Note the difference between the first order condition
U /g
that the individual chooses Ujj/cj = f10 and that which is Pareto efficient
P Uj /gj
1
j Uj /cj = f 0 . Individual choices do not lead to an efficient outcome in this
case. In fact is is easy to see that individual choices under-provide
P Uj /gj the public
U /g
good relative to a Pareto efficient outcome, for clearly j Uj /cj > Ujj/cj
and if f is strictly concave then this implies that the optimal level of provision
is greater than the private level.
Now lets look into having a market for the public good in which different
people pay different prices and the good is provided by a profit-maximizing
firm. So person j pays price pj for the public good (the price of the private
good is one), and everyone pays the provider of the public good, the production of which uses as an input the private good. So the individual problem
is
M axcj ,g Uj (cj , g) , Wj cj pj g = 0
and the first order conditions are
Uj
Uj /g
Uj
= pj
= j ,
= j pj , so
cj
g
Uj /cj
and the problem for the firm producing the public good is
X
M ax g
pk z where g = f (z)
k
pk = 1 or
pk =
1
f0
the amount of the private good allocated in total to the production of the
public good.
A Pareto efficient allocation is the solution to
X
X
M ax
{log (g) + log (cj )} , g +
cj = W
j
The Lagrangean is
(
L=
cj
= ,
=
g
cj
W
,
J (1 + )
g=
W
1+
g = zj +
zk ,
cj =
k6=j
W
zj
J
k6=j
zk
W
J
1
zj
X
W
= zj (1 + ) +
zk
J
k6=j
or
W = (J + )
k6=j
so that
W
W
, cj =
J +
J +
which is the same as the efficient allocation only if J = 1.
g=
92
X
j
zj
21
External Effects
aj Uj (cj , cj ) +
p.cj
Wj
1 X Uk
1 Uj
aj
ak
= pl
cj,l
k6=j cj,l
Noting that if the external effects are harmful the terms Uk /cj,l are negative, and we can think of the second term on the RHS as a tax, which corrects
for the external costs by adding them to the market prices of goods.
The private optimum is the solution to
M ax Uj (cj , cj ) p.cj = Wj
and clearly the FOCs are
1 Uj
= pl
j cj,l
P
Uk
which differ from the social optimum by the term 1 k6=j ak c
. Hence if
j,l
this amount is added to the price pl for individual j, the private and social
FOCs will coincide. Note that the tax is in principle person-specific. These
taxes are known as Pigovian taxes after Arthur Pigou. Pollution taxes
(carbon taxes) are examples of Pigovian taxes, and cap and trade systems
are also systems for adding external costs to the prices that agents face.
93
22
Common property resources are resources to which all have equal access. A
classic example is fisheries, though ground water is also a common property resource. Assume that the total production from a common property
resources Y is a function
PF of the total inputs applied to the resource X,
Y = F (X), where X = i xi , and xi is the input applied by person i. So Y
could be the catch from a fishery and xi the number of vessel-hours applied
to the fishery by agent i. Alternatively Y could be the water withdrawn from
an aquifer and xi the capacity of the wells drilled by person i.
We assume F 0 > 0 and F 00 < 0. These imply that k 0 : LimX F 0 (X) =
k, & LimX F 00 (X) = 0.
We assume further that person i0 s output is
y i = xi
F (X)
X
which means that she gets as her output a share of total equal to the share of
inputs that she provides. Another way of thinking of this is that F (X) /X
is the average product of the input, and each agent gets the average product
times the amount of input she provides. We can write this as
y i = xi
F (xi + xi )
xi + xi
where xi is the vector of inputs provided by all agents other than i. If the
cost of input is p then each agent seeks to maximize
i = xi
F (xi + xi )
pxi
xi + xi
If she takes X as given - as would make sense if there are many agents, each
small with respect to the total, the FOCs are
F (X)
=p
X
so that average product equals price.
Now look at the Pareto efficient outcome. We seek to maximize F (X)
pX and the FOC is
F
=p
X
94
or marginal product equals price. As F (X) /X > F 0 (X), we see that the
resource is over-used under a competitive regime. Note that under the assumptions specified on F , it is the case that
F (X)
0
=0
LimX F (X)
X
so that with infinitely many agents the two outcomes are the same. They
are also the same when there is only one agent.
23
,
x
and
a
price
vector
p
and
wealth
levels
W
,
W
=
p
.
y
j
j
j
i
j wj +
P
i p .yi , such that
1. For each firm i the first order conditions for profit maximization are
satisfied, i.e. p = i Fi (yi ) , for some i > 0.
2. For each consumer j , xj maximizes Uj (xj ) subject to p .xj Wj
P P
P
3.
x
=
w
+
j
j
j
j
i yi
So this is a competitive equilibrium except that firms are not necessarily
maximizing profits, but they are satisfying the FOCs for profit maximization.
Profits may be negative, so that they could be increased by closing down. So
any CE is a MCPE, but the converse is not true.
Note that a MCPE satisfies the FOCs for PE: all marginal rates of substitution and transformation are equal. We can see that this is necessary for
95
The Lagrangian is
L=
X
j
j Uj (xj ) +
(
X
)
xj
wj
yi
+ i Fi (yi )
96
24
i ji i
= Wj
known, agents can trade on risk-free spot markets (normal markets) using
the money obtained from the securities they purchased that pay off in that
state. Under certain conditions this structure can imitate the outcome that
would occur if we had a competitive equilibrium with the full N S markets.
But instead we have only N markets for goods and S markets for securities,
where S is the number of states, and generally N + S < N S.
To see this, assume that p RN S (there is only one time period now)
is the competitive equilibrium price vector with a full set of state-contingent
commodity markets. A typical element is pks , the price of good k in state
s. Let xjks be the amount of good k person j bought in state s at the
Her purchase of securities across all states s must satisfy the budget constraint
X
rjs = Wj
s
So if consumers anticipate the prices that will rule in spot markets once
uncertainty is resolved and believe these to satisfy pks qs = pks s S, k the
outcome will be an efficient competitive equilibrium. The conclusion: S securities markets and N goods markets can replace N S contingent commodity
markets if agents have price expectations that mimic the prices that would
have ruled on contingent commodity markets. (Arrow, The Role of Securities
in an Optimal Allocation of Risk-Bearing, Review of Economic Studies 1964,
first published in French in 1953.)
99
100