Optimal Taxation
Optimal Taxation
Roozbeh Hosseini
∗
In preparing these notes I have benefited from: Chari and Kehoe (1998), V.V. Chari and Larry Jone’s
first year Macro lectures notes and V.V. Chari, Narayana Kocherlakota and Mike Golosov’s lectures on
Public Economics. These notes are prepared for the Public Economics course at the W.P. Carey School
of Business, Arizona State University. They contain errors. Please use with caution and do not circulate.
YOU, the reader, are the only person responsible for the errors in the notes! Comments are welcome.
1
ECN 741: Public Economics Fall 2008
Contents
References 68
2
ECN 741: Public Economics Fall 2008
Consider an economy with n types of consumption good that are produced using labor input:
F (c1 + g1 , . . . , cn + gn , l) = 0 (1)
ci is private and gi is public consumption of good i and l is the labor input. F is a constant
return to scale technology. Consumers face the following maximization problem
max U (c1 , . . . , cn , l)
c1 ,...,cn ,l
subject to
n
X
pi (1 + τi )ci = l
i=1
n
X
max p i xi − l
x1 ,...,xn ,l
i=1
F (x1 , . . . , xn , l) = 0
n
X n
X
pi gi = p i τ i ci (2)
i=1 i=1
• prices: p = (p1 , . . . , pn )
• policy: π = (τ1 , . . . , τn )
such that
3
ECN 741: Public Economics Fall 2008
ci + gi = xi for i = 1, . . . , n (3)
Proposition 1 Any competitive equilibrium allocations must satisfy the resource feasibility
constraint
F (c1 + g1 , . . . , cn + gn , l) = 0 (4)
Furthermore, any allocations that satisfy (4) and (6) can be supported as a competitive
equilibrium for appropriately constructed polices and prices.
Proof.
Suppose (c, x, l) is a competitive equilibrium allocation. Then the following FOC must hold
Ui
= −(1 + τi )pi for i = 1, . . . , n
Ul
Replacing out for prices (and taxes) from FOC into budget constraint gives the imple-
mentability constraint. The feasibility follows by definition of equilibrium.
Now consider allocations (c, x, l) that are feasible (given vector of g) and satisfy (5). Con-
struct prices from the FOC of the firm
Fi
pi = − for i = 1, . . . , n
Fl
set policy as
Ui Fl
1 + τi = for i = 1, . . . , n
Ul Fi
4
ECN 741: Public Economics Fall 2008
You can verify that the policy and prices (as constructed above) together with the allocation
(c, x, l) is a competitive equilibrium.
We are interested in the problem of choosing the best policy π to maximize the welfare of
consumers. One restriction on such a problem is that the resulting allocation be a competitive
equilibrium allocation for each given policy. The timing is the following: First, government
chooses a policy, Second, private agents makes decision. We are interested in finding the
equilibrium of this game.
subject to
n
X n
X
pi gi = p i τ i ci
i=1 i=1
and (c(π ), x(π ), l(π )) together with p(π ) is a competitive equilibrium for every π 0 ∈ Π.
0 0 0 0
Suppose π, (c(·), x(·), l(·)) and p(·) is a Ramsey equilibrium. Then we call (c(π), x(π), l(π))
a Ramsey allocation.
Proof.
Follows from the definition of Ramsey allocation.
5
ECN 741: Public Economics Fall 2008
max U (c1 , c2 , l)
c1 ,c2 ,l
subject to
1. Implementability constraint
U1 c1 + U2 c2 + Ul l = 0 (6)
2. Feasibility
F (c1 + g1 , c2 + g2 , l) = 0 (7)
Fl
1 + λ − λHl = γ
Ul
Ui Fl
1 + τi =
Ul Fi
1 + λ − λHl
1 + τi =
1 + λ − λHi
There you go! If Hi > Hj , then it is optimal to tax good i more than good j.
The problem is that, it is not very helpful. Unfortunately, without imposing assumption on
U we cannot say much more. Next we consider some special (yet, interesting) cases.
6
ECN 741: Public Economics Fall 2008
then
Uii ci
Hi = −
Ui
Our goal to relate Hi to income elasticity of demand for good i. In order to do that, suppose
there is a non-wage income m, such that p1 c1 + p2 c2 = l + m. Consider FOC of consumer
(notice that I have ignored taxes for this part)
in which φ(p, m) is the lagrange multiplier on budget constrain. Let’s take derivative w.r.t
m
∂ci ∂φ Ui ∂φ
Uii = pi =
∂m ∂m φ ∂m
or
Uii ci m ∂ci m ∂φ
= .
Ui ci ∂m φ ∂m
m ∂ci
Let ηi = ci ∂m
. Then
m ∂φ 1
Hi = −
φ ∂m ηi
Therefore, Hi > Hj if and only if ηj > ηi . Combine this with the above and we get the
following:
Result 1 If preferences are additive separable, necessities should be taxed more than luxuries.
Consider the utility function in the previous section and assume that v(l) = l. Then there
is no income effect and using income elasticities for guiding us about optimal taxes is not
useful. However, we use price elasticities. Consider again the FOC of consumer
Ui (ci ) = pi φ
7
ECN 741: Public Economics Fall 2008
∂ci Ui
Uii =φ=
∂pi pi
and
1
Hi =
i
Result 2 If preferences are additive separable and quasi-linear, price-inelastic goods should
be taxed more.
Sandmo (1987) and Corlett and Hauge (1953-54) argue that goods that are more complement
with leisure should be taxed more heavily. The next example shows this
Example : U (c1 , c2 , l) = cα1 + cα2 (1 − l)β
One of the most useful and interesting result in optimal taxation is the uniform commodity
taxation result. Suppose the preferences are weakly separable in consumption and leisure
Proposition 3 Suppose preferences satisfy (8), then it is optimal to tax all goods at the
same rate, i.e. τi = τj for all i and j.
Proof.
Note that the fact that G(·) is homothetic implies that
Ui (αc, l) Ui (c, l)
=
Uj (αc, l) Uj (c, l)
or
Ui (c, l)
Ui (αc, l) = Uj (αc, l).
Uj (c, l)
8
ECN 741: Public Economics Fall 2008
u(c1 , . . . , ck , G(ck+1 , . . . , cn ), l)
in which, G(·) is homothetic. Then the result is that commodities (ck+1 , . . . , cn ) should be
taxed at uniform rate.
Exercise: Suppose consumer is endowed with y unit of good one that cannot be taxed
away. Does the uniform commodity taxation still hold? what if the utility function is
additive separable?
Exercise: Suppose government is restricted to setting taxed on c1 to zero. How would
modify the Ramsey problem? Does the uniform commodity taxation hold?
Another powerful and important result in Ramsey taxation is that intermediate good shall
not be taxed.
Suppose there are two sectors. One sector produces commodity x1 that is consumed by
private agent, c1 and by government, g. Commodity x1 is produced using intermediate good
z and labor l1 as input according to the following production function
f (x1 , z, l1 ) = 0.
The other sector, uses labor l2 as input to produce good x2 that can be used as input in
production of good x1 (that is z) or it can be consumed (c2 and g2 ). The technology is the
following
9
ECN 741: Public Economics Fall 2008
h(x2 , l2 ) = 0.
subject to
p1 (1 + τ1 )c1 + p2 (1 + τ2 )c2 ≤ l1 + l2 .
max p1 x − l1 − p2 (1 + τz )z
x1 ,z,l1
subject to
f (x1 , z, l1 ) = 0.
fz
= p2 (1 + τz ).
fl
subject to
h(x2 , l2 ) = 0.
hx fz
(1 + τz ) = − .
hl fl
τ1 p1 c1 + τ2 p2 c2 + τz p2 z = p1 g1 + p2 g2
10
ECN 741: Public Economics Fall 2008
c1 + g1 = x1
c2 + g 2 + z = x 2
f (x1 , z, l1 ) = 0
h(x2 , l2 ) = 0
subject to
U1 c1 + U2 c2 + Ul (l1 + l2 ) = 0 λ
f (c1 + g1 , z, l1 ) = 0 φ1
h(c2 + g2 + z, l2 ) = 0 φ2
FOC w.r.t z
φ1 fz = −φ2 hx
and therefore,
fl φ1 = hl φ2 .
11
ECN 741: Public Economics Fall 2008
∞
X
max β t U (ct , lt )
ct ,lt ,xt ,kt+1 ,bt+1
t=0
subject to
(1 + τct )ct + (1 + τxt )xt + bt+1 ≤ (1 − τlt )wt lt + (1 − τkt )rt kt + Rbt bt ; λt
kt+1 ≤ (1 − δ)kt + xt
−bt+1 ≤ M
k0 , b0 given
Government Budget:
Feasibility:
ct + gt + kt+1 = F (kt , lt ) + (1 − δ)kt (14)
12
ECN 741: Public Economics Fall 2008
rt = Fk (kt , lt ) (15)
wt = Fl (kt , lt )
A competitive equilibrium is: the sequence of allocations x = {ct , lt , bt+1 , kt+1 , xt }∞ t=0 , prices
∞ ∞
{rt , wt , Rbt }t=0 , policy π = {τct , τlt , τxt , τkt+1 }t=0 such that, the allocations solve consumer
problem, given prices and policy, prices are competitive, government budget holds and allo-
cations are feasible.
A Ramsey Equilibrium is a policy π, an allocation rule x(·) and price rules r(·), w(·) and
Rb (·) such that:
X∞
π ∈ arg max β t U (ct , lt )
t=0
Now multiply consumer’s budget constraint by λt and sum over t and use (16)-(17)
∞
X ∞
X
λt [(1 + τct )ct + (1 + τxt )(kt+1 − (1 − δ)kt ) + bt+1 ] = λt [(1 − τlt )wt lt + (1 − τkt )rt kt + Rbt bt ] .
t=0 t=0
∞
X
λt [(1 + τct )ct − (1 − τlt )wt lt ] = λ0 {[(1 + τx0 )(1 − δ) + (1 − τk0 )r0 ] k0 + Rb0 b0 } .
t=0
13
ECN 741: Public Economics Fall 2008
Proof.
Suppose x is the competitive equilibrium allocation, then following the steps outlines above
we can show that it should satisfy the implementability constraint (18). Now suppose an
allocation x∗ is feasible and satisfy (18) for some proof zero policies.
Note that in any competitive equilibrium, the bond holding must satisfy
∞
X [Ucs cs + Uls ls ]
bt+1 = β t−s − kt+1 (19)
s=t+1
Uct
∗
in other words, any sequence of c∗t , lt∗ and kt+1 uniquely identifies a sequence of bt that is a
part of competitive equilibrium. Candidate wage and rate of return on capital is given by
(15). Therefore, from the FOC (9)-(12) we have
1 − τlt U∗
= − ∗ lt ∗
1 + τct Flt Uct
∗ ∗
U Uct+1 ∗
(1 + τxt ) ct
= β (1 − τxt+1 )(1 − δ) + (1 − τkt+1 )Fkt+1 (20)
1 + τct 1 + τct+1
∗ ∗
Uct Uct+1
= β Rbt+1
1 + τct 1 + τct+1
any two of the four taxes can be chosen such that the above conditions hold.
14
ECN 741: Public Economics Fall 2008
subject to
∞
X
β t [Uct ct + Ult lt ] = U0 {[(1 + τx0 )(1 − δ) + (1 − τk0 )r0 ] k0 + Rb0 b0 } ;λ
t=0
subject to
ct + gt + kt+1 = F (kt , lt ) + (1 − δ)kt ; φt
Wlt
= −Flt (21)
Wct
Wct
= β(1 − δ + Fkt ) for t ≥ 1 (22)
Wct+1
Proposition 5 If the solution to the Ramsey problem converges to a steady state, then at
the steady state, the tax rate on capital income is zero.
Proof.
In (22) at the steady state we have
β(1 − δ + Fkt+1 ) = 1.
This implies that at the steady state there is no inter-temporal distortion. Compare with
(20) we have
(1 + τxt )(1 + τct+1 ) 1 − τkt+1
=β 1−δ+ Fkt+1
(1 + τct )(1 + τxt+1 ) 1 + τxt+1
15
ECN 741: Public Economics Fall 2008
Note that any feasible allocation that satisfies (18) can be implemented by two of the four
taxes (that is we only need two of the τc , τl ,τx and τk to implement the same allocations).
This in turn implies that
τkt = 0
1 + τct
= constant
1 + τxt
∞
X
β t Ucti cit + Ulti lit = U0i [(1 + τx0 )(1 − δ) + (1 − τk0 )r0 ] k0i + Rb0 bi0
(24)
t=0
Suppose government puts welfare weights ωi on consumers of type i. The Ramsey problem
is
∞
X ∞
X
t 1
max ω1 β U (c1t , l1t ) + ω2 β t U 2 (c2t , l2t )
t=0 t=0
16
ECN 741: Public Economics Fall 2008
∞
X
max β t W (c1t , c2t , l1t , l2t , λ1 , λ2 )
t=0
subject to
c1t + c2t + kt+1 = F (kt , l1t , l2t ) + (1 − δ)kt ; φt
Suppose consumer of type 1 does not hold any asset and cannot save, borrow or invest. We
call these ’Worker’. Also, assume that all the capital is held by consumer 2 who do not
supply any labor. We call these ’Capitalists’. The implementability constraint for ’Worker’
is
Uct1 c1t + Ult1 l1t = 0 ∀ t
∞
X
β t Uct2 c2t = U02 [(1 + τx0 )(1 − δ) + (1 − τk0 )r0 ] k02 + Rb0 b20
(25)
t=0
Suppose the welfare weight on ’Worker’ utility is 1 and on ’Capitalist’ utility is zero.
∞
X
max β t U 1 (c1t , l1t )
t=0
subject to
Uct1 c1t + Ult1 l1t = 0 ∀ t
∞
X
β t Uct2 c2t = U02 [(1 + τx0 )(1 − δ) + (1 − τk0 )r0 ] k02 + Rb0 b20
(26)
t=0
17
ECN 741: Public Economics Fall 2008
Define
W (c1 , c2 , l1 , l2 , λ1 , λ2 ) = U 1 (c1 , l1 ) + λi Uci ci + Uli li
φt = φt+1 (1 − δ + Fkt+1 )
1 = β(1 − δ + Fkt+1 )
1. Tax on capital income has to be uniform across different types. Does the result hold
with this restriction? Under what assumptions?
2. Tax on labor income has to be uniform across different types. Does the result hold?
Under what assumptions?
3. Tax on capital income cannot be more than 100 percent. Does the result hold? Under
what assumptions?
Suppose we write the environment as in McGrattan and Prescott (2005) with corporate taxes
and dividend taxes. Consumers can trade share of corporations, st , at price vt . Let dt be
dividend and τdt be dividend tax. Consumers solve
∞
X
max β t U (ct , lt )
ct ,st+1 ,lt
t=0
subject to
∞
X ∞
X
pt [ct + vt (st+1 − st )] ≤ pt [(1 − τdt )dt st + (1 − τlt )wt lt ]
t=0 t=0
s0 = 1
18
ECN 741: Public Economics Fall 2008
FOC implies
Uct
= −(1 − τlt )wt
Ult
pt vt = pt+1 vt+1 + pt+1 (1 − τdt+1 )dt+1
There is a corporation that maximizes the present discounted value of owners’ dividends and
pays taxes τt on corporate income.
∞
X
max pt (1 − τdt )dt
t=0
subject to
dt = f (kt , lt ) − xt − wt lt − τt (f (kt , lt ) − δkt − wt lt )
kt+1 = (1 − δ)kt + xt
flt = wt
pt (1 − τdt )
= 1 − (1 − τt+1 )(fkt+1 − δ)
pt+1 (1 − τdt+1 )
19
ECN 741: Public Economics Fall 2008
c1−σ
U (c, l) = − v(l),
1−σ
Then Ramsey taxes on capital income is zero for t ≥ 2.
Proof.
Do it as an exercise.
Exercise: Can you establish any connection between this result and uniform commodity
taxation?
Aggregate state of economy is st ∈ S (finite set) and is publicly observable. Denote the
history of aggregate shocks by st = (s0 , . . . , st ). Probability of history st is Pr(st ).
Consumer problem
Individual of type i solves
X
max β t Pr(st )U i (ct (st ), y(st )) (28)
t,st
20
ECN 741: Public Economics Fall 2008
sub. to
X X
p(st ) c(st ) + k(st ) ≤ p(st ) wt (st )(1 − τ (st ))y(st ) + R(st )k(st−1 ) − T
t,st t,st
in which R(st ) = 1 + (1 − κ(st ))(rt (st ) − δ) and T = t,st p(st )T (st ) is present value of
P
lump-sum taxes. Note that there is heterogeneity in skills θi as well as initial capital holding
k i (s0 ).
Feasibility
Let L(st ) = π i y i (st ), C(st ) = π i ci (st ), K(st ) = π i k i (st ). Then feasibility is
P P P
i i i
Govern met
Government has exogenously given sequence of expenditure g(st ) to finance. It can levy
linear tax on capital income κ(st ). It can also levy the following tax on income
F irms
As usual the firm’s problem is static and implies marginal product pricing
21
ECN 741: Public Economics Fall 2008
Therefore, given the aggregate consumption and labor output (C(st ), L(st )), the assignment
of allocation of consumption and labor output {ci (st ), y i (st )} are efficient. In other words,
given any sequence of aggregate output (C(st ), L(st )), there are weights ϕ = ϕ1 , . . . , ϕN
such that i π i ϕi = 1 and {ci (st ), y i (st )} is the solution to
P
X
U m (C(st ), L(st ); ϕ) ≡ max
i i
π i ϕi U i (ci , y i )
{c ,y }
sub. to
X X
π i ci = C(st ), π i y i = L(st )
i i
therefore
(ci (st ), y i (st )) = hi (C, L; ϕ)
ULm (st )
= −w(st )(1 − τ (st )) (35)
UCm (st )
Ucm (st ) p(st )
= ∀ i, j
Ucm (s0 ) β t Pr(st )p(s0 )
t,st
22
ECN 741: Public Economics Fall 2008
Now we can replace individual i’s allocations in terms of aggregate allocations using (33)
and (34)
X
β t UCm (C(st ), L(st ); ϕ)hic (C(st ), L(st ); ϕ) =
(36)
t,st
+ULm (C(st ), L(st ); ϕ)hiy (C(st ), L(st ); ϕ) = Uci (C(s0 ), L(s0 ); ϕ) R0 k0i − T ∀i(37)
Note that (36) is expressed entirely in terms of aggregate allocations, weights ϕ and initial
endowments.
Proposition 7 Given initial wealth R0 k0i , an aggregate allocation {C(st ), L(st ), K(st )} can
be implemented in a competitive equilibrium if and only if
1. It is feasible
2. There exists weights ϕ and lump-sum T such that implementability constraint (36)
holds for all i = 1, . . . , N
Proof.
Any equilibrium allocation is feasible and we just showed that it satisfy (36) . Suppose there
is a feasible aggregate allocation that satisfies (36) for sum weights and lump-sum taxes.
Then individual allocations and prices can be constructed using (33) and (35). Then it is
immediate that (32) (consumer optimality) holds. The individual allocations constructed as
such are also feasible since they satisfy (36) .
A Panning Problem
sub. to
X
β t UCm (C(st ), L(st ); ϕ)hic (C(st ), L(st ); ϕ)
t,st
+ULm (C(st ), L(st ); ϕ)hiy (C(st ), L(st ); ϕ) = Uci (C(s0 ), L(s0 ); ϕ) R0 k0i − T ∀i ; µi π i
23
ECN 741: Public Economics Fall 2008
sub. to
WC (C(st ), L(st ); ϕ, µ, λ)
FL (K(st−1 ), L(st ), st , t) = −
WL (C(st ), L(st ); ϕ, µ, λ)
X
WC (C(st ), L(st ); ϕ, µ, λ) = β WC (C(st+1 ), L(st+1 ); ϕ, µ, λ)R∗ (st+1 )Pr(st+1 )
st+1 |st
Optimal Taxes
24
ECN 741: Public Economics Fall 2008
One way to get this is to set the capital income taxes such that
N
X
µi k0i π i = 0
i=1
Note that we have hic (C, L; ϕ) = ωci C and hiy (C, L; ϕ) = ωyi L, with
γ −1
1/σ
(ϕi ) (θi ) γ−1 (ϕi ) γ−1
ωci =P and ωyi =P γ −1
i 1/σ
i πi (ϕ ) i πi (θi ) γ−1 (ϕi ) γ−1
and therefore
γ γ
m c1−σ i
m (y/θ ) W c
1−σ i
W (y/θ )
U = Φm
u − Φv α and W = Φu − Φv α
1−σ γ 1−σ γ
in which Φm m W W
u , Φv , Φu and Φv are some constant. Note that this implies
Φm W
v Φu
τ ∗ (C, L) = 1 −
Φm W
u Φv
and therefore
R(st+1 ) = R∗ (st+1 )
which implies
κ(st ) = 0 for all t ≥ 1
This implies that the result for optimal taxes on capital income holds from date zero (not
just for t ≥ 1 as it was the case before). When k0i = k0 for all i, taxing initial capital is
25
ECN 741: Public Economics Fall 2008
like a lump-sum tax. But since lump-tax is allowed here, it is not necessary. However, when
individuals are heterogeneous in their initial wealth, then taxing wealth for redistribution is
desirable.
Example: Now consider the following preferences
y
U i (c, y) = α log(c) + (1 − α) log(1 − )
θi
ϕi
ωi = P i i
iπ ϕ
therefore,
X
U m (C, L; φ) = α log(C) + (1 − α) log(1 − L) + α log ω i + (1 − α) log ω i /θi .
i
(1 − α)
W (C, L) = ΦW W
U (α log(C) + (1 − α) log(1 − L)) + ΦUL
1−L
and therefore
1
τ ∗ (L) =
(1 − L)ΦW W
U /ΦUL +1
also
κ(st ) = 0 for all t ≥ 1
Here, I present a 2 period version of Erosa and Gervais (2002). Individuals live 2 periods
(born at age 0, die at age 1). Each generation is indexed by its date of birth. For example
in period t, the generations alive are t − 1, t. Assume no population growth.
Each individual is endowed with one unit of time at each age j and can transform one unit
of time into zj unit of efficient labor. Let ct,j be the consumption of generation t at age j.
Other variables follow the same notation.
26
ECN 741: Public Economics Fall 2008
subject to
c l
(1 − τt,0 )ct,0 + at,1 ≤ (1 − τt,0 )wt z0 lt,0
c l k
(1 − τt,1 )ct,1 ≤ (1 − τt,1 )wt z1 lt,1 + (1 + (1 − τt,1 )(rt − δ))at,1
rt = fk (kt , lt )
wt = fl (kt , lt )
∞ ∞
" #
X X X X
c l k
pt gt = pt τt−j,j ct−j,j + τt−j,j wt zj lt−j,j + τt−1,1 (rt − δ)at−1,1
t=0 t=0 j=0,1 j=0,1
Let U t = U (ct,0 , lt,0 ) + βU (ct,1 , lt,1 ) be the lifetime utility of generation t for a given se-
quence of consumption and leisure and let 0 < γ < 1 be government’s discount factor across
generations. Government objective is to maximize
∞
X
γ tU t
t=0
Uct,0 ct,0 + Ult,0 lt,0 + β Uct,1 ct,1 + Ult,1 lt,1 = 0 (38)
27
ECN 741: Public Economics Fall 2008
Exercise: Show that a feasible allocation is implementable if and only if it satisfy (38).
Ramsey problem
subject to
Uct,0 ct,0 + Ult,0 lt,0 + β Uct,1 ct,1 + Ult,1 lt,1 = 0 ; γ t λt
Steady State: In the steady state (ct,0 , ct,1 , lt,0 , lt,1 , at,1 ) = (c0 , c1 , l0 , l1 , a1 ) and λt = λ.
Therefore,
28
ECN 741: Public Economics Fall 2008
Note that this in general does not imply zero tax on capital. When profile of labor produc-
tivity, zj , is not flat over lifetime, in general consumption and leisure allocations over lifetime
is not flat.
Question: Intuitively, why is it optimal to distort inter-temporal decision in this environ-
ment?
We can impose assumptions on preferences (both for government and individuals) to arrive
at zero capital taxation result again.
c1−σ
u(c, l) = − v(l)
1−σ
Then the Ramsey problem prescribes no inter-temporal distortions for periods t ≥ 1, provided
that labor income taxes can be age-dependent.
Proof.
Note that equation (42) becomes
Uc0,t
= β (1 − δ + fkt+1 )
Uc1,t
Uc0,t
= β (1 + (1 − τt,1 )(fkt+1 − δ))
Uc1,t
29
ECN 741: Public Economics Fall 2008
One criticism of the Ramsey approach is the ad hoc assumption of linear distortionary taxes
exclusion of Lump-sum taxation. At the same time without imposing these restrictions or
without including informational frictions the Lump-taxes are very desirable in these models.
We saw Werning (2007) as an attempt to move away from this limitations and expand
the set of instruments available to government, i.e., Lump-sum taxes together with uniform
distortionary taxes across individuals. In doing that he appeals to informational friction, i.e.,
the fact that individuals type (skill) is not observable and hence cannot be taxes. But is it
the best government can do? Is it possible for government to implement more sophisticated
instruments and achieve “better” outcomes (lets agree for now that “better” means, higher
welfare given a welfare function)? What are these instruments? How we possibly restrict
the set instruments available to government?
In Mirrleesian approach set of instruments is pinned down by information/enforcement lim-
itation of the government. The optimal tax policy is found among those policies that are
incentive compatible, given the information/enforcement restrictions. In doing that we pro-
ceed in two steps
There are N agents, indexed by n = 1, . . . , N , who live T < ∞ period. The preferences are
T
X
β t−1 (u(ct ) − v(lt )) , 1 > β > 0, u0 , −u00 , v 0 , v 00 > 0
t=1
in which ct is consumption and is observed. lt is hours worked (or effort) and it is not
observed.
Let Θ be a finite set of skills or ability. Nature makes a draw θnT = (θn1 , . . . , θnt ) ∈ ΘT =
Θ × · · · × Θ for each agent n. The θnT draws are i.i.d across agents. Let π(·) be probability
density function over ΘT draws. Each agent privately learns θnt at the beginning of period
t. Individual who has skill θnt and works lnt hours in period t can produce ynt unit of output
according to
ynt = lnt · θnt
30
ECN 741: Public Economics Fall 2008
lnt and θnt are both private information but ynt is observable. In what follows we make a
change of variable lnt = yθnt
nt
.
N
cn : ΘT −→ RT+
N
yn : ΘT −→ RT+
T X
X N T X
X N
cnt (θ1T , . . . , θnT )R−t ≤ ynt (θ1T , . . . , θnT )R−t , R>1
t=1 n=1 t=1 n=1
So far, we know the information structure, we know what allocations are and we know what
allocations are feasible. But how does this guide towards a set of tax policies? In other
words the remaining question is, given the private information, what allocations can be
implemented?
31
ECN 741: Public Economics Fall 2008
∗
y
X gn (ρ, α−n (θ−n ))
αn∗ (θn ) ∈ arg max c ∗
π(θ−n ) u(gn (ρ, α−n (θ−n ))) − v
ρ∈An
θ−n
θn
So far we have made it clear what exactly do we mean by implementability. But is it helpful?
Notice that our setup so far does not impose any restriction on the type of games (mecha-
nisms) considered. Any equilibrium outcome of some game is implementable. Think for a
moment about the following problem: we want to find the best implementable allocation.
That means we need to search in the space of games, find a game that has a BNE that
implements that best allocation as its outcome. This is a very complicated problem.
Good news is that there is a very powerful result that allows us to restrict attention to a
very particular game without lose of generality. We need couple of more definitions.
Definition 8 A truth-telling BNE of a direct mechanism (Θ, g c , g y ) is αn∗ (θn ) = θn for all n
such that y
X
c gn (ρ, θ−n ))
θn ∈ arg max π(θ−n ) u(gn (ρ, θ−n )) − v
ρ∈Θ
θ
θn
−n
In a direct mechanism, players are basically asked to report their skill type. We are interested
in equilibria in which type is revealed truthfully. It turns out there is not loss of generality
in doing that.
32
ECN 741: Public Economics Fall 2008
Proof.
Suppose allocation (cn , yn )N c y
n=1 is implementable as outcome of some mechanism (A, g , g ).
We construct a truth-full mechanism (Θ, g̃ c , g̃ y ) as the following
∗
g̃ c (θ1 , . . . , θN ) = g c (α1∗ (θ1 ), . . . , αN (θN )), ∗
g̃ y (θ1 , . . . , θN ) = g y (α1∗ (θ1 ), . . . , αN (θN ))
In which {αn∗ }N c y
n=1 is the BNE of the mechanism (A, g , g ). We only need to show that
truth-telling is a BNE of (Θ, g̃ c , g̃ y ). Suppose not, i.e., suppose there is a type θn and a
report ρ ∈ Θ such that
y
X
c g̃n (ρ, θ−n ))
π(θ−n ) u(g̃n (ρ, θ−n )) − v >
θ−n
θ n
y
X
c g̃n (θn , θ−n ))
π(θ−n ) u(g̃n (θn , θ−n )) − v =
θ−n
θn
y ∗ ∗
X
c ∗ ∗ gn (αn (θn ), α−n (θ−n ))
π(θ−n ) u(gn (αn (θn ), α−n (θ−n ))) − v
θ
θn
−n
in which the last inequality follows from definition of (g̃ c , g̃ y ). This implies that there must
exist α = αn∗−1 (ρ) ∈ An such that
∗
gny (α, α−n
X
∗ (θ−n ))
π(θ−n ) u(gnc (α, α−n (θ−n ))) −v >
θ−n
θn
y ∗ ∗
X
c ∗ ∗ gn (αn (θn ), α−n (θ−n ))
π(θ−n ) u(gn (αn (θn ), α−n (θ−n ))) − v
θ−n
θn
cn = g̃ c (θ1 , . . . , θN ), yn = g̃ y (θ1 , . . . , θN )
Using Revelation Principle we can restrict attention to direct mechanism and allocations
that are truthfully revealing. This means that the set of implementable allocations are the
33
ECN 741: Public Economics Fall 2008
yn (θ0 , θ−n )
X
X yn (θn , θ−n ) 0
π(θ−n ) u(cn (θn , θ−n )) + v ≥ π(θ−n ) u(cn (θ , θ−n )) + v
θ−n
θn θ
θn
−n
∀n, θn , θ0 ∈ Θ.
We are going to primarily focus on environment with unit measure of agents.
Consider the same environment as before (for general T < ∞) except that now there are
unit mass of agents. Nature makes a draw θT = (θ1 , . . . , θt ) ∈ ΘT = Θ × · · · × Θ for each
agent. The θT draws are i.i.d across agents. Let π(·) is probability density function over ΘT
draws. There is no aggregate uncertainty, therefore π(θT ) is also the mass of people who
have the draw θT . Let D ≡ θT |π(θT ) > 0 .
Define allocation as θt − measurable functions
ct : D −→ RT+
yt : D −→ RT+
Allocation is feasible if
T
XX T
XX
−t T
R ct (θ )π(θ ) ≤ T
R−t yt (θT )π(θT )
Θ∈D t=1 Θ∈D t=1
QT
Define a mechanism as set of actions A ⊂ t=1 Xt and outcome functions
g : A × ∆(A) −→ R2T
+
34
ECN 741: Public Economics Fall 2008
T c ∗ T
gt (αt (θ ), µ∗ )
X
t−1 c ∗ T
T ∗
β π(θ ) u(gt (αt (θ ), µ )) − v ≥
t=1
θt
T c 0 T
gt (αt (θ ), µ∗ )
X
t−1 T c 0 T ∗
β π(θ ) u(gt (αt (θ ), µ )) − v
t=1
θt
for all α0 : D −→ A (αt0 is θt − measurable) and µ∗ (a) = {θT |α(θT )=a} π(θT ). An allocation
P
T
gty (θT , µ∗ )
X
β t−1 T
π(θ ) u(gtc (θT , µ∗ )) −v ≥
t=1
θt
T
gty (αt0 (θT ), µ∗ )
X
β t−1
π(θ ) T
u(gtc (αt0 (θT ), µ∗ )) −v
t=1
θt
T X T X
yt (θT ) yt (αt0 (θT ))
X X
T T T 0 T
π(θ ) u(ct (θ )) − v ≥ π(θ ) u(ct (αt (θ ))) − v
t=1 θT ∈D
θt t=1 θT ∈D
θt
35
ECN 741: Public Economics Fall 2008
Suppose T = 1 and there are only two types, θH and θL with θH > θL . Consider the
utilitarian planer’s problem
y(θH ) y(θL )
max π(θH ) u(c(θH )) − v + π(θL ) u(c(θL )) − v
θH θL
sub. to.
y(θH ) y(θL )
u(c(θH )) − v ≥ u(c(θL )) − v
θH θ
H
y(θL ) y(θH )
u(c(θL )) − v ≥ u(c(θH )) − v
θL θL
For a moment suppose there is no private information. Then the optimal allocation must
satisfy (note that v(·) is convex):
We will show that this allocation does not satisfy I.C. constraints. Note that y(θH ) > y(θL ),
therefore
y(θL ) y(θL ) y(θH )
u(c(θL )) − v = u(c(θH )) − v > u(c(θH )) − v
θH θH θH
the I.C. for type H is violated.
So we know that when individuals have private information about their type, at least of the
the I.C. constraints is binding at the optimal solution.
36
ECN 741: Public Economics Fall 2008
Consider a relaxed planning problem with only type H’s I.C. constraint
y(θH ) y(θL )
max π(θH ) u(c(θH )) − v + π(θL ) u(c(θL )) − v
θH θL
sub. to.
y(θH ) y(θL )
u(c(θH )) − v ≥ u(c(θL )) − v ; π(θH )µ
θH θH
we will characterize the solution to this problem and then we will verify that at the solution
the I.C. constraint of type L is slack.
(1 + µ)u0 (c(θH )) = λ
π(θH )
1−µ u0 (c(θL )) = λ
π(θL )
1 0 y(θH )
(1 + µ) v = λ
θH θH
1 0 y(θL ) π(θH ) 1 0 y(θL )
v −µ v = λ
θL θL π(θL ) θH θH
and therefore
y(θH ) > y(θL )
37
ECN 741: Public Economics Fall 2008
Next we check that under these allocations, the I.C. constraints for type L is slack.
Z y(θH ) Z y(θH )
y(θH ) y(θL ) y y
v −v = v dy > v dy
θL θL y(θL ) θL y(θL ) θH
y(θH ) y(θL )
= v −v
θH θH
= u(c(θH )) − u(c(θL ))
rearrange terms
y(θL ) y(θH )
u(c(θL )) − v > u(c(θH )) − v
θL θL
So, we know that our characterization is valid. Note that under this characterization
1 0 y(θL ) 0 π(θH ) 1 0 y(θL ) 0
v − u (c(θL )) = µ v − u (c(θL ))
θL θL π(θL ) θH θH
π(θH ) 1 0 y(θH ) 0
< µ v − u (c(θH ))
π(θL ) θH θH
< 0
and hence
1 0 y(θL )
v < u0 (c(θL ))
θL θL
there is distortion for the low type.
Suppose we want to implement this allocation with a nonlinear tax function T (y). Let
T (y) = y − c if y ∈ {y(θH ), y(θL )} and T (y) = y otherwise. Consider the consumer’s
problem
y
max u(c) − v
θ
sub. to.
c = y − T (y)
FOC
1 y
u0 (c)(1 − T 0 (y)) = v
θ θ
38
ECN 741: Public Economics Fall 2008
So far we have characterized the set of achievable allocations by any mechanism. The goal
of the planner is to find the best achievable allocation.
T
yt (θT )
X X
t−1 T T
β π(θ )ω(θ1 ) u(ct (θ )) − v (43)
t=1
θt
θT ∈D
sub. to.
T X T X
yt (θT ) yt (αt0 (θT ))
X X
T T T 0 T
π(θ ) u(ct (θ )) − v ≥ π(θ ) u(ct (αt (θ ))) − v
t=1 θT
θt t=1 T
θt
θ
T
XX T
XX
ct (θT )π(θT )/Rt−1 ≤ yt (θT )π(θT )/Rt−1
Θ∈D t=1 Θ∈D t=1
Suppose θt is public information. Then the planning problem is the same, except that there
will no incentive compatibility constraint. Let λ be multiplier on feasibility.
X X
π(θT )ω(θ1 )u0 (ct (θT ))β t−1 = λ/Rt−1 π(θT )
θT θT
39
ECN 741: Public Economics Fall 2008
Note that this is the FOC with respect to a ct (θT ) at a particular draw θT . But we know
that ct (θT ) is θt − measurable, therefore we don’t need to sum over all θT ∈ D, but only
those that contain the particular history θt
X X
π(θT )ω(θ1 )u0 (ct (θT ))β t−1 = λ/Rt−1 π(θT )
θT |θt θT |θt
by measurability of ct (θT ).
u0 (ct (θT ))β t−1 ω(θ1 ) = λ/Rt−1
Note: This implies that optimal ct (θT ) is actually θ1 -measurable, i.e., it is independent from
θt for t > 1. In other words there is full insurance.
Note: The following Euler equation hold
1
= λ−1 β t−1 Rt−1 ω(θ1 )
u0 (c T
t (θ ))
1 1 t
= E 0 |θ
βR u (ct+1 (θT ))
Consider again the original planning problem (43)(with incentive constraints). Let (c∗ , y ∗ )
be the solution to this problem. Now consider the following perturbation around (c∗ , y ∗ )
y0 = y∗
u(c0t (θT )) + βu(c0t+1 (θT )) = k + u(c∗t (θT )) + βu(c∗t+1 (θT )) for all θt+1 such that π(θt+1 |θt ) > 0
40
ECN 741: Public Economics Fall 2008
X X
π(θT ) c0t (θT ) + c0t+1 (θT )/R = π(θT ) c∗t (θT ) + c∗t+1 (θT )/R
θT |θt θT |θt
max k
k,c0t (θT ),c0t+1 (θT )
sub. to
u(c0t (θT )) + βu(c0t+1 (θT )) = k + u(c∗t (θT )) + βu(c∗t+1 (θT )) for all θt , θt+1
such that π(θt+1 |θt ) > 0
X X
π(θT ) c0t (θT ) + c0t+1 (θT )/R = π(θT ) c∗t (θT ) + c∗t+1 (θT )/R
θT |θt θT |θt
βR X π(θt+1 ) 1
∗
= ∗
(44)
u0 (ct (θT )) t+1 t
π(θt ) u0 (ct+1 (θT ))
θ |θ
1
Note that I am abusing notation here. π(θt ) means the probability of history θt .
41
ECN 741: Public Economics Fall 2008
X π(θt+1 )
u0 (ct (θT )) = βR t)
u0 (ct+1 (θT )) (45)
t+1 t
π(θ
θ |θ
1
u0 (ct (θT )) = βR P π(θt+1 ) 1
θt+1 |θt π(θt ) u0 (c∗t+1 (θT ))
1
> βR 1
π(θ t+1 ) 0 ∗
u (ct+1 (θT ))
P
θ t+1 |θ t π(θ t )
X π(θt+1 )
= βR t
u0 (c∗t+1 (θT ))
t+1 t
π(θ )
θ |θ
Note that at the efficient allocation the individuals are “saving constrained”. In other words,
if individuals can privately save, they will choose to do so and it is desirable for planer to
prevent them from doing that.
Another way of seeing this is the following: suppose (45) holds. Then we must have
βR X π(θt+1 ) 1
0 ∗ T
< 0 ∗
u (ct (θ )) t+1 t
π(θ ) u (ct+1 (θT ))
t
θ |θ
Now suppose the planer wants to increase utility at time t by and decrease it at time t + 1
by β −1 . The cost of increase of utility in period t is u0 (ct (θT ))/ . On the other hand planer
hands in u0 (ct+1 (θT ))/ less at each θt+1 that follows θt . Therefore it can free up resources.
42
ECN 741: Public Economics Fall 2008
On dynamics of consumption
3. Inequality is constant
Now consider private information optimal allocations. Assume θt is i.i.d. Consider two
different history θt and θ̄t
X π(θt+1 )
u0 (ct (θT |θt )) = βR t)
u0 (ct+1 (θT |θt ))
t+1 t
π(θ
θ |θ
X π(θt+1 )
u0 (ct (θT |θ̄t )) = βR t)
u0 (ct+1 (θT |θ̄t ))
t+1 t
π(θ
θ |θ
note that π(θT |θ̄t ) = π(θT |θt ). Now suppose u0 (ct (θT |θt )) > u0 (ct (θT |θ̄t )), then there exist a
history θt+1 such that π(θt+1 |θt ) = π(θt+1 |θ̄t ) and
43
ECN 741: Public Economics Fall 2008
1 t
If V ar u0 (ct+1 (θT ))
|θ > 0 for some θt , then
1 1
V ar 0
< V ar
u (ct (θT )) u0 (ct+1 (θT ))
and therefore
V ar ct (θT ) < V ar ct+1 (θT )
a.s.
lim xt = x∞ < ∞
t→∞
This part is mostly based on Farhi and Werning (2006, 2007, 2005), PHELAN (2006) and
Atkeson and Lucas (1992).
In what follows we maintain the following assumptions:
• T =∞
• Θ = {θH , θL }
Very important note: In what follows I skip some detailed steps. Most of the arguments
are heuristic and have loose ends. For more rigorous proofs please look at the references
above.
44
ECN 741: Public Economics Fall 2008
Immiseration result
T
yt (θt )
X X
t−1 t t
w0 = max β π(θ ) u(ct (θ )) − v (46)
t=1
θt
θT ∈D
sub. to.
T X T X
yt (θt ) yt (αt0 (θt ))
X X
t t
π(θ ) u(ct (θ )) − v ≥ t
π(θ ) u(ct (αt0 (θt ))) −v
θt θt
t=1 θt t=1 θt
T
XX
π(θt ) ct (θt ) − yt (θt ) /Rt−1 ≤ 0
θt t=1
The solution to this problem must also be the solution to the following dual problem
T
XX
π(θt ) ct (θt ) − yt (θt ) /Rt−1
K(w0 ) = min (47)
θt t=1
sub. to
T X T X
yt (θt ) yt (αt0 (θt ))
X X
t t
π(θ ) u(ct (θ )) − v ≥ t
π(θ ) u(ct (αt0 (θt ))) −v
θt θt
t=1 θt t=1 θt
T
yt (θt )
X X
t−1 t t
β π(θ ) u(ct (θ )) − v ≥ w0
t=1
θt
θT ∈D
45
ECN 741: Public Economics Fall 2008
sub. to
y(θ0 , w)
y(θ, w) 0 0
u(c(θ, w)) − v + βw (θ, w) ≥ u(c(θ , w)) − v + βw0 (θ0 , w) ∀θ, θ0
θ θ
X y(θ, w) 0
π(θ) u(c(θ, w)) − v + βw (θ, w) ≥ w
θ
θ
Let µ(θ, θ0 ) be multiplier on incentive constraint and φ the multiplier on promise keeping.
First order condition with respect to c(θ, U ) is
X X
u0 (c(θ, w)) µ(θ, θ0 ) − µ(θ0 , θ) + π(θ)φ = π(θ)
θ0 θ0
and therefore
1
K 0 (w0 (θ, w)) =
u0 (c(θ, w))
Lemma 1 Given any w ∈ [w, w̄], if w0 (θ, w) = w0 (θ0 , w) for some θ, θ0 ∈ Θ, then c(θ, w) =
c0 (θ, w)
46
ECN 741: Public Economics Fall 2008
K 0 (w) = φ
therefore
X
K 0 (w) = π(θ)K 0 (w0 (θ, w))
θ
wt+1 = w0 (θt , wt )
then
K 0 (wt ) = Et [K 0 (wt+1 )]
and therefore
c(θ, w∞ ) = c(θ0 , w∞ ) ∀θ, θ0
but we know from our two type static example that the planer can do better by differentiating
various θ types. Therefore, this is a contradiction. Hence K 0 (w∞ ) = 0 and w∞ = w.
No Immiseration result
Consider the following planning problem in which planer values future consumption more
than agent (β̂ > β)
T
yt (θt )
X X
t−1 t t
w0 = max β̂ π(θ ) u(ct (θ )) − v̂ (49)
t=1
θt
θT ∈D
47
ECN 741: Public Economics Fall 2008
sub. to.
T X T X
yt (θt ) yt (αt0 (θt ))
X X
t t
π(θ ) u(ct (θ )) − v̂ ≥ t
π(θ ) u(ct (αt0 (θt ))) − v̂
θt θt
t=1 θt t=1 θt
T
XX
π(θt ) ct (θt ) − yt (θt ) /Rt−1 ≤ 0
θt t=1
assume the following (in addition to the assumption mentioned at the beginning of the
discussion)
• β̂R = 1
y
v(y)
• v̂ θ
= θ
Suppose the above problem has a solution and let λ̂be the multiplier on resources constraint.
Then the solution to the above problem must also solve the following
T
v(yt (θt ))
X X
t−1 t t t t
P (w0 ) = max β̂ π(θ ) u(ct (θ )) − − λ̂ct (θ ) + λ̂yt (θ ) (50)
t=1
θt
θT ∈D
sub. to.
T X T X
v(yt (θt )) v(yt (αt0 (θt )))
X X
t t t 0 t
π(θ ) u(ct (θ )) − ≥ π(θ ) u(ct (αt (θ ))) −
θt θt
t=1 θt t=1 t θ
T
v(yt (θt ))
X X
t−1 t t
β π(θ ) u(ct (θ )) − ≥ w0
t=1
θt
θT ∈D
Again, we can show that the solution to the above problem also solves the following Bellman
equation (after any history)
48
ECN 741: Public Economics Fall 2008
X v(y(θ, w)) 0
P (w) = max0 π(θ) u(c(θ, w)) − − λ̂c(θ, w) + λ̂y(θ, w) + β̂P (w (θ, w)) (51)
c,y,w
θ
θ
sub. to
We want to show that in this problem in the long-run the promised utility cannot be at
misery (w∞ > −∞ ). We use two lemmas to show this
Lemma 2 The value function P (w) is strictly concave and continuously differentiable on
(−∞, w̄). Moreover,
and
lim P 0 (w) = 1
v→−∞
Proof.
We are not going to prove concavity and differentiability. We take them as given.
Warning: This proof is not complete! There are some steps that needs to be filled in or
reformulated. I present it to provide the core idea of the proof as I understand it.
Warning:The proof that I presented in class is wrong! The mistake is in constructing the
upper and lower bound value function (PF I and Pm ). In particular, I mentioned in class
that Pi (w) = w − Ki (w) (for i = F I, m, etc), in which Ki (w) is the cost of delivering utility
w. This is correct only if β̂ = β. When β̂ > β individual utilities are discounted at different
rate in the planner’s objective and expected present discounted value of an allocation in the
planner’s objective is not the same as the one promised to the agent.
Correct proof has the same idea but those functions are constructed correctly.
Define value function PF I (w) as
T
v(yt (θt ))
X X
t−1 t t t t
PF I (w) = max β̂ π(θ ) u(ct (θ )) − − λ̂ct (θ ) + λ̂yt (θ )
t=1
θt
θT ∈D
49
ECN 741: Public Economics Fall 2008
sub. to.
T
v(yt (θt ))
X X
t−1 t t
β π(θ ) u(ct (θ )) − ≥w
t=1
θt
θT ∈D
PF I (w) is the value to the planner from delivering utility w to individual, if we ignore
incentive constraint. Note that PF I (w) > P (w). Also, PF I (w) is strictly concave and
differentiable.
Next consider the following maximization problem
v(y)
m = max u(c) − − λ̂c + λ̂y
c,y,θ θ
The above problem has a solution (u0 (c) = λ̂,v 0 (y) = λ̂θ, and θ belongs to a compact set).
Next, note that
T
v(yt (θt ))
X X
t−1 t t t t
PF I (w) = β̂ π(θ ) u(ct (θ )) − − λ̂ct (θ ) + λ̂yt (θ )
t=1 T
θt
θ ∈D
T
v(yt (θt ))
X X
t−1 t t t t
= β̂ π(θ ) u(ct (θ )) − − λ̂ct (θ ) + λ̂yt (θ )
t=1
θ t
θT ∈D
T
v(yt (θt ))
X X
t−1 t t t t
+ β π(θ ) u(ct (θ )) − − λ̂ct (θ ) + λ̂yt (θ )
t=1 T
θt
θ ∈D
T
v(yt (θt ))
X X
t−1 t t t t
− β π(θ ) u(ct (θ )) − − λ̂ct (θ ) + λ̂yt (θ )
t=1 T
θt
θ ∈D
T
X X h i
= w+ β t−1 π(θt ) −λ̂ct (θt ) + λ̂yt (θt ) +
t=1 θT ∈D
T
v(yt (θt ))
X X
t−1 t−1 t t t t
(β̂ − β ) π(θ ) u(ct (θ )) − − λ̂ct (θ ) + λ̂yt (θ )
t=1
θt
θT ∈D
T
X
t−1
X
t
h
t t
i 1 1
≤ w+ β π(θ ) −λ̂ct (θ ) + λ̂yt (θ ) + m −
t=1 T 1 − β̂ 1−β
θ ∈D
1 1
≤ w − λ̂K̃(w) + m −
1 − β̂ 1 − β
in which
T
X X
β t−1 π(θt ) ct (θt ) − yt (θt )
K̃(w) = min
t=1 θT ∈D
50
ECN 741: Public Economics Fall 2008
sub. to.
T
v(yt (θt ))
X X
t−1 t t
β π(θ ) u(ct (θ )) − ≥w
t=1
θt
θT ∈D
Note that K̃(w) is strictly convex and differentiable and limw→−∞ K̃ 0 (w) = 0.
Let Pmax (w) = w − K̃(w) + m. Then Pmax (w) ≥ PF I (w) and both are strictly con-
cave. Also, limw→−∞ PF I (w) ≤ limw→−∞ Pmax (w) = −∞. Therefore, limw→−∞ PF0 I (w) ≤
0
limw→−∞ Pmax (w) = 1.
Also, Note that limw→−∞ P (w) ≤ limw→−∞ PF I (w) = −∞ and therefore (since both are
strictly concave)
lim P 0 (w) ≤ lim PF0 I (w) = 1.
w→−∞ w→−∞
lim P 0 (w) ≤ 1
w→−∞
Next, consider allocations (c(w0 , θt ), y(w0 , θt )) that solve the original problem. Suppose they
attain the value P (w0 ). Define new allocations (c̃(w, θt ), ỹ(w, θt )) for w ≤ w0 as
T
v(ỹ(w, θt ))
X X
t−1 t t t t
Pm (w) = β̂ π(θ ) u(c̃(w, θ )) − − λ̂c̃(w, θ ) + λ̂ỹ(w, θ )
t=1
θt
θT ∈D
X v(y(w0 , θ1 )) + w0 − w
= π(θ1 ) u(c(w0 , θ1 )) − − λ̂c(w0 , θ1 ) + λ̂ỹ(w0 , θ1 ) +
θ1
θ1
T
v(c(w0 , θt ))
X X
t−1 t t t t
β̂ π(θ ) u(c(w0 , θ )) − − λ̂c(w0 , θ ) + λ̂y(w0 , θ )
t=2
θt
θT ∈D
X
X w − w0 v(y(w0 , θ1 ))
= π(θ1 ) + λ̂ỹ(w, θ1 ) + π(θ1 ) u(c(w0 , θ1 )) − − λ̂c(w0 , θ1 )
θ1
θ1
θ1
θ1
T
v(c(w0 , θt ))
X X
t−1 t t t t
+ β̂ π(θ ) u(c(w0 , θ )) − − λ̂c(w0 , θ ) + λ̂y(w0 , θ )
t=2 T
θ t
θ ∈D
51
ECN 741: Public Economics Fall 2008
Note that Pm (w) is strictly concave, Pm (w) ≤ P (w). Also, note that only the first term
depends on w. Therefore
1
Pm0 (w) = 1 − λ̂E 0
v (v(y(w, θ0 )) + w0 − w)
In the next lemma we show that 1 − P (w0 (w, θ)) can be bounded above and bellow for all θ.
Let’s rewrite the problem (51) again
X v(y(θ, w)) 0
P (w) = max0 π(θ) u(c(θ, w)) − − λ̂c(θ, w) + λ̂y(θ, w) + β̂P (w (θ, w)) (52)
c,y,w
θ
θ
sub. to
Suppose only high type’s incentive constraint binds. Let µ and φ be multipliers on the IC
and promise keeping (and let π(θH ) = π)
FOC w.r.t c(w, θ)
1
π 1 − λ̂ 0 + φπ + µ = 0
u (c(w, θH ))
1
(1 − π) 1 − λ̂ 0 + φπ − µ = 0
u (c(w, θL ))
52
ECN 741: Public Economics Fall 2008
π 1 π µ
− + π λ̂ 0 −φ − = 0
θH v (y(w, θH )) θH θH
(1 − π) 1 (1 − π) µ
− + (1 − π)λ̂ 0 −φ + = 0
θL v (y(w, θL )) θL θH
β β
E [1 − P 0 (w0 (w, θ))] = (1 − P 0 (w)) + 1 −
β̂ β̂
and
1 1
λ̂E 0 = (1 + φ)E =1+φ
v (y(w, θ)) θ
Assume that we know y(w, θH ) > y(w, θL ) and w0 (w, θH ) > w0 (w, θL ). Now we can prove
the next lemma.
Proof.
From the FOC for y(w, θH ) we get
" #
λ̂ λ̂ µ
(1 + φ)θH = E 0 θH ≥ 0 =1+φ+
v (y(w, θ)) v (y(w, θ)) π
µ
from this we can get the following bound on π
µ
≤ (1 + φ)(θH − 1)
π
β β
P 0 (w0 (w, θH )) ≥ −φ θH − (θH − 1)
β̂ β̂
β β
= P 0 (w) θH − (θH − 1)
β̂ β̂
53
ECN 741: Public Economics Fall 2008
β β
1 − P 0 (w0 (w, θL )) < 1 − P 0 (w0 (w, θH )) ≤ (1 − P 0 (w)) θH + 1 −
β̂ β̂
µ θH
≤ (1 + φ)(1 − θL )
1−π θL
and
0 0 β θH β θH
P (w (w, θL ) ≤ −φ 1+ − θL + − θL
β̂ θL β̂ θL
0 β θH β θH
= P (w) 1+ − θL + − θL
β̂ θL β̂ θL
and hence
0 0 0 0 0 β θH β
1 − P (w (w, θH )) > 1 − P (w (w, θL )) ≥ (1 − P (w)) 1+ − θL + 1 −
β̂ θL β̂
β
lim P 0 (w0 (w, θ)) = <1
w→−∞ β̂
In this section we derive inverse Euler equation in an environment in which there is aggregate
risk. We need to introduce new notations. Let Θ be the space of individual shocks and Z
be the space of aggregate shock. The timing is the following
54
ECN 741: Public Economics Fall 2008
2. Nature draw individual shocks θT ∈ ΘT according to πθ (θT |z T ). These draws are i.i.d
across individuals conditional on z T .
By law of large number, given z T , the fraction of population with shocks θT is πθ (θT |z T ).
We impose the following restriction:
This assumption implies that conditional on z t , (θt+1 , . . . , θT ) and (zt+1 , . . . , zT ) are inde-
pendent.
Remarks: Note that in this setup we are not imposing any restriction on time series prop-
erties of θt and zt . However, our assumption implies that by observing history of private
shocks up to date t the individuals cannot infer anything about future aggregate shocks.
As before, we assume that agents learn zt and θt at the beginning of date t and θt is private
information.
There is an initial capital stock K̄1 in the economy.
55
ECN 741: Public Economics Fall 2008
T
yt (θT , z T )
X X X
T t−1 T T T T
πz (z ) β πθ (θ |z ) u(ct (θ , z )) − v ≥ (54)
t=1
θt
z T ∈Z T θT ∈ΘT
T
yt (αt0 (θT , z T ))
X X X
T
πz (z ) β t−1 T
πθ (θ |z ) T
u(ct (αt0 (θT , z T ))) −v
t=1
θt
z T ∈Z T θT ∈ΘT
Suppose (c∗ , y ∗ , K ∗ ) is the solution to the above problem. Fix a public history z̄ t . We perturb
the solution to (c0 , y ∗ , K 0 ) such that
X
u(c0t (θt , z̄ t )) = u(c∗t (θt , z̄ t )) + β πz (z̄ t , zt+1 )δt+1 (θt , z̄ t , zt+1 ) + γ ∀θt (55)
zt+1
u(c0t+1 (θt+1 , z̄ t , zt+1 )) = u(c∗t+1 (θt+1 , z̄ t , zt+1 )) − δt+1 (θt , z̄ t , zt+1 ) (56)
0
X X
π(θt |z̄ t )c0t (θt , z̄ t ) + Kt+1 (z̄ t ) ≤ π(θt |z̄ t )c∗t (θt , z̄ t ) + Kt+1
∗
(z̄ t ) (57)
θt θt
0 0 0
X
πθ (θt+1 |z̄ t , zt+1 )ct+1 (θt+1 , z̄ t , zt+1 ) − Kt+1 (z̄ t )(1 − δ) − F (Kt+1 (z̄ t ), Yt∗ (z̄ t , zt+1 ), z̄ t , zt+1 )(58)
θt+1
≤
X
πθ (θt+1 |z̄ t , zt+1 )c∗t+1 (θt+1 , z̄ t , zt+1 ) − Kt+1
∗ ∗
(z̄ t )(1 − δ) − F (Kt+1 (z̄ t ), Yt∗ (z̄ t , zt+1 ), z̄ t , zt+1 )
θt+1
The idea is that (c∗ , y ∗ , K ∗ ) must be the solution to the following maximization problem
(with γ = 0)
0= max 0 γ
0 0 δt+1 ,ct ,ct+1 ,γ,Kt+1
56
ECN 741: Public Economics Fall 2008
sub. to (55)-(58).
Let η(θt , z̄ t ),η(θt+1 , z̄ t , zt+1 ), µ(z̄ t ) and µ(z̄ t , zt+1 ) be multipliers on (55), (56), (57) and (58).
Write the FOCs
substitute η(θt ) and η(θt+1 , z̄ t , zt+1 ) from (59) and (60) into (61)
µ(z̄ t )πθ (θt |z̄ t ) X µ(z̄ t , zt+1 )πθ (θt+1 |z̄ t , zt+1 )
βπz (z̄ t , zt+1 ) =
u0 (c∗t (θt , z̄ t )) u0 (c∗t+1 (θt+1 , z̄ t , zt+1 ))
θt+1 ,zt+1 |θt ,z̄ t
Let
−1
t t+1 t
µ(z̄ , zt+1 ) β X πθ (θ |z̄ , zt+1 )
λ(z̄ t , zt+1 ) ≡ = 0 ∗ t t
t t
µ(z̄ )πz (z̄ , zt+1 ) u (ct (θ , z̄ )) u (c∗t+1 (θt+1 , z̄ t , zt+1 ))
0
θt+1 |z̄ t ,zt+1 ,θt
then
X
∗
1= πz (z̄ t , zt+1 )λ(z̄ t , zt+1 )(1 − δ + FK (Kt+1 (z̄ t ), Yt∗ (z̄ t , zt+1 ), z̄ t , zt+1 ))
zt+1
or
−1
t β 1 t+1 t
λ(z̄ , zt+1 ) = 0 ∗ t t E θ , z̄ , zt+1
u (ct (θ , z̄ )) u0 (c∗t+1 (θt+1 , z̄ t , zt+1 ))
57
ECN 741: Public Economics Fall 2008
∗
1 = E λ(z̄ t , zt+1 )(1 − δ + FK (Kt+1 (z̄ t ), Yt∗ (z̄ t , zt+1 ), z̄ t , zt+1 ))|z̄ t
and therefore
t+1 −1
β 1
λt+1 = 0 ∗ t E θ
u (ct (θ )) u0 (c∗t+1 (θt+1 ))
∗
1 = λt+1 (1 − δ + FK (Kt+1 , Yt∗ ))
and therefore ∗
, Yt∗ ))
β(1 − δ + FK (Kt+1
1 t+1
=E 0 ∗ θ
u0 (c∗t (θt )) u (ct+1 (θt+1 ))
and therefore (using Jensen’s inequality)
The additive separable preferences that we have assumed so far are not consistent with
balanced growth fact (except if u(c) = log(c)). However, the perturbation that we used
58
ECN 741: Public Economics Fall 2008
in deriving the inverse Euler equation depends crucially on additive separability. If the
preferences are of the form
c1−σ
u(c, l) = v(l)
1−σ
then the proof presented will not work. See Farhi and Werning (2008) for derivation of the
inter-temporal optimality condition for larger class of preferences (including those that are
consistent with balanced growth).
So far we have shown that efficient allocations must satisfy a condition like the following (if
there is no aggregate shock)
∗
, Yt∗ ))
β(1 − δ + FK (Kt+1
1 t+1
=E 0 ∗ θ .
u0 (c∗t (θt )) u (ct+1 (θt+1 ))
We also showed that this implies the following
u0 (c∗t (θt ))
1 − τt = < 1.
βE u0 (c∗t+1 (θt+1 ))(1 − δ + FK (Kt+1
∗
, Yt∗ )) θt+1
But does this mean that the efficient allocations can be implemented by a positive tax on
capital?
• T = 2.
• Θ = {0, 1}, π(θ1 = 1) = 1, π(θ2 = (1, 1)) = 0.5 and π(θ2 = (1, 0)) = 0.5. (Note that
this implies y2h = y2 ((1, 1)) = l2h (1, 1) and y2l = y2 ((1, 0)) = 0)
2
• u(c, l) = log(c) − l2 , β = 1.
59
ECN 741: Public Economics Fall 2008
• F (K, Y ) = RK + wY , δ = 1.
y12 2
y2h
max log(c1 ) − + 0.5 log(c2h ) − + 0.5 [log(c2l )]
c1 ,c2h ,c2l ,y1 ,y2h ,K2 2 2
sub. to
c1 + K2 = RK1 + wy1
0.5c2h + 0.5c2l = RK2 + 0.5wy2h
2
y2h
log(c2h ) − ≥ log(c2l )
2
c1 , c2h , c2l , y1 , y2h , K2 ≥ 0
∗2
y2h
log(c∗2h ) − = log(c∗2l )
2
Rc∗1 = 0.5c∗2h + 0.5c∗2l
w ∗
= y2h
c∗2h
w
= y1∗
c∗1
• Individuals have the same endowment of k1 = K1 and choose how much to work and
how much to save.
60
ECN 741: Public Economics Fall 2008
• Government makes transfers T2h and T2l to individual who produce y2h and zero output
in the second period. Transfers can be negative.
Competitive equilibrium is allocations (c1 , y1 , c2h , y2h , c2l , k2 ) and policy (τk , T2h , T2l ) such
that:
y12 2
y2h
max log(c1 ) − + 0.5 log(c2h ) − + 0.5 [log(c2l )]
c1 ,c2h ,c2l ,y1 ,y2h ,k2 2 2
sub. to
c1 + k2 = Rk1 + wy1
2. Markets clear
c1 + K2 = RK1 + wy1
0.5c2h + 0.5c2l = RK2 + 0.5wy2h
k1 = K1
k2 = K2
We want the tax system be such that y2h > 0. Then it is necessary that
2
y2h
log(c2h ) − ≥ log(c2l )
2
61
ECN 741: Public Economics Fall 2008
Let’s assume we have such a tax system. Then the consumer FOCs are
1 .5 .5
= R(1 − τk ) +
c1 c2h c2l
w
= y1
c1
w
= y2h
c2h
∗
Question: How do we pick taxes to implement efficient allocations (c∗1 , c∗2h , c∗2l , y1∗ , y2h , K2∗ )
in the equilibrium?
Why don’t we try the simplest way to do this. And choose the following taxes
and
This way the equilibrium allocations satisfy all of the optimality conditions listed above.
They are also feasible and hence should implement the efficient allocations.
What is missing?
The taxes above guarantee that the individual saves the correct amount “if” he plans to tell
the truth about his ability in the second period. Also, they guarantee that the individual
tells the truth “if” he saves the correct amount.
Question: What if the individual saves more than K2∗ and plan to lie about their ability if
they happen to be the able type?
We first show that this double deviation is feasible and improves individuals welfare. Consider
three different plan by the agent
62
ECN 741: Public Economics Fall 2008
PLAN 1:
PLAN 2:
Note first that, because incentive compatibility in planner’s problem binds, individual is
indifferent between PLAN 1 and PLAN 2. Also,
1 .5 .5 1
∗
< R(1 − τk ) ∗ + ∗ < R(1 − τk ) ∗
c1 c2h c2l c2l
PLAN 3:
• Save K2∗ + .
Therefore under the proposed tax system individuals prefer PLAN 3 to PLAN 1. Hence this
implementation fails. In other words the efficient allocation cannot be implemented by a
separable tax system in which capital taxes do not depend on individuals income ex-post.
63
ECN 741: Public Economics Fall 2008
We implement efficient allocations using arbitrary nonlinear taxes on labor income and linear
taxes on wealth. The description of envrionment is the following
• Single representative firm that owns the production technology and takes capital rents
rt and wages wt as given.
• Agents trade supply labor and capital to the firm in a sequence of competitive markets.
T
yt (θT , z T )
X X X
T t−1 T T T T
max πz (z ) β πθ (θ |z ) u(ct (θ , z )) − v
c,k,y
t=1
θt
z T ∈Z T θT ∈ΘT
subject to
Given tax system (ψ, τ ), an eqilibrium is allocations rules (c, k, y) and prices (r, w) such
that given prices and tx system, the allocation rules solves the individual problem, r and
w are such that rt (z T ) = Fkt (Kt (z T ), Yt (z T ), z T ) and wt (z T ) = Fyt (Kt (z T ), Yt (z T ), z T ), and
alloctions are feasible for all t and all z T :
64
ECN 741: Public Economics Fall 2008
Note that the tax system described only in terms of sequence of lebor supply (and not skill
shocks). However, households choose consumption as function of past and present skills (the
θ’s). We need to make sure consumption depends on past and present skills only trough past
and present labor supply. In other words, we need to make sure observing past and present
labor supply is enough to determine individual’s consumtion. For that we need to impose
an assumption:
Suppose (c∗ , y ∗ , K ∗ ) is a socially optimal allocation. Define
T
Assumption 2 There exist a sequence of functions c = cb∗t
b∗ , where cb∗t : DOMt → R+ ,
t=1
cb∗t is (y t , z t )−measurable, and
65
ECN 741: Public Economics Fall 2008
This assumption is satisfied in static case and in i.i.d case. See Kocherlakota (2005) for
further discussion and an example for the case in which this assumption is violated.2
This assumption simply says it is possible to implement consumption allocation as function
of labor supply rather than skill. In this implementation a person who acts like (θT , z T )
(have same labor supply as type (θT , z T )), will also sonsume the same allocation as type
(θT , z T ). Whether this is in fact efficient or not is something we need to show.
Given optimal allocation (c∗ , y ∗ , K ∗ ), there exists λ∗t+1 : Z T → R+ (z t+1 −measurable) such
that
t t+1 −1
∗ β 1
λt+1 = 0 ∗ E 0 ∗ θ ,z
u (ct ) u (ct+1 (θt+1 , z̄ t , zt+1 ))
∗
Define τt+1 : RT+ × Z T → R by
∗
Note that τt+1 is (y t+1 , z t+1 )−measurable.
Let
∗ ∗ T
cb∗t (y T , z T ) + kd
∗ T T T T b∗ T T
t+1 (y , z ) = (1 − τt+1 (y , z ))(1 − δ + M P Kt (z ))kt (y , z )
kb1∗ = K1∗
66
ECN 741: Public Economics Fall 2008
ψ ∗∗ (y T , z T ) ∀(y T , z T ) ∈ DOM
∗ t t+1
ψt+1 (y T , z T ) =
2y w (z T ) T T
∀(y , z ) ∈ / DOM
t t t+1
Proposition 11 Given prices (r∗ , w∗ ) and tax system (ψ ∗ , τ ∗ ), and given that the typical
agent chooses a budget feasible y 0 , his optimal choices of (c, k) are c0t (θT , z T ) = cb∗t (y 0 (θT , y T ), z T )
and kt0 (θT , z T ) = kbt∗ (y 0 (θT , y T ), z T )
67
ECN 741: Public Economics Fall 2008
References
Atkeson, A. and J. Lucas, Robert E (1992): “On Efficient Distribution with Private
Information,” Review of Economic Studies, 59, 427–53. 3.1.1, 3.1.2
Chari, V.V., C. L. and P. Kehoe (1994): “Optimal fiscal policy in a business cycle
model,” Journal of Political Economy, 102, 617–52. 2
Chari, V. and P. J. Kehoe (1998): “Optimal fiscal and monetary policy,” Minneapolis
Fed Staff Report, SR 251. ∗, 2
Diamond, P. and J. Mirrlees (1971): “Optimal taxation and public production I: pro-
duction efficiency,” American Economic Review, 61, 8–27. 1.4
Farhi, E. and I. Werning (2005): “Inequality, Social Discounting and Estate Taxation,”
. 3.1.2, 3.1.2
——— (2007): “Inequality and Social Discounting,” Journal of Political Economy, 115, 365–
402. 3.1.2, 3.1.2
——— (2008): “Optimal Savings Distortions with Recursive Preferences,” The Journal of
Monetary Economics, 55, 21–42. 3.1.1, 3.1.4
68
ECN 741: Public Economics Fall 2008
PHELAN, C. (2006): “Opportunity and Social Mobility,” Review of Economic Studies, 73,
487–504. 3.1.2
69