T08DynamicProgramming
T08DynamicProgramming
3. Euler equation
1 Markov property
Let Sjt denote the ex-dividend price of asset j at time t and xjt the dividend on this
asset. Then the net return on asset j equals
Sj,t+1 + xj,t+1
rj,t+1 = − 1,
Sjt
for j = 1, . . . , M . Let rf,t+1 denote the return from period t to t + 1 on the riskfree
asset. Note that rj is adapted but rf is predictable.
Definition (Markov Property). The distribution of returns rj,t+1 (∀j) at time t and the
value of the riskfree rate rf,t+1 depend only on a state vector with finitely many elements
Yt . Moreover the distribution of Yt+1 at time t depends only on Yt .
Markov property ⇒ for any function g, such that the expectation exists and s > t,
1
Jessica Wachter Class notes for Finance 911
Notation:
• ι = M × 1 vector of 1s
for t = 0, . . . , T − 1, and cT ≤ WT .
• Why?
By definition:
>
Wt = ct + αt+1 + θt+1 ι.
We also know:
>
Wt+1 = αt+1 (1 + rf ) + θt+1 (ι + rt+1 ).
2
Jessica Wachter Class notes for Finance 911
Substituting in for α:
> >
Wt+1 = (Wt − ct − θt+1 ι)(1 + rf,t+1 ) + θt+1 (ι + rt+1 ).
V (WT , YT , T ) = β T u(WT ).
subject to (1).
Theorem (Principle of optimality). Solving problem P is equivalent to solving P 0 .
That is,
XT
V (Wt , Yt , t) = max Et β s u(cs ).
cs ,θs+1 ,s≥t
s=t
V (WT , YT , T ) = β T u(WT ).
Consider
max β T u(cT ).
cT ≤WT
3
Jessica Wachter Class notes for Finance 911
Because u0 > 0, the solution is cT = WT . This proves the result for time T .
Now assume that the result holds for time t + 1 and show that it holds for t. By the
induction step, we know
T
X
V (Wt+1 , Yt+1 , t + 1) = Et+1 β s u(ĉs )
s=t+1
for some {ĉs , s > t} that is adapted and satisfies the budget constraint from time t + 1
onwards. It follows from the definition of V (Wt , Yt , t) that, for some ĉt
" " T # #
X
V (Wt , Yt , t) = β t u(ĉt ) + E Et+1 β s u(ĉs ) Yt ,
s=t+1
where we have substituted in for V (Wt+1 , Yt+1 , t + 1) inside the expectation. It follows
from the Markov property and the law of iterated expectations that
T
X
V (Wt , Yt , t) = Et β s u(ĉs ),
s=t
where the max is taken over all adapted consumption paths satisfying the budget
constraint from time t onwards. Note that V ∗ solves the problem “all at once.”
It follows from the definition of V ∗ that
V ≤ V ∗.
Let {c∗s , s = t, . . . T } and {θs∗ , s = t, . . . , T } be the solution to (2). Then wealth at time
t + 1 satisfies
∗
Wt+1 = (1 + rf )(Wt − c∗t ) + θt+1 ∗
(rt+1 − rf ).
4
Jessica Wachter Class notes for Finance 911
Add β t u(c∗t ) to both sides of this equation and take the expectation at time t (use the
law of iterated expectations). It follows that
∗
β t u(c∗t ) + Et [V (Wt+1 , Yt+1 , t + 1)] ≥ V ∗ .
Because V maximizes this quantity over all possibilities of c∗t and θt+1
∗
subject to budget
constraint, it follows that V is greater than the LHS, and therefore
V ≥ V ∗,
3 Euler equation
2. VW (Wt , Yt , t) = β t u0 (c∗t )
Proof. • Proof of 2: For simplicity, we will write ct rather than c∗t and θt rather
than θt∗ .
FOC using the Bellman equation:
– WRT ct :
t 0 ∂Wt+1
β u (ct ) + Et VW (t + 1) =0
∂ct
We are using the shorthand VW (t + 1) = VW (Wt+1 , Yt+1 , t + 1). Note the
application of the chain rule.
5
Jessica Wachter Class notes for Finance 911
– WRT θj,t+1
∂Wt+1
Et VW (t + 1) =0
∂θj,t+1
This equation holds for all j = 1, . . . , M .
∂θ>
∂Wt+1 ∂ct
= (1 + rf,t+1 ) 1 − + t+1 (rt+1 − rf,t+1 ).
∂Wt ∂Wt ∂Wt
Therefore,
t 0 ∂ct ∂ct
VW (t) = β u (ct ) + Et VW (t + 1)(1 + rf,t+1 ) 1 − +
∂Wt ∂Wt
>
∂θt+1
Et VW (t + 1) (rt+1 − rf,t+1 )
∂Wt
t 0 ∂ct t 0 ∂ct
= β u (ct ) + β u (ct ) 1 −
∂Wt ∂Wt
t 0
= β u (ct )
6
Jessica Wachter Class notes for Finance 911
Why?
u0 (ct ) = βEt [u0 (ct+1 )(1 + rf,t+1 )] + βEt [u0 (ct+1 )(rj,t+1 − rf,t+1 )].
What is the change in lifetime utility from shifting a small amount of consumption from
time t to time t + 1?
For small shifts away from the optimum, the change should be zero. Note that if we
reduce consumption at t, we could invest it in risky asset j and have that same amount,
multiplied by 1 + rj,t+1 . Let c∗ be the optimum and c be the new value. Then
∆ in lifetime utility = β t (u(ct )−u(c∗t ))+β t+1 Et u(c∗t+1 + (c∗t − ct )(1 + rj,t+1 )) − u(c∗t+1 )
∆ in lifetime utility ≈ β t u0 (c∗t )(ct − c∗t ) + β t+1 Et [u0 (c∗t+1 )(1 + rj,t+1 )](c∗t − ct )
7
Jessica Wachter Class notes for Finance 911
subject to
>
Wt+1 = (1 + rf,t+1 )(Wt − ct ) + θt+1 (rt+1 − rf,t+1 ι)
V (WT , YT , T ) = β T log WT .
We assume that the conjecture holds at t + 1 and show it holds at t. Set up the
following change of variables:
ct θt+1
ĉt = θ̂t+1 = .
Wt Wt
Define the normalized budget constraint:
Wt+1 >
= (1 + rf )(1 − ĉt ) + θ̂t+1 (rt+1 − rf ι).
Wt
Apply the Bellman equation and the induction step:
= β t + g(t + 1) log Wt +
t Wt+1
max β log ĉt + Et g(t + 1) log + k(Yt+1 , t + 1).
ct ,θt+1 Wt
Recursively define
g(t) = β t + g(t + 1)
and
t Wt+1
k(Yt , t) = max β log ĉt + Et g(t + 1) log + k(Yt+1 , t + 1)
ĉt ,θ̂t+1 Wt
8
Jessica Wachter Class notes for Finance 911
where the maximization is subject to the normalized budget constraint. Note that
k in the equation above is a function of Yt and t alone. This verifies the conjecture.
Moreover, it follows from g(T ) = β T and the recursive definition of g that
T
X
g(t) = β s.
s=t
• Now use the envelope condition to derive implications for the consumption policy:
βt g(t)
= ,
ct Wt
so
βt
ct = Wt
βt + · · · + βT
1−β
= Wt .
1 − β T −t+1
ct
Implication: Wt
is non-stochastic (does not depend on Yt ).
It may seem surprising that this does not depend on investment opportunities
(distribution of returns). But in general, we would expect improvements in
investment opportunities to go in two directions. If investment opportunities are
better, the agent might want to invest more to take advantage of them
(substitution effect). However, the agent is better off in general and can enjoy
some of that wealth now by consuming more (income effect). In the log utility
case, these cancel out exactly.
• Implications for portfolio choice: It follows from the Euler equation that
1 − β T −t 1
Et (rj,t+1 − rf ) = 0.
1 − β Wt+1
Therefore,
1
Et (rj,t+1 − rf ) = 0. (*)
Wt+1
Note that the above holds for all j and therefore constitutes M equations in M
unknowns.
9
Jessica Wachter Class notes for Finance 911
We will sometimes be interested in what happens when we take the horizon out to
infinity. We are then solving: "∞ #
X
max E β t u(ct )
θ,c
t=0
subject to
>
Wt+1 = (1 + rf,t+1 )(Wt − ct ) + θt+1 (rt+1 − rf,t+1 ) ∀t ≥ 0
We would like a condition to replace our previous boundary condition cT ≤ WT . Now
we need to be careful that the agent does not borrow more and more money (See
Duffie, chapter 4 for details).
In practice, the solution to these problems is often less complicated than for the finite
horizon problems. Rather than applying backward induction, we can search for a fixed
point. It is possible to show that solving this problem is equivalent to solving for:
10
Jessica Wachter Class notes for Finance 911
and that ∞
X
G(Wt , Yt ) = Et β s−t u(c∗s )
s=t
Note that this is also the limit of the value function (divided by β t ) as the horizon goes
to infinity.
Theorem. Assume that markets are dynamically complete. Consider a SME (securities
market equilibrium) with M risky securities with return vector rt and one riskless
security with return rf,t . Then ∃ a representative agent with utility function ut such that
Ct is a solution to " T #
X
max E ut (CtR )
θ,C R
t=1
subject to
>
Wt+1 = (1 + rf,t+1 )(Wt − CtR ) + θt+1 (rt+1 − rf,t+1 )
CTR ≤ WT
11