Infinite-Horizon Dynamic Programming
Tianxiao Zheng
SAIF
1. Introduction
Unlike the finite-horizon case, the infinite-horizon model has a stationary structure: both the one-period rewards and the stochastic kernel for the state process are time homogeneous.
Intuitively, we may view the infinite-horizon model as the limit of the finite-horizon model as the
time horizon goes to infinity. The difficulty of the infinite-horizon case is that there is no general
theory to guarantee the existence of a solution to the Bellman equation. For bounded rewards, we
can use the powerful Contraction Mapping Theorem to deal with this issue.
2. Principle of Optimality
The principle of optimality in the infinite horizon states that the continuation of an optimal plan, from any date and state it reaches, is itself optimal for the continuation problem.
1. It can be shown that the value function (which is independent of time) satisfies the Bellman equation (a functional equation)

   V(s) = max_{a ∈ Γ(s)} { u(s, a) + β ∫ V(s′) P(s, a; s′) ds′ },

   where s′ is the state variable in the next period and Γ(s) is the set of feasible actions a.
2. Verification theorem
Given any s0, for any feasible policy πt, we can use the Bellman equation to derive

V∗(st) ≥ u(st, πt) + β Et[V∗(st+1)],

where V∗ is the solution to the Bellman equation. Multiplying by β^t and rearranging yield

β^t V∗(st) − β^{t+1} Et[V∗(st+1)] ≥ β^t u(st, πt).
Taking expectations conditional on time 0 and summing over t = 0, 1, ..., n − 1, we obtain

V∗(s0) − β^n E0[V∗(sn)] ≥ E0 Σ_{t=0}^{n−1} β^t u(st, πt).
If the transversality condition (recall that in the finite horizon, V∗_T(sT) = uT(sT))

lim_{n→∞} E0[β^n V∗(sn)] = 0

holds, then letting n → ∞ gives V∗(s0) ≥ E0 Σ_{t=0}^{∞} β^t u(st, πt) for any feasible policy π. The inequality holds with equality if we replace π by π∗, the optimal policy generated from the Bellman equation. In that case, the right-hand side becomes V(s0), the value of the Markov decision problem, giving V∗(s0) = V(s0).
The result demonstrates that under the transversality condition the solution to the Bellman
equation gives the value function for the Markov decision problem. In addition, any plan
generated by the optimal policy correspondence from the dynamic programming problem is
optimal for the Markov decision problem.
We should emphasize that the transversality condition is a sufficient condition, but not a
necessary one. It is quite strong because it requires the limit to converge to zero for any
feasible policy. This condition is often violated in many applications with unbounded rewards.
However, if any feasible plan that violates the transversality condition is dominated by some feasible plan that satisfies this condition, then the solution to the Bellman equation is the value function and the associated policy function generates an optimal plan (see Stokey, Lucas, and Prescott, 1989).
3. Any optimal policy obtained from solving the Markov decision problem can be generated by
solving the Bellman equation.
As an example, consider a decision maker who holds an option to pay a fixed investment cost I and collect the current payoff zt by exercising it. The decision maker is risk neutral, so he maximizes his expected return. The discount factor is equal to the inverse of the gross interest rate, β = 1/R.
• Find the Bellman equation and solve the dynamic programming problem. For simplicity, we assume that zt is i.i.d. with cumulative distribution function F(z), z ∈ [0, B], B > I.
• Find the mean waiting period until the option is exercised.
The Bellman equation is V(z) = max{ z − I, βE[V(z′)] }. Note that βE[V(z′)] is a constant because zt is i.i.d. Therefore, if z − I > βE[V(z′)], the decision maker chooses to exercise the option, and waits otherwise. As a result,
V(z) = z − I    if z > z∗,
V(z) = z∗ − I   if z < z∗,

where the constant z∗ − I = βE[V(z′)] is the value of waiting and z∗ is the indifference threshold. Taking expectations, E[V(z′)] = (z∗ − I) + ∫_{z∗}^{B} (z − z∗) dF(z), and substituting into the indifference condition gives

z∗ − I = [β/(1 − β)] ∫_{z∗}^{B} (z − z∗) dF(z).
From this equation, we see that z∗ ∈ [I, B]. The decision maker will not exercise the option for zt ∈ (I, z∗) because there is an option value of waiting.
The probability of not exercising the option in any given period is λ = F(z∗). Consequently, the probability that the option is exercised after exactly j periods of waiting is λ^j (1 − λ). The mean waiting period is then

Σ_{j=0}^{∞} j λ^j (1 − λ) = (1 − λ) λ (d/dλ) Σ_{j=0}^{∞} λ^j = (1 − λ) λ / (1 − λ)^2 = λ/(1 − λ).
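As a quick numerical illustration (not part of the notes), the sketch below solves the threshold equation and computes the mean waiting period under the assumption that F is uniform on [0, B]; the values of I, B, and β are illustrative.

```python
# A minimal numerical sketch of the option-exercise example. The distribution of z is
# left general in the text; here we assume F is uniform on [0, B], and the values of
# I, B, and beta are illustrative.
from scipy.optimize import brentq

I, B, beta = 1.0, 2.0, 0.95             # investment cost, upper bound of z, discount factor

def gap(z_star):
    # z* - I  minus  beta/(1-beta) * integral_{z*}^{B} (z - z*) dF(z);
    # for F uniform on [0, B] the integral equals (B - z*)**2 / (2B).
    return (z_star - I) - beta / (1.0 - beta) * (B - z_star) ** 2 / (2.0 * B)

z_star = brentq(gap, I, B)              # the root lies in [I, B], as argued above
lam = z_star / B                        # probability of waiting each period, F(z*)
print(f"threshold z* = {z_star:.4f}")
print(f"mean waiting period = {lam / (1.0 - lam):.4f}")
```

With these illustrative numbers the threshold lies well above I, reflecting the option value of waiting.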
Define the Bellman operator T̂ by

(T̂ f)(s) = max_{a ∈ Γ(s)} { u(s, a) + β ∫ f(s′) P(s, a; s′) ds′ },

where f is a continuous function on S. Then the solution to the Bellman equation is a fixed point of T̂ in that T̂ V = V.
The set of bounded and continuous functions on the state space S, endowed with the sup norm, is a Banach space C(S). The operator T̂ is a contraction if (1) u is bounded and continuous; (2) Γ is nonempty, compact-valued, and continuous; (3) the stochastic kernel P(s, a; s′) is such that ∫ f(s′) P(s, a; s′) ds′ is continuous in (s, a) for any bounded and continuous function f; and (4) β ∈ (0, 1).
The contraction property of the Bellman operator T̂ gives the existence and uniqueness of the solution to the Bellman equation. It justifies the guess-and-verify method for finding the value function: as long as we find a solution, it is the solution. Below is a simple example.
max_{{ct}_{t=0}^{∞}}  E0 Σ_{t=0}^{∞} β^t log(ct)

subject to ct + Kt+1 = zt Kt^α, where zt follows a Markov process with transition equation log(zt+1) = ρ log(zt) + σ εt+1. Here, ρ ∈ (0, 1) and εt+1 is normally distributed with mean 0 and variance 1.
We write the Bellman equation as

V(K, z) = max_c { log c + β E[V(zK^α − c, z′) | z] }.
Given the log utility, we guess that the value function takes the functional form V(K, z) = d0 + d1 log z + d2 log K. The maximization problem becomes

max_c { log c + βE[V(zK^α − c, z′)] } = max_c { log c + βd0 + βd2 log(zK^α − c) + βd1 E[log z′] }.

The first-order condition gives c = zK^α/(1 + βd2), so that K′ = zK^α − c = βd2 zK^α/(1 + βd2). Substituting back and using E[log z′ | z] = ρ log z, the guess must satisfy

d0 + d1 log z + d2 log K = log[ zK^α/(1 + βd2) ] + βd2 log[ βd2 zK^α/(1 + βd2) ] + βd1 ρ log z + βd0.

Matching coefficients yields

d2 = α + αβd2,
d1 = 1 + βd2 + βρd1,                                   (2)
d0 = −log(1 + βd2) + βd2 log[ βd2/(1 + βd2) ] + βd0.

Solving the first equation gives d2 = α/(1 − αβ), so that βd2/(1 + βd2) = αβ.
The decision rule can also be derived: Kt+1 = αβ zt Kt^α.
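As a small check of the algebra (not in the original notes), the sketch below computes d0, d1, d2 from system (2) in closed form and confirms that the implied saving rate βd2/(1 + βd2) equals αβ, which is exactly the decision rule above; the parameter values are illustrative.

```python
# Closed-form coefficients of the guessed value function V(K, z) = d0 + d1*log(z) + d2*log(K),
# computed from system (2); alpha, beta, rho are illustrative values.
import numpy as np

alpha, beta, rho = 0.36, 0.96, 0.9

d2 = alpha / (1.0 - alpha * beta)                      # from d2 = alpha + alpha*beta*d2
d1 = (1.0 + beta * d2) / (1.0 - beta * rho)            # from d1 = 1 + beta*d2 + beta*rho*d1
d0 = (-np.log(1.0 + beta * d2)
      + beta * d2 * np.log(beta * d2 / (1.0 + beta * d2))) / (1.0 - beta)

print(f"d0 = {d0:.4f}, d1 = {d1:.4f}, d2 = {d2:.4f}")
print("saving rate equals alpha*beta:",
      np.isclose(beta * d2 / (1.0 + beta * d2), alpha * beta))
```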
The contraction mapping theorem also implies that, starting from an arbitrary v0 ∈ C(S), repeated application of the Bellman operator converges to the fixed point:

lim_{N→∞} T̂^N v0 = V.
This property gives rise to a numerical algorithm known as value function iteration for finding V .
We start with an arbitrary guess V0(s) and iterate the Bellman operator,

Vn+1 = T̂ Vn,

until Vn converges. The contraction mapping theorem guarantees the convergence of this algorithm. In particular, the contraction property implies that ‖Vn − V‖ converges to zero at a geometric rate.
Note that in the case where we set v0(s) = 0, the value function iteration algorithm is equivalent to solving a finite-horizon problem by backward induction. Suppose we stop the iteration at N because convergence is attained, e.g., ‖VN(s) − VN−1(s)‖ ≈ 10^{−15}. The equivalent finite-horizon problem is then defined with ut(st, at) = u(st, at) for t = 0, 1, ..., N − 1 and uN(sN, aN) = 0.
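The following sketch, which is not part of the notes, implements value function iteration for the growth example above in the deterministic special case zt ≡ 1 (σ = 0), so that the closed-form policy K′ = αβ K^α can serve as an accuracy check; the grid bounds and parameter values are illustrative.

```python
# Value function iteration for the deterministic special case (z_t = 1) of the growth
# model above, on a capital grid; all numbers are illustrative assumptions.
import numpy as np

alpha, beta = 0.36, 0.96
k_grid = np.linspace(0.05, 0.5, 500)            # assumed capital grid
V = np.zeros_like(k_grid)                        # v0 = 0, as in the finite-horizon analogy

# consumption implied by every (K, K') pair; infeasible pairs get payoff -inf
C = k_grid[:, None] ** alpha - k_grid[None, :]
U = np.where(C > 0, np.log(np.maximum(C, 1e-12)), -np.inf)

for it in range(2000):
    V_new = np.max(U + beta * V[None, :], axis=1)    # one application of the Bellman operator
    if np.max(np.abs(V_new - V)) < 1e-10:
        break
    V = V_new

policy = k_grid[np.argmax(U + beta * V[None, :], axis=1)]
print("max policy error vs alpha*beta*K^alpha:",
      np.max(np.abs(policy - alpha * beta * k_grid ** alpha)))
```

The remaining policy error is governed by the grid spacing, while the value function itself converges at the geometric rate β, as stated above.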
An alternative to value function iteration is policy iteration (Howard's improvement algorithm), which alternates a policy evaluation step and a policy improvement step; a sketch of the full procedure appears after the list below.

1. Choose an arbitrary policy g0 and compute the value function V0 implied by g0. On discretized grids of the state space, this is usually done by solving a linear system. There also exists a fast method to compute V0(s) by defining an operator B̂,

   (B̂ V)(s) = u(s, g0(s)) + β ∫ V(s′) P(s, g0(s); s′) ds′,

   and finding the fixed point V0 = B̂ V0. Iterating on B̂ a small number of times yields an approximation of V0.
2. Generate an improved policy g1(s) that solves the two-period problem

   g1(s) = arg max_{a ∈ Γ(s)} { u(s, a) + β ∫ V0(s′) P(s, a; s′) ds′ }.
3. Given g1, continue the cycle of the value function evaluation step and the policy improvement step until the first iteration n at which gn = gn−1 (or, alternatively, ‖Vn − Vn−1‖ falls below a tolerance). Since such a gn satisfies the Bellman equation, it is optimal.
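A minimal sketch of these three steps for the same deterministic (zt ≡ 1) growth example is given below; here the policy evaluation step solves the linear system (I − βPg)V = ug exactly instead of iterating on B̂, and the grid and parameters are again illustrative assumptions.

```python
# Howard policy iteration for the deterministic (z_t = 1) growth example; policy
# evaluation solves the linear system (I - beta*P_g) V = u_g on an assumed grid.
import numpy as np

alpha, beta = 0.36, 0.96
k_grid = np.linspace(0.05, 0.5, 300)
n = len(k_grid)

C = k_grid[:, None] ** alpha - k_grid[None, :]
U = np.where(C > 0, np.log(np.maximum(C, 1e-12)), -np.inf)

g = np.zeros(n, dtype=int)                      # arbitrary initial policy g0: save the lowest grid point
for it in range(100):
    # policy evaluation: V = u_g + beta * V[g], solved exactly as a linear system
    P = np.zeros((n, n))
    P[np.arange(n), g] = 1.0
    u_g = U[np.arange(n), g]
    V = np.linalg.solve(np.eye(n) - beta * P, u_g)
    # policy improvement: solve the two-period problem given V
    g_new = np.argmax(U + beta * V[None, :], axis=1)
    if np.array_equal(g_new, g):
        break
    g = g_new

print("converged after", it, "improvement steps")
print("max policy error:", np.max(np.abs(k_grid[g] - alpha * beta * k_grid ** alpha)))
```

In this kind of example, policy iteration typically converges in far fewer iterations than value function iteration, at the cost of one linear solve per step.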
Under the contraction conditions above, further properties of the value function can be established for problems with state x, shock z, action a ∈ Γ(x, z), and a transition function φ for the next-period state:

• Under the conditions (1) u(., z, a) is continuous and bounded for each z, a; (2) u(., z, a) is strictly increasing; (3) for each z, Γ(., z) is increasing (x < x′ implies Γ(x, z) ⊂ Γ(x′, z)); and (4) φ(., a, z, z′) is increasing for each a, z, z′, the value function V(., z) is strictly increasing for each z.

• Under conditions (1)-(4) together with (5) at each z, for all x, a, x′, a′ and θ ∈ (0, 1), u(θx + (1 − θ)x′, z, θa + (1 − θ)a′) ≥ θu(x, z, a) + (1 − θ)u(x′, z, a′), and (6) φ(., ., z, z′) is concave for each z, z′, the value function V(., z) is strictly concave for each z, and the policy correspondence G is a single-valued continuous function.

• Under (5) together with (7) for each z, u(., z, .) is continuously differentiable on the interior of X × A; (8) for each z, z′, φ(., ., z, z′) is differentiable on the interior of X × A; and (9) at each z, for all x, x′ and θ ∈ (0, 1), a ∈ Γ(x, z) and a′ ∈ Γ(x′, z) imply that θa + (1 − θ)a′ ∈ Γ(θx + (1 − θ)x′, z), the value function V(., z) is continuously differentiable.
Analyzing the existence and properties of the value function is nontrivial for unbounded reward
functions. By contrast, unbounded reward functions do not pose any difficulty for the Maximum
Principle to work. To present the infinite horizon maximum principle, we write the Lagrangian
form for the optimal control problem
L = E[ Σ_{t=0}^{∞} ( β^t u(xt, zt, at) − β^{t+1} μt+1 ( xt+1 − φ(xt, zt, at, zt+1) ) ) ]
The first-order conditions are

ua(xt, zt, at) + β Et[μt+1 φa(xt, zt, at, zt+1)] = 0,
μt = ux(xt, zt, at) + β Et[μt+1 φx(xt, zt, at, zt+1)].

Setting μt = Vx(xt, zt), we can see that the two conditions above are equivalent to the first-order condition and the envelope condition of the Bellman equation. The Lagrange multiplier μt is interpreted as the shadow value of the state variable, i.e., the derivative of the value function with respect to the state.
Recall that in the finite-horizon case, we have a terminal condition μT = ∂uT/∂xT that allows us to solve the problem by backward induction. There is no well-defined terminal condition in the infinite-horizon case. Here, a sufficient boundary condition takes the form of the transversality condition

lim_{T→∞} E[β^T μT xT] = 0.
For a special class of control problems, the Euler class, the transversality condition can also be shown to be necessary (cf. Ekeland and Scheinkman, 1986, and Kamihigashi, 2000).
In practice, it may be possible to use simple tricks to transform a general optimal control problem into a member of the Euler class. Suppose it is possible to perform a change of variables such that the state transition equation becomes

xt+1 = at.

This simplifies the solution by both the Bellman equation and the maximum principle.
1. Bellman equation
The Bellman equation becomes

V(x, z) = max_a { u(x, z, a) + β ∫ V(a, z′) Q(z, z′) dz′ },

with first-order condition 0 = ua(x, z, a) + β ∫ Vx(a, z′) Q(z, z′) dz′. The envelope condition becomes very simple,

Vx(x, z) = ux(x, z, g(x, z)),

where x′ = a = g(x, z) is the optimal policy. Substituting the envelope condition into the
first-order condition yields the Euler equation

0 = ua(x, z, a) + β ∫ ux(x′, z′, a′) Q(z, z′) dz′
  = ua(x, z, g(x, z)) + β ∫ ux( g(x, z), z′, g(g(x, z), z′) ) Q(z, z′) dz′.
This is a functional equation for the optimal policy g. For the Euler class, instead of solving the original Bellman equation, we can solve the Euler equation directly; a numerical check of this equation for the earlier growth example follows the next item.
2. Maximum principle
The first-order conditions become

ua(xt, zt, at) + β Et[μt+1] = 0,     μt = ux(xt, zt, at).

Substituting the second equation into the first one, we get the sequential form of the Euler equation

0 = ua(xt, zt, xt+1) + β Et[ux(xt+1, zt+1, xt+2)].
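As a numerical check (not in the notes), the sketch below verifies that the closed-form policy g(x, z) = αβ z x^α from the earlier growth example satisfies the Euler equation above, with x = K, u(x, z, a) = log(z x^α − a) and x′ = a; the parameter values and the Monte Carlo approximation of the conditional expectation are illustrative.

```python
# Check that g(x, z) = alpha*beta*z*x**alpha satisfies the Euler equation
# 0 = u_a(x, z, a) + beta*E[u_x(x', z', a')] with u(x, z, a) = log(z*x**alpha - a)
# and x' = a; alpha, beta, rho, sigma are illustrative values.
import numpy as np

alpha, beta, rho, sigma = 0.36, 0.96, 0.9, 0.1
rng = np.random.default_rng(0)

def g(x, z):
    return alpha * beta * z * x ** alpha

def euler_residual(x, z, n_draws=10_000):
    a = g(x, z)
    u_a = -1.0 / (z * x ** alpha - a)                       # marginal cost of saving today
    z_next = np.exp(rho * np.log(z) + sigma * rng.standard_normal(n_draws))
    a_next = g(a, z_next)
    u_x = alpha * z_next * a ** (alpha - 1) / (z_next * a ** alpha - a_next)
    return u_a + beta * u_x.mean()                          # should be (numerically) zero

for x, z in [(0.1, 1.0), (0.3, 1.2), (0.5, 0.8)]:
    print(f"residual at (x={x}, z={z}): {euler_residual(x, z):.2e}")
```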
To get some economic sense of the transversality condition, we consider a simple example.
By defining Kt+1 ≡ at, the problem can be rewritten as

max_{{ct}_{t=0}^{T}}  E[ Σ_{t=0}^{T} β^t u( zt F(Kt) − at ) ],    β ∈ (0, 1).

KT+1 should be non-negative.
• If KT+1 = 0, the following condition should be satisfied: E[β^T u′(cT)] > 0;
• If KT+1 > 0, the following condition should be satisfied: E[β^T u′(cT)] = 0.
We can combine the two conditions as E[β^T u′(cT) KT+1] = 0. This is the transversality condition in the finite-horizon case. The economic meaning is that the expected discounted shadow value of the terminal state (e.g., capital or wealth) must be zero. In the infinite-horizon case, we take the limit of this condition as T → ∞.
Consider next a consumption-saving example,

max_{{ct}}  E0 Σ_{t=0}^{∞} β^t ct^{1−γ}/(1 − γ),     γ > 0,

subject to xt+1 = Rt+1(xt − ct), xt+1 > 0, x0 > 0 given, where Rt+1 > 0 is i.i.d. drawn from a given distribution. By defining yt+1 = xt+1/Rt+1, we have yt+1 = at = xt − ct = yt Rt − ct, so that ct = yt Rt − at.
The first-order (Euler) condition is

ct^{−γ} = β Et[Rt+1 ct+1^{−γ}].
An obvious guess of the consumption policy is that ct = Cxt (0 < C < 1). Plugging the conjecture
into the Euler equation yields
(Cxt)^{−γ} = β Et[Rt+1 (Cxt+1)^{−γ}] = β Et[Rt+1 C^{−γ} (Rt+1 xt − Rt+1 Cxt)^{−γ}].
The above equation gives us C = 1 − (β Et[Rt+1^{1−γ}])^{1/γ}. Consider the Bellman equation

V(x) = max_c { c^{1−γ}/(1 − γ) + β Et[V(Rt+1(x − c))] }.

An obvious guess of the value function is V(x) = Bx^{1−γ}/(1 − γ). Plugging the conjecture into the Bellman equation yields

B = [ 1 − (β Et[Rt+1^{1−γ}])^{1/γ} ]^{−γ}.
To verify the transversality condition

lim_{t→∞} E0[β^t V(xt)] = 0,

we compute

E0[β^t V(xt)] = [β^t B/(1 − γ)] E0[xt^{1−γ}] = [β^t B/(1 − γ)] E0[Rt^{1−γ} (xt−1 − ct−1)^{1−γ}]
             = [β^t B/(1 − γ)] (1 − C)^{1−γ} E0[Rt^{1−γ} xt−1^{1−γ}]                      (9)
             = [β^t B/(1 − γ)] (1 − C)^{t(1−γ)} x0^{1−γ} E0[ ∏_{j=1}^{t} Rj^{1−γ} ].

Since the Rj are i.i.d., E0[∏_{j=1}^{t} Rj^{1−γ}] = (E[R^{1−γ}])^t, so E0[β^t V(xt)] is proportional to [β(1 − C)^{1−γ} E[R^{1−γ}]]^t, and the transversality condition holds provided β(1 − C)^{1−γ} E[R^{1−γ}] < 1. Using (1 − C)^γ = βE[R^{1−γ}], this factor equals 1 − C, so the condition is satisfied whenever 0 < C < 1. The transversality condition from the maximum principle involves the same object, since μT = u′(cT) = C^{−γ} xT^{−γ} and B = C^{−γ}:

E0[β^T cT^{−γ} xT] = β^T B E0[xT^{1−γ}].
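To make this concrete, the sketch below assumes an illustrative lognormal distribution for R (the notes leave the distribution unspecified), checks the Euler equation under the rule ct = Cxt, and computes the geometric factor β(1 − C)^{1−γ}E[R^{1−γ}] that governs the transversality condition; all parameter values are assumptions.

```python
# Numerical check of the consumption rule c_t = C*x_t with
# C = 1 - (beta*E[R**(1-gamma)])**(1/gamma), assuming an illustrative lognormal return R.
import numpy as np

beta, gamma = 0.95, 2.0
mu_r, sigma_r = 0.03, 0.15                     # assumed lognormal return parameters
rng = np.random.default_rng(0)
R = np.exp(mu_r + sigma_r * rng.standard_normal(1_000_000))

ER = np.mean(R ** (1.0 - gamma))
C = 1.0 - (beta * ER) ** (1.0 / gamma)

# Euler equation (C*x)**(-gamma) = beta*E[R*(C*x')**(-gamma)] with x' = R*(1-C)*x, at x = 1:
lhs = C ** (-gamma)
rhs = beta * np.mean(R * (C * R * (1.0 - C)) ** (-gamma))
print("Euler residual:", lhs - rhs)

# Transversality condition: E0[beta^t V(x_t)] shrinks by this factor every period
tvc_factor = beta * (1.0 - C) ** (1.0 - gamma) * ER
print("TVC factor (should be < 1):", tvc_factor, "  1 - C =", 1.0 - C)
```

As derived above, the printed factor should coincide with 1 − C and lie below one, so the transversality condition holds for this illustrative specification.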
References
Alvarez, F., Stokey, N. L., 1998. Dynamic programming with homogeneous functions. Journal of Economic Theory 82, 167–189.
Durán, J., 2000. On dynamic programming with unbounded returns. Economic Theory 15, 339–352.
Ekeland, I., Scheinkman, J. A., 1986. Transversality conditions for some infinite horizon discrete
time optimization problems. Mathematics of Operations Research 11, 216–229.
Kamihigashi, T., 2000. A simple proof of Ekeland and Scheinkman's result on the necessity of a transversality condition. Economic Theory 15, 463–468.
Stokey, N. L., Lucas, R. E., Prescott, E. C., 1989. Recursive Methods in Economic Dynamics.
Harvard University Press.