MA 668 2024 Lecture 23
3 Next, we aim to show that the inequality above can be reversed. Take an
arbitrary admissible control u ∈ A and consider what is known as an
ϵ-optimal control, denoted by v^ϵ ∈ A and defined as a control which
performs at least as well as H(t, x) − ϵ, but of course no better than
H(t, x), i.e., a control such that

H(t, x) ≥ H^{v^ϵ}(t, x) ≥ H(t, x) − ϵ.    (2)
The Dynamic Programming Principle (Contd ...)
1 Such a control exists, assuming that the value function is continuous in
the space of controls.
2 Consider next the following modification of the ϵ-optimal control:

ṽ^ϵ = u_t 1_{t ≤ τ} + v^ϵ 1_{t > τ}.    (3)
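The modified control follows u up to the stopping time τ and then switches to the ϵ-optimal control. A short derivation of the resulting lower bound (a sketch, combining admissibility of ṽ^ϵ, the tower property of conditional expectation, and the lower bound in (2)):

```latex
H(t,x) \;\ge\; H^{\tilde v^{\epsilon}}(t,x)
      \;=\; \mathbb{E}_{t,x}\!\left[\int_t^{\tau} F(s, X^{u}_s, u_s)\,ds
            + H^{v^{\epsilon}}(\tau, X^{u}_{\tau})\right]
      \;\ge\; \mathbb{E}_{t,x}\!\left[\int_t^{\tau} F(s, X^{u}_s, u_s)\,ds
            + H(\tau, X^{u}_{\tau})\right] - \epsilon .
```

Since ϵ > 0 is arbitrary, the −ϵ term can be dropped, and the resulting bound can then be optimised over u ∈ A.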
2 Moreover, since the above holds true for every u ∈ A, we have that:

H(t, x) ≥ sup_{u ∈ A} E_{t,x} [ H(τ, X^u_τ) + ∫_t^τ F(s, X^u_s, u_s) ds ].    (4)
3 The upper bound (1) and lower bound (4) form the dynamic programming
inequalities. Putting them together, we obtain the following Theorem.
Theorem (Dynamic Programming Principle for Diffusions)
The value function satisfies the DPP:

H(t, x) = sup_{u ∈ A} E_{t,x} [ H(τ, X^u_τ) + ∫_t^τ F(s, X^u_s, u_s) ds ],    (5)

for every stopping time τ ∈ [t, T].
1 This equation is really a family of equations, one for each stopping time
τ, tying the value function to its expected future value plus the
accumulated running reward/penalty.
2 Since it is a whole family of equations, an even more powerful equation can
be obtained by looking at its infinitesimal version, the so-called dynamic
programming equation (DPE).
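The content of (5) can be checked concretely in discrete time, where backward induction computes the value function exactly. Below is a minimal sketch on a hypothetical toy problem (the dynamics, rewards, and all names are illustrative, not from the lecture): the value at time 0 coincides with optimising the running reward up to an intermediate time and adding the value function there, a two-step analogue of (5).

```python
# Toy discrete-time control problem (hypothetical): X_{k+1} = X_k + u_k with
# u_k in {-1, 0, 1}, running reward F(k, x, u) = -u^2, terminal reward
# G(x) = -x^2, horizon T = 3.

T = 3
STATES = range(-5, 6)
CONTROLS = (-1, 0, 1)

def F(k, x, u):          # running reward: penalise control effort
    return -u * u

def G(x):                # terminal reward: penalise distance from 0
    return -x * x

# Backward induction: H[k][x] = max_u [ F(k, x, u) + H[k+1][x + u] ]
H = {T: {x: G(x) for x in STATES}}
for k in range(T - 1, -1, -1):
    H[k] = {}
    for x in STATES:
        H[k][x] = max(F(k, x, u) + H[k + 1].get(x + u, float("-inf"))
                      for u in CONTROLS)

# DPP check with intermediate time tau = 2: from (0, x), the value equals the
# sup over the first two controls of the accumulated running reward plus the
# value function H[2] at the reached state -- a two-step analogue of (5).
def dpp_two_step(x):
    best = float("-inf")
    for u0 in CONTROLS:
        for u1 in CONTROLS:
            x1, x2 = x + u0, x + u0 + u1
            if x1 in H[1] and x2 in H[2]:
                best = max(best, F(0, x, u0) + F(1, x1, u1) + H[2][x2])
    return best

print(all(H[0][x] == dpp_two_step(x) for x in range(-3, 4)))  # prints True
```

Exact integer arithmetic makes the two sides agree exactly here; in continuous time, (5) states the same identity in expectation over the controlled diffusion.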
DPE/HJB Equation
The DPE is an infinitesimal version of the dynamic programming principle
(DPP). There are two key ideas involved:
Idea 1
Setting the stopping time τ in the DPP to be the minimum of:
(a) the first time the process X^u exits a ball of radius ϵ around its
starting point, AND
(b) a fixed (small) time h,
all while keeping it bounded by T.
This is illustrated in Figure 5.2 and can be stated precisely as:
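The stopping time in Idea 1 can also be sketched in simulation. The following minimal Monte Carlo sketch (hypothetical parameters; an Euler scheme with a constant drift u stands in for the controlled diffusion) samples τ as the minimum of the first exit time of the ϵ-ball, the fixed small time t + h, and the horizon T:

```python
import random

def sample_tau(t=0.0, x=0.0, eps=0.5, h=0.2, T=1.0, u=0.1, dt=1e-3, rng=None):
    """Sample tau = min(first exit time of the eps-ball around x, t + h, T)."""
    rng = rng or random.Random(0)
    s, xs = t, x
    bound = min(t + h, T)             # cap by the small fixed time h and by T
    while s < bound:
        xs += u * dt + dt ** 0.5 * rng.gauss(0.0, 1.0)  # Euler step of dX = u dt + dW
        s += dt
        if abs(xs - x) >= eps:        # first exit of the eps-ball around the start
            return min(s, bound)
    return bound                      # no exit before the cap

print(0.0 < sample_tau() <= 0.2)     # tau is always capped by min(t + h, T)
```

By construction τ is bounded: with a very large ϵ the ball is never left and τ equals min(t + h, T), which is exactly the role of the cap in Idea 1.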