Optimal Stopping Time

Jae Yun JUN KIM*

Reference: Neil Walton’s lecture notes

1 Optimal stopping problem

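An optimal stopping problem is a Markov Decision Process with two actions:
a = 0: to stop

a = 1: to continue

and with two types of cost:

$$
c(x, a) =
\begin{cases}
k(x), & \text{for } a = 0, \\
c(x), & \text{for } a = 1.
\end{cases}
\tag{1}
$$

Here k(x) is the cost of stopping in state x and c(x) is the cost of continuing from state x.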

2 Bellman equation for optimal stopping problem

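Assuming that time is discrete and the horizon is finite, the Bellman equation for the optimal stopping
problem is

$$
C_s(x) = \min\{ k(x),\; c(x) + \mathbb{E}_x[ C_{s-1}(\hat{X}) ] \}, \qquad s \ge 1, \tag{2}
$$

with $C_0(x) = k(x)$: when s = 0 no steps remain, so no continuation cost is incurred and the process must stop.
Here $C_s(x)$ is the minimal expected cost from state x with s steps remaining, and $\hat{X}$ is the next state.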

Note: In this problem, we neither control the process nor influence the environment.
We only observe the process and decide the right moment to stop.
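
The recursion (2) can be run numerically by backward induction on the number of remaining steps. The following is a minimal sketch, assuming a finite state space so that the expectation becomes a product with a transition matrix P. The function name `stopping_values` and the three-state example are illustrative assumptions and do not come from the notes.

```python
import numpy as np

def stopping_values(k, c, P, horizon):
    """Backward induction for equation (2).

    k[x]: stopping cost, c[x]: continuation cost, P[x, y]: transition
    probability from x to y. Returns C with C[s, x] = C_s(x).
    """
    k = np.asarray(k, dtype=float)
    c = np.asarray(c, dtype=float)
    P = np.asarray(P, dtype=float)
    C = np.empty((horizon + 1, len(k)))
    C[0] = k  # boundary condition C_0(x) = k(x)
    for s in range(1, horizon + 1):
        # stop now (cost k) versus pay c and continue with s - 1 steps left
        C[s] = np.minimum(k, c + P @ C[s - 1])
    return C

# Hypothetical example: three states, state 2 is absorbing.
k = [3.0, 1.0, 0.0]
c = [0.5, 0.5, 0.5]
P = [[0.5, 0.5, 0.0],
     [0.0, 0.5, 0.5],
     [0.0, 0.0, 1.0]]
C = stopping_values(k, c, P, horizon=5)
```

In this example C[1] evaluates to (2.5, 1.0, 0.0): with one step remaining it is cheaper to continue from state 0 and to stop in states 1 and 2.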

3 One step look ahead (OSLA) rule

In the one step look ahead (OSLA) rule, we stop whenever x ∈ S, where

S = {x : k(x) ≤ c(x) + E[k(X̂)]}. (3)

That is, we stop whenever stopping now costs no more than continuing for one more step
and then stopping.
Suppose that the current state is x.
The cost of stopping now is k(x).
The cost of continuing now and stopping one step later is c(x) + E[k(X̂)].
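
For a finite state space, the set S in (3) can be computed with a single comparison per state. The sketch below assumes a transition matrix P and reuses a hypothetical three-state example; the name `osla_set` is illustrative.

```python
import numpy as np

def osla_set(k, c, P):
    """Boolean mask of the OSLA stopping set S = {x : k(x) <= c(x) + E[k(X_hat)]}."""
    k = np.asarray(k, dtype=float)
    c = np.asarray(c, dtype=float)
    P = np.asarray(P, dtype=float)
    # P @ k is the expected stopping cost one step ahead, for each state x
    return k <= c + P @ k

# Hypothetical example (not from the notes).
k = [3.0, 1.0, 0.0]
c = [0.5, 0.5, 0.5]
P = [[0.5, 0.5, 0.0],
     [0.0, 0.5, 0.5],
     [0.0, 0.0, 1.0]]
S = osla_set(k, c, P)
```

Here S contains states 1 and 2 but not state 0, where stopping costs 3 while continuing one step and then stopping costs 2.5 in expectation.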
* ECE Paris Graduate School of Engineering, 37 quai de Grenelle 75015 Paris, France; [email protected]

4 Closed stopping set
We say that the set S ⊂ X is closed (where X is the state space) if, once inside the stopping
set, the process cannot leave it. That is,

$$
P_{x,y} = 0, \qquad \forall x \in S,\; y \notin S. \tag{4}
$$

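Suppose that S is given by the OSLA rule. If S is closed, then

$$
C_{s-1}(x) = k(x) \ \text{for all } x \in S \implies C_s(x) = k(x) \ \text{for all } x \in S. \tag{5}
$$

Indeed, if S is closed and the current state is x ∈ S, then the next state $\hat{X}$ also lies in S.
Then, using the Bellman equation, we have

$$
C_s(x) = \min\{ k(x),\; c(x) + \mathbb{E}[ C_{s-1}(\hat{X}) ] \} = \min\{ k(x),\; c(x) + \mathbb{E}[ k(\hat{X}) ] \} = k(x), \tag{6}
$$

where the second equality uses $C_{s-1}(\hat{X}) = k(\hat{X})$ for $\hat{X} \in S$, and the last one uses the definition (3) of S.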
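
Condition (4) is easy to test for a finite chain: no transition probability from a state inside S to a state outside S may be positive. A minimal sketch follows, with an illustrative function name `is_closed` and a hypothetical transition matrix.

```python
import numpy as np

def is_closed(S, P):
    """Check condition (4): P[x, y] == 0 for all x in S and y not in S."""
    S = np.asarray(S, dtype=bool)
    P = np.asarray(P, dtype=float)
    # rows restricted to S, columns restricted to the complement of S
    return not np.any(P[S][:, ~S] > 0)

# Hypothetical transition matrix (not from the notes).
P = [[0.5, 0.5, 0.0],
     [0.0, 0.5, 0.5],
     [0.0, 0.0, 1.0]]

closed_12 = is_closed([False, True, True], P)   # S = {1, 2}
closed_0 = is_closed([True, False, False], P)   # S = {0}
```

In this example the set {1, 2} is closed, whereas {0} is not, because state 0 moves to state 1 with probability 0.5.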

5 Optimal policy for the optimal stopping problem

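Suppose that, for the finite-time stopping problem, the set S given by the one step look ahead rule is closed.
Then, the one step look ahead rule is an optimal policy.
Proof: (By induction on s).
We know that $C_0(x) = k(x)$, ∀x.
Hence, applying the implication (5) for s = 1, 2, ..., we obtain

$$
C_s(x) = k(x), \qquad \forall x \in S \text{ and } s \in \mathbb{Z}_+. \tag{7}
$$

So, it is always optimal to stop for x ∈ S.

Further, if x ∉ S, then $k(x) > c(x) + \mathbb{E}[k(\hat{X})]$. Since stopping is always available, $C_{s-1}(y) \le k(y)$ for every state y, and therefore $c(x) + \mathbb{E}[C_{s-1}(\hat{X})] \le c(x) + \mathbb{E}[k(\hat{X})] < k(x)$ for every s ≥ 1. Stopping at x is thus strictly worse than continuing.
In conclusion, ∀x ∈ S, it is optimal to stop; and, ∀x ∉ S, it is optimal to continue as long as at least one step remains.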
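
The statement can be checked numerically on a small chain: when S is closed, the states in which stopping attains the minimum in (2) should coincide with S for every s ≥ 1. The sketch below uses a hypothetical three-state example and illustrative variable names; it is a sanity check, not a proof.

```python
import numpy as np

# Hypothetical example (not from the notes): state 2 is absorbing.
k = np.array([3.0, 1.0, 0.0])
c = np.array([0.5, 0.5, 0.5])
P = np.array([[0.5, 0.5, 0.0],
              [0.0, 0.5, 0.5],
              [0.0, 0.0, 1.0]])

S = k <= c + P @ k                    # OSLA stopping set, equation (3)
closed = not np.any(P[S][:, ~S] > 0)  # closedness, equation (4)

C = k.copy()                          # C_0 = k
agrees = True
for s in range(1, 11):
    cont = c + P @ C                  # cost of continuing with s steps left
    stop_is_optimal = k <= cont       # stopping attains the minimum in (2)
    agrees = agrees and np.array_equal(stop_is_optimal, S)
    C = np.minimum(k, cont)           # C_s
```

For this example both `closed` and `agrees` evaluate to true, so the OSLA set matches the Bellman-optimal stopping region at every horizon tested.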

6 Example: Finding parking space

You are looking for a parking space on a street. Each space is free with probability p = 1 − q. You
cannot tell whether a space is free until you reach it, and you cannot go backward. Once at a space, you
must decide to stop or to continue. From position s (i.e., s spaces from your destination), the
cost of stopping is s. The cost of passing your destination without parking is D. Construct
the strategy that minimizes the expected cost of parking.
Answer
Let the state at position s be defined as

x_s = I[space s is free]. (8)

Now, using the Bellman equation, we have

$$
\begin{aligned}
C_s(1) &= \min\{ s,\; p\,C_{s-1}(1) + q\,C_{s-1}(0) \}, \\
C_s(0) &= p\,C_{s-1}(1) + q\,C_{s-1}(0).
\end{aligned}
\tag{9}
$$
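
The recursion (9) can be evaluated position by position, starting at the destination. The sketch below assumes that driving past position 0 costs D in either state, which gives C_0(1) = 0 and C_0(0) = D; the notes do not state this boundary condition explicitly, and the function name `parking_values` is illustrative.

```python
def parking_values(n, p, D):
    """Evaluate recursion (9) for positions s = 0..n.

    Returns (free, taken) with free[s] = C_s(1) and taken[s] = C_s(0).
    Assumption: continuing past position 0 costs D.
    """
    q = 1.0 - p
    free, taken = [], []
    for s in range(n + 1):
        if s == 0:
            cont = float(D)  # passing the destination without parking
        else:
            cont = p * free[s - 1] + q * taken[s - 1]
        free.append(min(float(s), cont))  # space is free: park or drive on
        taken.append(cont)                # space is taken: must drive on
    return free, taken

# Hypothetical parameters.
free, taken = parking_values(n=4, p=0.5, D=10)
```

With p = 0.5 and D = 10 this gives C_s(1) = 0, 1, 2, 2.5, 2.5 for s = 0, ..., 4, so a free space is taken at positions 0, 1 and 2 but passed at positions 3 and 4.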

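Let us now consider the following stopping set

$$
S = \{ s : s \le K(s-1) \}, \tag{10}
$$

where K(s − 1) is the expected cost of taking the next available space from position s − 1 onwards.
The function K satisfies

$$
K(s) = p\,s + q\,K(s-1), \tag{11}
$$

with K(0) = q D. Indeed, with probability p the space at position s is free and is taken at cost s, and with probability q we move on to position s − 1. At the destination, a free space costs 0 and a taken one forces us to pass the destination at cost D.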
If we solve this difference equation, we have

$$
K(s) = -\frac{q}{p} + s + c\,q^{s+1}, \tag{12}
$$

with $c = D + \frac{1}{p}$.
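
The closed form (12) can be compared against the recursion (11) numerically. This is a small sanity check with illustrative function names and hypothetical parameter values.

```python
import math

def K_recursive(s, p, D):
    """K(s) from the recursion (11) with K(0) = q * D."""
    q = 1.0 - p
    value = q * D
    for j in range(1, s + 1):
        value = p * j + q * value
    return value

def K_closed(s, p, D):
    """K(s) from the closed form (12) with c = D + 1/p."""
    q = 1.0 - p
    return s - q / p + (D + 1.0 / p) * q ** (s + 1)

# Hypothetical parameters.
p, D = 0.3, 20.0
matches = all(
    math.isclose(K_recursive(s, p, D), K_closed(s, p, D), rel_tol=1e-9)
    for s in range(30)
)
```

The two functions agree for the positions tested, which is consistent with (12) solving (11).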
Substituting this into the expression of the stopping set, we have

$$
S = \{ s : (D p + 1)\, q^{s} \ge 1 \}. \tag{13}
$$
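
The substitution can be checked step by step. The derivation below uses only (10) and (12) with c = D + 1/p, together with the identity 1 + q/p = 1/p, which holds because p + q = 1.

```latex
\begin{align*}
s \le K(s-1)
  &\iff s \le (s-1) - \frac{q}{p} + \Bigl(D + \frac{1}{p}\Bigr) q^{s} \\
  &\iff 1 + \frac{q}{p} \le \Bigl(D + \frac{1}{p}\Bigr) q^{s} \\
  &\iff \frac{1}{p} \le \frac{D p + 1}{p}\, q^{s} \\
  &\iff 1 \le (D p + 1)\, q^{s}.
\end{align*}
```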

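Hence, the policy is to take the next available space once the condition $(D p + 1)\, q^{s} \ge 1$
is met.
Since $q^{s}$ decreases as s increases, S consists of all positions up to some threshold, and the position only decreases as we drive toward the destination. Once in S we therefore remain in S, so S is closed.
In conclusion, the OSLA rule is optimal.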
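
As a numerical cross-check, the threshold implied by (13) can be compared with the positions at which backward induction on (9) takes a free space. The sketch assumes 0 < p ≤ 1 and that driving past position 0 costs D in either state; the function names and parameter values are illustrative.

```python
def osla_threshold(p, D):
    """Largest s with (D*p + 1) * q**s >= 1, so that S = {0, ..., s}.

    Assumes 0 < p <= 1; otherwise the loop would not terminate.
    """
    q = 1.0 - p
    s = 0
    while (D * p + 1) * q ** (s + 1) >= 1:
        s += 1
    return s

def dp_stop_positions(n, p, D):
    """Positions s <= n where backward induction on (9) takes a free space."""
    q = 1.0 - p
    stops = []
    free_prev = taken_prev = float(D)  # assumed cost after passing position 0
    for s in range(n + 1):
        cont = p * free_prev + q * taken_prev
        if s <= cont:                  # parking is at least as good as driving on
            stops.append(s)
        free_prev, taken_prev = min(float(s), cont), cont
    return stops

# Hypothetical parameters.
threshold = osla_threshold(p=0.5, D=10)
stops = dp_stop_positions(n=20, p=0.5, D=10)
```

With p = 0.5 and D = 10 the threshold is 2 and backward induction parks exactly at positions 0, 1 and 2, in agreement with the OSLA rule.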
