Lecture optimalStoppingTime
Lecture optimalStoppingTime
a = 1: to continue
That is, you stop whenever it is better to stop now rather than to continue one step further
and then stop.
Let us say that I am in state x.
Then, what is the stopping cost? k(x)
On the other hand, what is the cost that I continue now and I stop at one step further?
c(x) + E[k(X̂)].
* ECE Paris Graduate School of Engineering, 37 quai de Grenelle 75015 Paris, France; [email protected]
1
4 Closed stopping set
We say that the set S ⊂ X is closed (where X is the state space), if once inside the stopping
set, you cannot leave from it. That is,
Px,y = 0, ∀x ∈ S, y 6∈ S. (4)
That is, if S is a closed stopping set, then x ∈ S is the current state implies that the next state
x̂ is also x̂ ∈ S.
Then, using the Bellman equation, we have
Cs (x) = min{k(x), c(x) + E[Cs−1 (x̂)]} = min{k(x), c(x) + E[k(X̂)]} = k(x) (6)
2
Let us now consider the following stopping set
where K(s − 1) is the cost of taking the next available space from position s − 1 onwards.
Let us define
K(s) = p s + q K(s − 1), (11)
with K(0) = q D.
If we solve this difference equation, we have
q
K(s) = − + s + c q s+1 , (12)
p
with c = D + p1 .
Substituting this into the expression of the stopping set, we have
S = {s : (D p + 1) q s ≥ 1}. (13)
Hence, the optimal policy is to take the next available space once the condition (D p + 1)q s ≥ 1
is met.
In conclusion, the OSLA rule is optimal.