Dynamic Programming (DP)
Dynamic Programming (DP)
(e) When the current state is known, an optimal policy for the
remaining stages is independent of the policy of the previous ones.
(f) To identify the optimal policy for each state of the system, a
recursive equation is formulated with n stages remaining, given the
optimal policy for each state with (n − 1) stages left.