0% found this document useful (0 votes)

83 views

Dynamic Programming (DP)

Dynamic programming is an optimization technique used to solve multi-stage decision problems. It works by breaking down a complex problem into simpler sub-problems and storing the results to build up the solution. The stagecoach problem example illustrates how dynamic programming can be used to determine the lowest cost route through multiple stages. Key characteristics of dynamic programming problems include dividing the problem into stages, having states associated with each stage, and using an optimal substructure property to efficiently solve sub-problems just once.

Uploaded by

Lyka Alvarez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

83 views

Dynamic Programming (DP)

Uploaded by

Lyka Alvarez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 32

Dynamic Programming (DP)

~Concept, Properties and Principle of Optimality

~Stagecoach Problem

Prepared by: Kevin DO. Roa

Concept/Introduction
Consider a company that has to decide on the production plan of an
item for the next three months so as to meet the demands in different
months at minimum cost. The different months for which the production is
to be decided constitute the stages. So it is a multi-stage problem. For such
a problem, decisions are made sequentially over several periods using a
technique called Dynamic Programming (DP) technique. One thing
common to all models in this category is that current decisions influence
both present and future periods.

The dynamic programming approach divides the problem into several

sub-problems or stages and then these sub-problems are solved
sequentially until the initial problem is finally solved. The common
characteristic of all dynamic programming models is expressing the
decision problem by means of recursive formulation.
Dynamic Programming
Dynamic programming is a useful mathematical
technique for making a sequence of interrelated
decisions. It provides a systematic procedure for
determining the optimal combination of decisions.

Incontrast to linear programming, there does not

exist a standard mathematical formulation of “the”
dynamic programming problem. Rather, dynamic
programming is a general type of approach to
problem solving, and the particular equations used
must be developed to fit each situation.
Terminology
Terminologies which are commonly used in dynamic
programming are given below:

Stage - The point at which a decision is made is known as a stage.

The end of a stage marks the beginning of the immediate
succeeding stage. (For instance, in the salesmen allocation
problem, each territory represents a stage; in the shortest route
problem, each city represents a stage.)

State - The variable that links two stages in a multistage decision

problem is called a state variable. At any stage, the values that
state variables can take describe the status of the problem. These
values are referred to as states. (For example, in the shortest route
problem, a city is referred to as state variable.)
Principle of optimality: The principle of
optimality states that the optimal decision
from any state in a stage to the end, is
independent of how one actually arrives at
that state.

Optimal policy: A policy which optimizes the

value of an objective function is called an
optimal policy.
Bellman’s principle of optimality : It states that
“an optimal policy (a sequence of decisions) has the
property that whatever the initial state and decisions
are, the remaining decisions must constitute an
optimal policy with regard to the state resulting
from the first decision.”

Return function : At each stage, a decision is made

which can affect the state of the system at the next
stage and help in arriving at the optimal solution at
the current stage. Every decision has its merit which
can be represented in an algebraic equation form.
This equation is called a return function.
Characteristics of DP
The basic features which characterize the dynamic
programming problem are as follows:

(a) The problem can be subdivided into stages with a

policy decision required at each stage. A stage is a
device to sequence the decisions. That is, it
decomposes a problem into sub-problems such that an
optimal solution to the problem can be obtained from
the optimal solutions to the sub-problems.

(b) Every stage consists of a number of states

associated with it.
(c) Decision at each stage converts the current stage into state
associated with the next stage.

(d) The state of the system at a stage is described by a set of

variables, called state variables.

(e) When the current state is known, an optimal policy for the
remaining stages is independent of the policy of the previous ones.

(f) To identify the optimal policy for each state of the system, a
recursive equation is formulated with n stages remaining, given the
optimal policy for each state with (n − 1) stages left.

(g) Using recursive equation approach each time, the solution

procedure moves backward stage by stage for obtaining the
optimum policy of each state for that particular stage, till it attains
the optimum policy beginning at the initial stage.
The Stagecoach Problem

Image source: https://www.stahlsauto.com/automobiles/1860-

stagecoach/
The Stagecoach Problem
The STAGECOACH PROBLEM is a problem specially
constructed to illustrate the features and to introduce the
terminology of dynamic programming.

“It concerns a mythical fortune seeker in Missouri who

decided to go west to join the gold rush in California
during the mid-19th century. The journey would require
traveling by stagecoach through unsettled country where
there was serious danger of attack by marauders.
Although his starting point and destination were fixed,
he had considerable choice as to which states (or
territories that subsequently became states) to travel
through en route.”
The possible routes are shown in Fig. 11.1, where
each state is represented by a circled letter and the
direction of travel is always from left to right in the
diagram. Thus, four stages (stagecoach runs) were
required to travel from his point of embarkation in
state A (Missouri) to his destination in state J
(California).
This fortune seeker was a prudent man who
was quite concerned about his safety. After
some thought, he came up with a rather clever
way of determining the safest route.

Life insurance policies were offered to

stagecoach passengers. Because the cost of the
policy for taking any given stagecoach run
was based on a careful evaluation of the safety
of that run, the safest route should be the one
with the cheapest total life insurance policy.
The cost for the standard policy on the
stagecoach run from state i to state j, which
will be denoted by cij , is

These costs are also shown in Fig 11.1.

We shall now focus on the question of
which route minimizes the total cost of the
policy.
Solving the Problem
Solving the Problem
First note that the shortsighted approach of selecting the
cheapest run offered by each successive stage need not
yield an overall optimal decision.

Following this strategy would give the route A->B->F->

I->J, at a total cost of 13. However, sacrificing a little on
one stage may permit greater savings thereafter. For
example, A->D->F is cheaper overall than A->B->F.

One possible approach to solving this problem is to use

trial and error. However, the number of possible routes is
large (18), and having to calculate the total cost for each
route is not an appealing task.
Fortunately, dynamic programming provides a
solution with much less effort than exhaustive
enumeration.

Dynamic programming starts with a small

portion of the original problem and finds the
optimal solution for this smaller problem. It
then gradually enlarges the problem, finding
the current optimal solution from the
preceding one, until the original problem is
solved in its entirety.
For the stagecoach problem, we start with the smaller problem
where the fortune seeker has nearly completed his journey and
has only one more stage (stagecoach run) to go.

The obvious optimal solution for this smaller problem is to go

from his current state (whatever it is) to his ultimate destination
(state J).

At each subsequent iteration, the problem is enlarged by

increasing by 1 the number of stages left to go to complete the
journey.

For this enlarged problem, the optimal solution for where to go

next from each possible state can be found relatively easily
from the results obtained at the preceding iteration. The details
involved in implementing this approach follow.
Solution Procedure
Characteristics of DP Problems
The stagecoach problem is a literal prototype of
dynamic programming problems. In fact, this
example was purposely designed to provide a
literal physical interpretation of the rather
abstract structure of such problems.

Therefore, one way to recognize a situation that

can be formulated as a dynamic programming
problem is to notice that its basic structure is
analogous to the stagecoach problem.
These basic features that characterize dynamic
programming problems are presented and discussed here.

1. The problem can be divided into stages, with a policy

decision required at each stage.
2. Each stage has a number of states associated with the
beginning of that stage.
3. The effect of the policy decision at each stage is to
transform the current state to a state associated with the
beginning of the next stage (possibly according to a
probability distribution).
4. The solution procedure is designed to find an optimal
policy for the overall problem, (i.e., a prescription of the
optimal policy decision at each stage for each of the
possible states.)
5. Given the current state, an optimal policy
for the remaining stages is independent of the
policy decisions adopted in previous stages.

6. The solution procedure begins by finding

the optimal policy for the last stage.
References/Sources
Giri, B. C. (n.d.) Operations Research.
Chapter 8: Dynamic Programming.
Department of Mathematics, Jadavpur
University. Kolkata, India. Pdf.

Chapter 11: Dynamic Programming (n.d.)

Retrieved from https://www.ime.unicamp.br/~
andreani/MS515/capitulo7.pdf

Economics of Education: Meaning, Nature and Scope
100% (4)
Economics of Education: Meaning, Nature and Scope
5 pages
Frederick S. Hillier, Gerald J. Lieberman (Late) - Introduction To Operations Research (2015
No ratings yet
Frederick S. Hillier, Gerald J. Lieberman (Late) - Introduction To Operations Research (2015
6 pages
Dynamic Programming
100% (1)
Dynamic Programming
15 pages
Human Rights of Mentally Ill
No ratings yet
Human Rights of Mentally Ill
47 pages
Art. 353 - 364 - Crimes Against Honor
89% (9)
Art. 353 - 364 - Crimes Against Honor
13 pages
LESSON PLAN Defence Mechanism
100% (4)
LESSON PLAN Defence Mechanism
15 pages
Test Initial Engleza A 6-A
No ratings yet
Test Initial Engleza A 6-A
4 pages
Dynamic Programming
No ratings yet
Dynamic Programming
8 pages
Dynamic Programming 7707
No ratings yet
Dynamic Programming 7707
51 pages
Variables. These Variables Provide Information For Analyzing The Possible Effects That The Current Decision
No ratings yet
Variables. These Variables Provide Information For Analyzing The Possible Effects That The Current Decision
5 pages
Dynammic Programming Shortest Route
No ratings yet
Dynammic Programming Shortest Route
18 pages
Deterministic Dynamic Programming: To The Next
No ratings yet
Deterministic Dynamic Programming: To The Next
52 pages
PPT3 - W2-S3 - Dynamic Programming - R0
No ratings yet
PPT3 - W2-S3 - Dynamic Programming - R0
29 pages
Dynamic Programming
No ratings yet
Dynamic Programming
9 pages
Dynamic Programming
No ratings yet
Dynamic Programming
30 pages
Dynamic Programming - Part 1
No ratings yet
Dynamic Programming - Part 1
23 pages
IE 303 - LN9_1
No ratings yet
IE 303 - LN9_1
17 pages
04 - OR2 - Dynamic Programming
No ratings yet
04 - OR2 - Dynamic Programming
14 pages
Dynamic Programming
No ratings yet
Dynamic Programming
10 pages
Process Optimisation: Dynamic Programming
No ratings yet
Process Optimisation: Dynamic Programming
35 pages
IEI2P3 - Penelitian Operasional 2: Stagecoach Problem
No ratings yet
IEI2P3 - Penelitian Operasional 2: Stagecoach Problem
18 pages
Scan 09-Sep-2020
No ratings yet
Scan 09-Sep-2020
3 pages
Operation Research 2 Dynamic Programming
No ratings yet
Operation Research 2 Dynamic Programming
34 pages
The Analysis of Forward and Backward Dynamic Programming For Multistage Graph
No ratings yet
The Analysis of Forward and Backward Dynamic Programming For Multistage Graph
7 pages
Week 10 -Dynamic Programming
No ratings yet
Week 10 -Dynamic Programming
12 pages
Lecture 2 Deterministic
No ratings yet
Lecture 2 Deterministic
21 pages
Group 5 Dyn Prog
No ratings yet
Group 5 Dyn Prog
15 pages
Lecture 8 Dynamic Programming
No ratings yet
Lecture 8 Dynamic Programming
32 pages
Operational Reseach 1
No ratings yet
Operational Reseach 1
9 pages
Characteristics of Dynamic Programming Problems
No ratings yet
Characteristics of Dynamic Programming Problems
13 pages
Opt Class CH17102 - Unit 4
No ratings yet
Opt Class CH17102 - Unit 4
26 pages
Chapter VI DP and Network
No ratings yet
Chapter VI DP and Network
66 pages
IEI2P3 - Penelitian Operasional 2: Stagecoach Problem
No ratings yet
IEI2P3 - Penelitian Operasional 2: Stagecoach Problem
18 pages
DP Methods
No ratings yet
DP Methods
61 pages
Dynamic Programming
No ratings yet
Dynamic Programming
16 pages
DAA Material
No ratings yet
DAA Material
12 pages
Dyanamic Programing
No ratings yet
Dyanamic Programing
6 pages
Decision Models: Assignment 1: Dynamic Programming
No ratings yet
Decision Models: Assignment 1: Dynamic Programming
2 pages
OR
No ratings yet
OR
34 pages
Modified Dynamic
No ratings yet
Modified Dynamic
63 pages
Stagecoach Problem
No ratings yet
Stagecoach Problem
18 pages
Introduction_To_Dynamic_Programming
No ratings yet
Introduction_To_Dynamic_Programming
15 pages
25-Introduction To Dynamic Programming-08-03-2024
No ratings yet
25-Introduction To Dynamic Programming-08-03-2024
43 pages
UNIT-IV
No ratings yet
UNIT-IV
23 pages
RSH Qam11 Module02 Render
No ratings yet
RSH Qam11 Module02 Render
24 pages
RSH Qam11 Module02
No ratings yet
RSH Qam11 Module02
24 pages
Csc 411 Dynamic Programming
No ratings yet
Csc 411 Dynamic Programming
7 pages
Dynamic
No ratings yet
Dynamic
14 pages
unit-4-new
No ratings yet
unit-4-new
56 pages
Optimization: Dynamic Programming
No ratings yet
Optimization: Dynamic Programming
49 pages
Dynamic Programming Morshed sir
No ratings yet
Dynamic Programming Morshed sir
19 pages
CH 9 MDP
No ratings yet
CH 9 MDP
97 pages
Lesson 8 Complexity Theory and DP
No ratings yet
Lesson 8 Complexity Theory and DP
13 pages
Markov Decision Process: Fundamentals and Applications
From Everand
Markov Decision Process: Fundamentals and Applications
Fouad Sabry
No ratings yet
Dynamic Programming
No ratings yet
Dynamic Programming
11 pages
Hiller - Dynamic Programming PDF
No ratings yet
Hiller - Dynamic Programming PDF
6 pages
Dynamic_Programming
No ratings yet
Dynamic_Programming
37 pages
dynamic programming
No ratings yet
dynamic programming
6 pages
Dynamic Programming: of Optimality
No ratings yet
Dynamic Programming: of Optimality
11 pages
Dynamic Programming: Xiaolan Xie
No ratings yet
Dynamic Programming: Xiaolan Xie
97 pages
Notas - Dynamic Optimation and Optimal Control
No ratings yet
Notas - Dynamic Optimation and Optimal Control
26 pages
Daa Unit-3
No ratings yet
Daa Unit-3
32 pages
InOpe - 6 - Dynamic Programming Exercises To Submit
No ratings yet
InOpe - 6 - Dynamic Programming Exercises To Submit
3 pages
Or Q&a
No ratings yet
Or Q&a
9 pages
5.4-Reinforcement Learning-Part1-Introduction
No ratings yet
5.4-Reinforcement Learning-Part1-Introduction
15 pages
MBA Thesis Guide
No ratings yet
MBA Thesis Guide
51 pages
Dynamics of People and Organization
No ratings yet
Dynamics of People and Organization
3 pages
Quick Check From Job Order Costing
No ratings yet
Quick Check From Job Order Costing
22 pages
Strategic Management Process
No ratings yet
Strategic Management Process
19 pages
21-22 - Laboratory Examination and Work-Up For Hematologic Disorders - DR - Ariful Hayat
No ratings yet
21-22 - Laboratory Examination and Work-Up For Hematologic Disorders - DR - Ariful Hayat
111 pages
Download Complete Conversation Analysis Comparative Perspectives 1st Edition Jack Sidnell PDF for All Chapters
100% (4)
Download Complete Conversation Analysis Comparative Perspectives 1st Edition Jack Sidnell PDF for All Chapters
77 pages
Motor Sizing Calculations: Selection Procedure
100% (1)
Motor Sizing Calculations: Selection Procedure
9 pages
Inicijalni 5. Raz
100% (2)
Inicijalni 5. Raz
3 pages
RD Main Session 1 English
No ratings yet
RD Main Session 1 English
20 pages
Moulana Rumi A Sufi Shia Muslim and His Matnavi
No ratings yet
Moulana Rumi A Sufi Shia Muslim and His Matnavi
91 pages
Lab Experiment Bernoulli
No ratings yet
Lab Experiment Bernoulli
13 pages
Leadership Is A Process by Which An Executive Can Direct, Guide and Influence The
No ratings yet
Leadership Is A Process by Which An Executive Can Direct, Guide and Influence The
8 pages
Understanding Ethnicity and Race
No ratings yet
Understanding Ethnicity and Race
2 pages
The Philippines
No ratings yet
The Philippines
17 pages
The Impact of Works and Writings of Dr. Jose Rizal in Our National Identity
No ratings yet
The Impact of Works and Writings of Dr. Jose Rizal in Our National Identity
15 pages
Subject Methods in History and Govt - 1
No ratings yet
Subject Methods in History and Govt - 1
140 pages
Company Brochure - Seventh Sense Talent Solutions
100% (2)
Company Brochure - Seventh Sense Talent Solutions
20 pages
Utbk Preparation Test For Students Grade 12 PDF
No ratings yet
Utbk Preparation Test For Students Grade 12 PDF
7 pages
Hostel Management System
No ratings yet
Hostel Management System
7 pages
A To Z of Women's Modern Fiqh (Complicated)
100% (2)
A To Z of Women's Modern Fiqh (Complicated)
127 pages
A Project Report On: Marketing Strategy of Dabur Vatika Hair Oil & Dabur Chyawanprash
50% (2)
A Project Report On: Marketing Strategy of Dabur Vatika Hair Oil & Dabur Chyawanprash
88 pages
Knitter, Issue 188, Apr 2023
100% (2)
Knitter, Issue 188, Apr 2023
100 pages
The Spatial and Temporal Properties of Elearning: An Exploratory Study Regarding The Students' Perspective
No ratings yet
The Spatial and Temporal Properties of Elearning: An Exploratory Study Regarding The Students' Perspective
14 pages
English 6 DLP 15 Writing Specific Direction On Given Situation
No ratings yet
English 6 DLP 15 Writing Specific Direction On Given Situation
19 pages
Psychology Practical Classes Upload 1
No ratings yet
Psychology Practical Classes Upload 1
4 pages
Roll of The Dice
No ratings yet
Roll of The Dice
6 pages
Opentuition Com: Project and Relationship Management
No ratings yet
Opentuition Com: Project and Relationship Management
102 pages
Total Nutritional Therapy: A Nutrition Education Program For Physicians
No ratings yet
Total Nutritional Therapy: A Nutrition Education Program For Physicians
7 pages
Assignment Defense Mechanisms
No ratings yet
Assignment Defense Mechanisms
4 pages