0% found this document useful (0 votes)

17 views24 pages

Session 10

MARKOV

Uploaded by

Irfan Khilji

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views24 pages

Session 10

MARKOV

Uploaded by

Irfan Khilji

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

MARKOV DECISION PROCESS

AND APPLICATIONS – SESSION 10

Sumanta Basu
Professor
Operations Management Group
Indian Institute of Management Calcutta
MARKOV DECISION PROCESS (MDP)
AN EXAMPLE
 A manufacturer has one key machine used in the
production process. Because of the heavy use, the
machine deteriorates rapidly in both quality and
output. Therefore a thorough inspection is done
at the end of each week to classify the condition
of the machine into one of the four possible
states:
State Condition
0 Good as new
1 Operable – minor deterioration
2 Operable – major deterioration
3 Inoperable
EXAMPLE: MDP
 From historical data, the following matrix shows
the relative frequency (probability) of each
possible transition from the state in one week to
the state in the following week. (All states follow
Markovian property)
0 1 2 3
0 0 7/8 1/16 1/16
1 0 3/4 1/8 1/8
2 0 0 1/2 1/2
3 0 0 0 1
EXAMPLE: MDP
 So the revised transition probability matrix will
be:
0 1 2 3
0 0 7/8 1/16 1/16
1 0 3/4 1/8 1/8
2 0 0 1/2 1/2
3 0 0 0 1

0 1 2 3
0 0 7/8 1/16 1/16
1 0 3/4 1/8 1/8
2 0 0 1/2 1/2
3 1 0 0 0
MDP: DECISION ALTERNATIVES
 Given the scenario, what are the other decisions
we can take?
Decision Action Relevant States
1 Do nothing 0 (New),1 (Minor), 2
(Major)
2 Overhaul (Return system to 2 (Major)
state 1)
3 Overhauling can(Return
Replace take place only
system to when
1, 2,the system is
3 (Inoperable)
in state 2state
and0)it takes 1 week.
MDP: COST STRUCTURE FOR DECISION
MAKING
Decision State Expected cost for Maintenance / Cost of lost Total cost
producing defective Replacement cost production per week
items
Do 0 0 0 0 0
nothing
1 1000 0 0 1000

2 3000 0 0 3000

Overhaul 2 0 2000 2000 4000

Replace 1,2,3 0 4000 2000 6000

MDP: POLICIES CHOSEN
 Decision set is as follows:
Decision Action Relevant States
1 Do nothing 0,1,2
2 Overhaul (return system to state 1) 2
3 Replace (Return system to state 0) 1,2,3

 Policy is a combination of multiple decisions to be

chosen in different states: Decision State

Policy Verbal Description d0[R] d1[R] d2[R] d3[R]

Ra Replace in state 3 1 1 1 3
Rb Replace in state 3, 1 1 2 3
overhaul in state 2
Rc Replace in states 2 & 3 1 1 3 3
Rd Replace in states 1, 2 & 3 1 3 3 3
MDP: TRANSITION PROBABILITY MATRIX
FOR POLICY RA

 Policy Ra:
Policy Verbal Description d0[R] d1[R] d2[R] d3[R]
Ra Replace in state 3 1 1 1 3

0 1 2 3 Initial
0 0 7/8 1/16 1/16 TPM
1 0 3/4 1/8 1/8
2 0 0 1/2 1/2
3 0 0 0 1

0 1 2 3
0 0 7/8 1/16 1/16
1 0 3/4 1/8 1/8
2 0 0 1/2 1/2
3 1 0 0 0
MDP: TRANSITION PROBABILITY MATRIX
FOR POLICY RB

 Policy Rb:
Policy Verbal Description d0[R] d1[R] d2[R] d3[R]
Rb Replace in state 3, 1 1 2 3
overhaul in state 2
0 1 2 3 Initial
0 0 7/8 1/16 1/16 TPM
1 0 3/4 1/8 1/8
2 0 0 1/2 1/2
3 0 0 0 1

0 1 2 3
0 0 7/8 1/16 1/16
1 0 3/4 1/8 1/8
2 0 1 0 0
3 1 0 0 0
MDP: TRANSITION PROBABILITY MATRIX
FOR POLICY RC

 Policy Rc:
Policy Verbal Description d0[R] d1[R] d2[R] d3[R]
Rc Replace in states 2 & 3 1 1 3 3

0 1 2 3 Initial
0 0 7/8 1/16 1/16 TPM
1 0 3/4 1/8 1/8
2 0 0 1/2 1/2
3 0 0 0 1

0 1 2 3
0 0 7/8 1/16 1/16
1 0 3/4 1/8 1/8
2 1 0 0 0
3 1 0 0 0
MDP: TRANSITION PROBABILITY MATRIX
FOR POLICY RD

 Policy Rd:
Policy Verbal Description d0[R] d1[R] d2[R] d3[R]
Rd Replace in states 1, 2 & 3 1 3 3 3

0 1 2 3 Initial
0 0 7/8 1/16 1/16 TPM
1 0 3/4 1/8 1/8
2 0 0 1/2 1/2
3 0 0 0 1

0 1 2 3
0 0 7/8 1/16 1/16
1 1 0 0 0
2 1 0 0 0
3 1 0 0 0
MDP: EXPECTED AVERAGE COST
Decision State Expected cost for Maintenance Cost of lost Total cost
producing defective items cost production per week
Do nothing 0 0 0 0 0
(1)
1 1000 0 0 1000

2 3000 0 0 3000

Overhaul (2) 2 0 2000 2000 4000

Replace (3) 1,2,3 0 4000 2000 6000

 Expected average cost of each policy is calculated as :

Policy Decisions in states (π0, π1, π2, π3) E(C)
(0,1,2,3)
Ra (1,1,1,3) (2/13, 7/13, 2/13, 2/13) 1/13[2(0) + 7(1) + 2(3) + 2(6)] = Rs. 1923
Rb (1,1,2,3) (2/21, 5/7, 2/21, 2/21) 1/21[2(0) + 15(1) + 2(4) + 2(6)] = Rs. 1667
Rc (1,1,3,3,) (2/11, 7/11, 1/11, 1/11) 1/11[2(0) + 7(1) + 1(6) + 1(6)] = Rs. 1727
Rd (1,3,3,3) (1/2, 7/16, 1/32, 1/32) 1/32[16(0) + 14(6) + 1(6) + 1(6)] = Rs. 3000
STEPS IN MARKOV DECISION PROCESS
1. Identify the basic transition probability matrix
after state identification
2. Identify the possible decisions which can be
exercised in each state
3. Develop policy by defining the decision to be taken
in each state
4. Re-create TPM for each policy
5. Calculate steady state probability values of each
state in each policy
6. Re-create cost structure for each policy
7. Identify expected average cost of each policy
8. Choose the policy with minimum expected average
cost
ABSORBING STATES
 State ‘k’ is called an absorbing state if pkk = 1.
 fik : Probability of absorption into state ‘k’
starting from state ‘i’

 Application of Absorbing state:

 Gambler ruin problem
 Credit evaluation problem
EXAMPLE: CREDIT EVALUATION
Consider a credit card company which classifies
customers based on fully paid (state 0), 1 to 30 days
due (state 1), 31 to 60 days due (state 2) or bad
debt (state 3). Accounts are checked in each billing
cycle to determine the state of each customer. In
general, credit is not extended and customers are
supposed to pay their bills within 30 days. Part
payment is accepted from customers. If the part
payment is made by customers in state 1 (1 to 30
days due), they will remain in that state. If the part
payment is received from customers in state 2 (31
to 60 days due), they will move to state 1.
Customers from Bad-debt category (state 3) cannot
move up to any other state.
EXAMPLE: CREDIT EVALUATION

Amount
Due
0 Min Amount Due Total Amount Due

Date Due

0 30 days 60 days

State 0 State 1 State 2 State 3

EXAMPLE: CREDIT EVALUATION
 After examining data over past several years on
the progression data of an individual customer,
credit card company developed the following
transition matrix:
State 0: Fully 1: 1 to 30 2: 31 to 60 3: Bad debt
paid days due days due (> 60 days due)
0: Fully paid 1 0 0 0
1: 1 to 30 days 0.7 0.2 0.1 0
2: 31 to 60 days 0.5 0.1 0.2 0.2
3: Bad debt 0 0 0 1

Approximately what percentage of customers from state ‘1 to 30 days’ will end

up in being into bad debt category?
EXAMPLE OF PROBABILITY OF
ABSORPTION: CREDIT EVALUATION
 fik: Probability of absorption into state ‘k’
starting from state ‘i’ = ∑ ∀ = 0, 1, … ,

• fik = 0, if state i is recurrent or

i j k
another absorbing state
• fkk = 1
pij fjk
Single-step Probability of
transition absorption
from ‘j’ to ‘k’ 1 0 0 0
f13 = p10 f03 +p11 f13 + p12 f23 + p13 f33 0.7 0.2 0.1 0
f23 = p20 f03 +p21 f13 + p22 f23 + p23 f33 0.5 0.1 0.2 0.2
0 0 0 1
f13 = 0.032
f23 = 0.254
FIRST PASSAGE TIMES
 Number of transitions made by the process in
going from state ‘i’ to state ‘j’ for the first time

 Number of transitions made by the process to

come back to a particular state ‘i’ for the first
time is called the recurrence time for state ‘i’

Recurrence time for state 3

X0 X1 X2 X3 X4 X5
3 2 1 0 3 1

First passage time to go to

state 1 from state 3
FIRST PASSAGE TIMES (FPT): INVENTORY
EXAMPLE
 µij : expected first passage time from state ‘i’ to state ‘j’
To calculate expected FPT,
we may consider all possible
i j i k j ways by which values of FPT
can be calculated along with
corresponding probability
FPT is 1 with probability pij FPT is 2 with probability pik * pkj

 Transition probability matrix for the inventory

example:
0.080 0.184 0.368 0.368
0.632 0.368 0 0
0.264 0.368 0.368 0
0.080 0.184 0.368 0.368

 What will be the expected first passage times from state ‘3’
to state ‘0’ (µ30)?
FIRST PASSAGE TIMES (FPT)
Understanding FPT from state ‘i’ to state ‘j’ by considering a direct path
and through indirect path:

pij j The FPT is 1 with probability pij

i
ij = 1 + ∑
Time
period = 1
pik
k
j The FPT is (1+kj)
kj kj
FIRST PASSAGE TIMES (FPT): INVENTORY
EXAMPLE
0.080 0.184 0.368 0.368 What will be the expected first
0.632 0.368 0 0 passage times from state ‘3’ to state
0.264 0.368 0.368 0 ‘0’ (µ30)?
0.080 0.184 0.368 0.368

ij = 1 + ∑

=1+ + +
=1+ + +
=1+ + +
10 = 1.58 weeks
After inserting the transition
probability values:
20 = 2.51 weeks
30 = 3.50 weeks
= 1 + 0.184 + 0.368 + 0.368
= 1 + 0.368 + 0.368
= 1 + 0.368
RECURRENCE TIMES: INVENTORY
EXAMPLE
 μii : expected recurrence time for state ‘i’
 What will be the expected recurrence time for
state ‘0’, value of µ00 ?
 µ00 = 1+ p0110 + p0220 + p0330 = 3.50 weeks

 µ00 = 3.50 weeks = (1/π0)

Human Resource Management (CP-204)
No ratings yet
Human Resource Management (CP-204)
89 pages
Case-Based Reasoning Book PDF
100% (1)
Case-Based Reasoning Book PDF
183 pages
Presented By: Dr. G.L.Pahuja NIT Kurukshetra: Pahuja - Gl@yahoo - Co.in
No ratings yet
Presented By: Dr. G.L.Pahuja NIT Kurukshetra: Pahuja - Gl@yahoo - Co.in
152 pages
Reasoning Under Uncertainty
100% (1)
Reasoning Under Uncertainty
17 pages
VIT CSE BTech Course Plan
50% (2)
VIT CSE BTech Course Plan
76 pages
P & S Important Questions
100% (1)
P & S Important Questions
8 pages
Mathematical Tripos: at The End of The Examination
No ratings yet
Mathematical Tripos: at The End of The Examination
28 pages
Stochastic Processes Homework Solutions
100% (1)
Stochastic Processes Homework Solutions
8 pages
An Introduction To Markovchain Package
No ratings yet
An Introduction To Markovchain Package
35 pages
Session 7
No ratings yet
Session 7
35 pages
Wang 2018
No ratings yet
Wang 2018
34 pages
Bayesian Analysis For The Social Sciences 1st Edition Simon Jackman All Chapters Instant Download
100% (3)
Bayesian Analysis For The Social Sciences 1st Edition Simon Jackman All Chapters Instant Download
55 pages
Random Dynamical Systems in Finance Anatoliy Swishchuk Shafiqul Islam Download
No ratings yet
Random Dynamical Systems in Finance Anatoliy Swishchuk Shafiqul Islam Download
89 pages
Deterioration Prediction of Building Components
No ratings yet
Deterioration Prediction of Building Components
9 pages
Chapter 14 15 Homework
No ratings yet
Chapter 14 15 Homework
8 pages
A Novel Markov-Based Temporal-SoC Analysis For Characterizing PEV Charging Demand
No ratings yet
A Novel Markov-Based Temporal-SoC Analysis For Characterizing PEV Charging Demand
11 pages
Discretization of The Markov Regime Switching AR (1) Process: Yan Liu Wuhan University
No ratings yet
Discretization of The Markov Regime Switching AR (1) Process: Yan Liu Wuhan University
13 pages
Denoising Diffusion Implicit Models
No ratings yet
Denoising Diffusion Implicit Models
22 pages
International Journal of Network Security & Its Applications (IJNSA) - ERA, WJCI Indexed
No ratings yet
International Journal of Network Security & Its Applications (IJNSA) - ERA, WJCI Indexed
40 pages
Msgarch 1
No ratings yet
Msgarch 1
16 pages
Organizational Behaviour II
No ratings yet
Organizational Behaviour II
23 pages
Bayesian Model Averaging For Linear Regression Models
No ratings yet
Bayesian Model Averaging For Linear Regression Models
14 pages
Predicting The Learning Path To Learners Optimum 1
No ratings yet
Predicting The Learning Path To Learners Optimum 1
12 pages
Solved Problems: Nifweareinperiod0
No ratings yet
Solved Problems: Nifweareinperiod0
4 pages
Physics 6th Sem Syllabus
No ratings yet
Physics 6th Sem Syllabus
15 pages
Class Notes 3 On Pitch Strategy With Habitus Model
No ratings yet
Class Notes 3 On Pitch Strategy With Habitus Model
11 pages
Homework 1: This Is Problem 9.2 in Mor Harchol-Balter's Book
No ratings yet
Homework 1: This Is Problem 9.2 in Mor Harchol-Balter's Book
3 pages
Pset3 Solutions
No ratings yet
Pset3 Solutions
6 pages
Organizational Change at SBI
No ratings yet
Organizational Change at SBI
10 pages
Lovasz Discrete and Continuous
No ratings yet
Lovasz Discrete and Continuous
23 pages
Call For Papers, Journal: Mathematics (MDPI) - Special Issue: "Markov-Chain Modelling and Applications". Deadline: 28 February, 2021
No ratings yet
Call For Papers, Journal: Mathematics (MDPI) - Special Issue: "Markov-Chain Modelling and Applications". Deadline: 28 February, 2021
2 pages
CS2A April2025 Exam
No ratings yet
CS2A April2025 Exam
8 pages
Class Notes 1 On Communication Strategy PRath 2024 Section B
No ratings yet
Class Notes 1 On Communication Strategy PRath 2024 Section B
4 pages
Markov Chain
100% (2)
Markov Chain
14 pages
Markov Analysis
100% (2)
Markov Analysis
34 pages
Markov Analysis
No ratings yet
Markov Analysis
7 pages
Markov Chains
100% (7)
Markov Chains
91 pages
Algorithms: Evaluation of Diversification Techniques For Legal Information Retrieval
No ratings yet
Algorithms: Evaluation of Diversification Techniques For Legal Information Retrieval
24 pages
Maintenance I
No ratings yet
Maintenance I
32 pages
AI Lec4 MarkovDecisionProcess&RL
No ratings yet
AI Lec4 MarkovDecisionProcess&RL
34 pages
RL Module 4
No ratings yet
RL Module 4
50 pages
Markov Process: Properties, Analysis and Applications: Ajay Kumar
No ratings yet
Markov Process: Properties, Analysis and Applications: Ajay Kumar
113 pages
Markov Process: Properties, Analysis and Applications: Ajay Kumar
No ratings yet
Markov Process: Properties, Analysis and Applications: Ajay Kumar
113 pages
Lec5 Markov Chain
No ratings yet
Lec5 Markov Chain
43 pages
Exercises Markov
No ratings yet
Exercises Markov
14 pages
Week 9 - Probabilistic Dynamic Programming
No ratings yet
Week 9 - Probabilistic Dynamic Programming
45 pages
SP 10 Markov Decision Process
No ratings yet
SP 10 Markov Decision Process
20 pages
Markov Processes
No ratings yet
Markov Processes
60 pages
Materi Markov Chain
No ratings yet
Materi Markov Chain
26 pages
Stochastic Processes
No ratings yet
Stochastic Processes
8 pages
Schaefer MDP
No ratings yet
Schaefer MDP
47 pages
Markov 2
No ratings yet
Markov 2
14 pages
Session 9
No ratings yet
Session 9
14 pages
Markov Chain - Monte Carlo Extension Lecture
No ratings yet
Markov Chain - Monte Carlo Extension Lecture
48 pages
Chapter 5 - Markov Processes - Part 1 - Ergodic and Terminating MP
No ratings yet
Chapter 5 - Markov Processes - Part 1 - Ergodic and Terminating MP
21 pages
DTMC Ss
No ratings yet
DTMC Ss
18 pages
Markov Processes
100% (1)
Markov Processes
19 pages
MSC Day 3
No ratings yet
MSC Day 3
117 pages
Markov Chain
No ratings yet
Markov Chain
32 pages
Dynamic Programming and Markov Processes
No ratings yet
Dynamic Programming and Markov Processes
152 pages
Marko
No ratings yet
Marko
25 pages
16.6-2. A Manufacturer Has A Machine That, When Operational at The Beginning of A Day, Has A
No ratings yet
16.6-2. A Manufacturer Has A Machine That, When Operational at The Beginning of A Day, Has A
12 pages
Bahan Bacaan 3
No ratings yet
Bahan Bacaan 3
5 pages
Markov
No ratings yet
Markov
11 pages
Marko
No ratings yet
Marko
25 pages
Cadeia de Markov
No ratings yet
Cadeia de Markov
178 pages
Motivation Example: Isye 3232C Stochastic Manufacturing and Service Systems Fall 2015 Yl. Chang
No ratings yet
Motivation Example: Isye 3232C Stochastic Manufacturing and Service Systems Fall 2015 Yl. Chang
28 pages
Discrete-Time Markov Chains: MIE1605 Lecture 2
No ratings yet
Discrete-Time Markov Chains: MIE1605 Lecture 2
51 pages
Chapter 19solutions Ch19
No ratings yet
Chapter 19solutions Ch19
26 pages
Safety System Availability: 1oo2d and TMR
No ratings yet
Safety System Availability: 1oo2d and TMR
9 pages
A Markov Chain Model in Decision Making
No ratings yet
A Markov Chain Model in Decision Making
8 pages
Week 3 Decision Makıng Under Uncertaınıty
No ratings yet
Week 3 Decision Makıng Under Uncertaınıty
46 pages
Chapter 4 Markov Chain
No ratings yet
Chapter 4 Markov Chain
39 pages
Tut 4
No ratings yet
Tut 4
3 pages
EE5712 Power System Reliability:: Review of Stochastic Process
No ratings yet
EE5712 Power System Reliability:: Review of Stochastic Process
73 pages
New CZ3005 Module 4 - Markov Decision Process
No ratings yet
New CZ3005 Module 4 - Markov Decision Process
38 pages
Markovian Decision Process
No ratings yet
Markovian Decision Process
27 pages
2.markov Chaains
No ratings yet
2.markov Chaains
17 pages
Anderson Sweeney Williams: Quantitative Methods For Business 8E
No ratings yet
Anderson Sweeney Williams: Quantitative Methods For Business 8E
23 pages
12 dtmc1
No ratings yet
12 dtmc1
43 pages
Markov Chains Applications & Properties
No ratings yet
Markov Chains Applications & Properties
26 pages
19.5 Markov Decision Processes: Resolving Unbounded Expected Rewards
No ratings yet
19.5 Markov Decision Processes: Resolving Unbounded Expected Rewards
13 pages
IE 301 Fall 2019 Recitation 12 Solutions PDF
No ratings yet
IE 301 Fall 2019 Recitation 12 Solutions PDF
6 pages
Notes Markov Analysis
No ratings yet
Notes Markov Analysis
5 pages
Linear Programming Assignment 5: 1 Chapter 17 (Section 17.2)
No ratings yet
Linear Programming Assignment 5: 1 Chapter 17 (Section 17.2)
4 pages
HW7Solutions PDF
No ratings yet
HW7Solutions PDF
3 pages
Multivariable Predictive Control: Applications in Industry
From Everand
Multivariable Predictive Control: Applications in Industry
Sandip K. Lahiri
No ratings yet
Markov Decision Process
No ratings yet
Markov Decision Process
11 pages
Seatwork Or2
No ratings yet
Seatwork Or2
3 pages
The Magnesium Stearate Handbook
From Everand
The Magnesium Stearate Handbook
Patrick C. Okoye
No ratings yet
Power, Distribution & Specialty Transformers World Summary: Market Values & Financials by Country
From Everand
Power, Distribution & Specialty Transformers World Summary: Market Values & Financials by Country
Editorial DataGroup
No ratings yet
CISA EXAM-Testing Concept-Recovery Time Objective (RTO) & Recovery Point Objective (RPO)
From Everand
CISA EXAM-Testing Concept-Recovery Time Objective (RTO) & Recovery Point Objective (RPO)
Hemang Doshi
1/5 (2)
Analog Dialogue, Volume 48, Number 1: Analog Dialogue, #13
From Everand
Analog Dialogue, Volume 48, Number 1: Analog Dialogue, #13
Analog Dialogue
4/5 (1)
Joint Photographic Experts Group: Unlocking the Power of Visual Data with the JPEG Standard
From Everand
Joint Photographic Experts Group: Unlocking the Power of Visual Data with the JPEG Standard
Fouad Sabry
No ratings yet

Session 10

Uploaded by

Session 10

Uploaded by

MARKOV DECISION PROCESS

AND APPLICATIONS – SESSION 10

Overhaul 2 0 2000 2000 4000

Replace 1,2,3 0 4000 2000 6000

 Policy is a combination of multiple decisions to be

Policy Verbal Description d0[R] d1[R] d2[R] d3[R]

Overhaul (2) 2 0 2000 2000 4000

Replace (3) 1,2,3 0 4000 2000 6000

 Expected average cost of each policy is calculated as :

 Application of Absorbing state:

State 0 State 1 State 2 State 3

Approximately what percentage of customers from state ‘1 to 30 days’ will end

• fik = 0, if state i is recurrent or

 Number of transitions made by the process to

Recurrence time for state 3

First passage time to go to

 Transition probability matrix for the inventory

pij j The FPT is 1 with probability pij

 µ00 = 3.50 weeks = (1/π0)

You might also like