Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes

Tamar, Aviv; Di Castro, Dotan; Mannor, Shie

Computer Science > Machine Learning

arXiv:1301.0104 (cs)

[Submitted on 1 Jan 2013]

Title:Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes

Authors:Aviv Tamar, Dotan Di Castro, Shie Mannor

View PDF

Abstract:In this paper we extend temporal difference policy evaluation algorithms to performance criteria that include the variance of the cumulative reward. Such criteria are useful for risk management, and are important in domains such as finance and process control. We propose both TD(0) and LSTD(lambda) variants with linear function approximation, prove their convergence, and demonstrate their utility in a 4-dimensional continuous state space problem.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1301.0104 [cs.LG]
	(or arXiv:1301.0104v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1301.0104
Journal reference:	JMLR Workshop and Conference Proceedings 28 (3): 495-503, 2013

Submission history

From: Aviv Tamar [view email]
[v1] Tue, 1 Jan 2013 16:25:17 UTC (104 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2013-01

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Aviv Tamar
Dotan Di Castro
Shie Mannor

export BibTeX citation

Computer Science > Machine Learning

Title:Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators