DocumentCode :
2923037
Title :
Finite horizon analysis of infinite CTMDPs
Author :
Buchholz, Peter
Author_Institution :
Inf. IV, Tech. Univ. Dortmund, Dortmund, Germany
fYear :
2012
fDate :
25-28 June 2012
Firstpage :
1
Lastpage :
12
Abstract :
Continuous Time Markov Decision Processes (CTMDPs) are used to describe optimization problems in many applications including system maintenance and control. Often one is interested in a control strategy or policy to optimize the gain of a system over a finite interval which is denoted as finite horizon. The computation of an ε-optimal policy, i.e., a policy that reaches the optimal gain up to some small ε, is often hindered by state space explosion which means that state spaces of realistic models can be very large or even infinite. The paper presents new algorithms to compute approximately optimal policies for CTMDPs with large or infinite state spaces. The new approach allows one to compute bounds on the achievable gain and a policy to reach the lower bound using a variant of uniformization on a finite subset of the state space. It is also shown how the approach can be applied to models with unbounded rewards or transition rates for which uniformization cannot be applied per se.
Keywords :
Markov processes; infinite horizon; optimisation; ε-optimal policy; continuous time markov decision processes; control strategy; finite horizon analysis; finite interval; infinite CTMDP; infinite state spaces; large-state spaces; lower bound; optimal gain; optimization problems; realistic models; system gain optimization; transition rates; unbounded rewards; uniformization-based method; Computational modeling; Markov processes; Numerical models; Optimization; Transient analysis; Upper bound; Vectors; Continuous Time Markov Decision Processes; Finite Horizons; Numerical Analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Systems and Networks (DSN), 2012 42nd Annual IEEE/IFIP International Conference on
Conference_Location :
Boston, MA
ISSN :
1530-0889
Print_ISBN :
978-1-4673-1624-8
Electronic_ISBN :
1530-0889
Type :
conf
DOI :
10.1109/DSN.2012.6263929
Filename :
6263929
Link To Document :
بازگشت