DocumentCode :
3312160
Title :
Asymptotic properties of Markov decision processes
Author :
Brockett, Roger
Author_Institution :
Sch. of Eng. & Appl. Sci., Harvard Univ., Cambridge, MA, USA
fYear :
2009
fDate :
15-18 Dec. 2009
Firstpage :
3587
Lastpage :
3591
Abstract :
In our earlier paper we derived the optimal control policy for a time varying, finite horizon, continuous time Markov processes subject to a quadratic penalty on the amount of the rate adjustment. Some aspects of the time invariant, infinite horizon problem were resolved but questions remained. In this paper we consider a more general class of performance measures and constraints on the controls. After deriving the appropriate Hamilton-Jacobi equation, we discuss asymptotic properties. The central question here relates to the possibility of that non constant (e.g., periodic) steady state policies may have better average performance than the best constant policy.
Keywords :
Jacobian matrices; Markov processes; optimal control; Hamilton-Jacobi equation; continuous time Markov decision processes; infinite horizon problem; optimal control policy; quadratic penalty; rate adjustment; Bismuth; Cost function; Counting circuits; Differential equations; Feedback control; Infinite horizon; Markov processes; Optimal control; State-space methods; Steady-state;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 2009 held jointly with the 2009 28th Chinese Control Conference. CDC/CCC 2009. Proceedings of the 48th IEEE Conference on
Conference_Location :
Shanghai
ISSN :
0191-2216
Print_ISBN :
978-1-4244-3871-6
Electronic_ISBN :
0191-2216
Type :
conf
DOI :
10.1109/CDC.2009.5400564
Filename :
5400564
Link To Document :
بازگشت