Title of article :
Another set of conditions for Markov decision processes with average sample-path costs
Author/Authors :
Quanxin Zhu، نويسنده ,
Issue Information :
دوهفته نامه با شماره پیاپی سال 2006
Pages :
16
From page :
1199
To page :
1214
Abstract :
This paper deals with discrete-time Markov decision processes with average sample-path costs (ASPC) in Borel spaces. The costs may have neither upper nor lower bounds. We propose new conditions for the existence of ε-ASPC-optimal (deterministic) stationary policies in the class of all randomized historydependent policies. Our conditions are weaker than those in the previous literature. Moreover, some sufficient conditions for the existence of ASPC optimal stationary policies are imposed on the primitive data of the model. In particular, the stochastic monotonicity condition in this paper has first been used to study the ASPC criterion. Also, the approach provided here is slightly different from the “optimality equation approach” widely used in the previous literature. On the other hand, under mild assumptions we show that average expected cost optimality and ASPC-optimality are equivalent. Finally, we use a controlled queueing system to illustrate our results. © 2006 Elsevier Inc. All rights reserved
Keywords :
Average sample-path cost , Optimality inequality , Optimal stationarypolicy , Discrete-time Markov decision process
Journal title :
Journal of Mathematical Analysis and Applications
Serial Year :
2006
Journal title :
Journal of Mathematical Analysis and Applications
Record number :
934857
Link To Document :
بازگشت