Title of article :
Another set of conditions for Markov decision processes
with average sample-path costs
Author/Authors :
Quanxin Zhu، نويسنده ,
Issue Information :
دوهفته نامه با شماره پیاپی سال 2006
Abstract :
This paper deals with discrete-time Markov decision processes with average sample-path costs (ASPC)
in Borel spaces. The costs may have neither upper nor lower bounds. We propose new conditions for
the existence of ε-ASPC-optimal (deterministic) stationary policies in the class of all randomized historydependent
policies. Our conditions are weaker than those in the previous literature. Moreover, some sufficient
conditions for the existence of ASPC optimal stationary policies are imposed on the primitive data
of the model. In particular, the stochastic monotonicity condition in this paper has first been used to study
the ASPC criterion. Also, the approach provided here is slightly different from the “optimality equation
approach” widely used in the previous literature. On the other hand, under mild assumptions we show that
average expected cost optimality and ASPC-optimality are equivalent. Finally, we use a controlled queueing
system to illustrate our results.
© 2006 Elsevier Inc. All rights reserved
Keywords :
Average sample-path cost , Optimality inequality , Optimal stationarypolicy , Discrete-time Markov decision process
Journal title :
Journal of Mathematical Analysis and Applications
Journal title :
Journal of Mathematical Analysis and Applications