Title of article :
Semi-Markov decision problems and performance sensitivity analysis
Author/Authors :
Cao، Xi-Ren نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2003
Pages :
12
From page :
758
To page :
769
Abstract :
Recent research indicates that Markov decision processes (MDPs) can be viewed from a sensitivity point of view; and the perturbation analysis (PA), MDPs, and reinforcement learning (RL) are three closely related areas in optimization of discrete-event dynamic systems that can be modeled as Markov processes. The goal of this paper is two-fold. First, we develop the PA theory for semi-Markov processes (SMPs); and then we extend the aforementioned results about the relation among PA, MDP, and RL to SMPs. In particular, we show that performance sensitivity formulas and policy iteration algorithms of semi-Markov decision processes can be derived based on the performance potential and realization matrix. Both the long-run average and discounted-cost problems are considered. This approach provides a unified framework for both problems, and the long-run average problem corresponds to the discounted factor being zero. The results indicate that performance sensitivities and optimization depend only on firstorder statistics. Single sample path-based implementations are discussed.
Keywords :
heat transfer , Analytical and numerical techniques , natural convection
Journal title :
IEEE Transactions on Automatic Control
Serial Year :
2003
Journal title :
IEEE Transactions on Automatic Control
Record number :
97500
Link To Document :
بازگشت