DocumentCode :
1188042
Title :
The potential structure of sample paths and performance sensitivities of Markov systems
Author :
Cao, Xi-Ren
Author_Institution :
Hong Kong Univ. of Sci. & Technol., China
Volume :
49
Issue :
12
fYear :
2004
Firstpage :
2129
Lastpage :
2142
Abstract :
We study the structure of sample paths of Markov systems by using performance potentials as the fundamental units. With a sample path-based approach, we show that performance sensitivity formulas (performance gradients and performance differences) of Markov systems can be constructed intuitively, by first principles, with performance potentials (or equivalently, perturbation realization factors) as building blocks. In particular, we derive sensitivity formulas for two Markov chains with possibly different state spaces. The proposed approach can be used to obtain flexibly the sensitivity formulas for a wide range of problems, including those with partial information. These formulas are the basis for performance optimization of discrete event dynamic systems, including perturbation analysis, Markov decision processes, and reinforcement learning. The approach thus provides insight on on-line learning and performance optimization and opens up new research directions. Sample path based algorithms can be developed.
Keywords :
Markov processes; discrete event systems; learning (artificial intelligence); optimisation; perturbation techniques; Markov decision processes; Markov systems; discrete event dynamic system performance optimization; performance sensitivities; perturbation analysis; reinforcement learning; sample path potential structure; Control system analysis; Control theory; Information analysis; Information technology; Learning; Markov processes; Operations research; Optimization; Performance analysis; State-space methods;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.2004.838494
Filename :
1369391
Link To Document :
بازگشت