DocumentCode :
404182
Title :
Constructing performance sensitivities of Markov systems with potentials as building blocks
Author :
Cao, Xi-Ren
Author_Institution :
Hong Kong Univ. of Sci. & Technol., Kowloon, China
Volume :
5
fYear :
2003
fDate :
9-12 Dec. 2003
Firstpage :
4405
Abstract :
We study the structure of sample paths of Markov systems by using performance potentials as the fundamental units. With a sample path-based approach, we show that performance sensitivities of Markov systems can be constructed by using performance potentials (or equivalently, perturbation realization factors) as building blocks. We propose an intuitive approach to derive, by first principles, formulas for performance derivatives and performance differences for two Markov chains. These formulas are the basis for performance optimization of discrete event dynamic systems, including perturbation analysis, Markov decision processes, and reinforcement learning.
Keywords :
Markov processes; discrete event systems; learning (artificial intelligence); optimisation; perturbation techniques; stochastic systems; Markov chains; Markov decision processes; Markov systems; discrete event dynamic systems; performance derivatives; performance differences; performance optimization; performance potentials; performance sensitivity; perturbation analysis; perturbation realization factors; reinforcement learning; Approximation algorithms; Approximation methods; Learning; Markov processes; Optimization; Performance analysis; Q factor; Stochastic processes; Terminology; User-generated content;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 2003. Proceedings. 42nd IEEE Conference on
ISSN :
0191-2216
Print_ISBN :
0-7803-7924-1
Type :
conf
DOI :
10.1109/CDC.2003.1272203
Filename :
1272203
Link To Document :
بازگشت