DocumentCode
404182
Title
Constructing performance sensitivities of Markov systems with potentials as building blocks
Author
Cao, Xi-Ren
Author_Institution
Hong Kong Univ. of Sci. & Technol., Kowloon, China
Volume
5
fYear
2003
fDate
9-12 Dec. 2003
Firstpage
4405
Abstract
We study the structure of sample paths of Markov systems by using performance potentials as the fundamental units. With a sample path-based approach, we show that performance sensitivities of Markov systems can be constructed by using performance potentials (or equivalently, perturbation realization factors) as building blocks. We propose an intuitive approach to derive, by first principles, formulas for performance derivatives and performance differences for two Markov chains. These formulas are the basis for performance optimization of discrete event dynamic systems, including perturbation analysis, Markov decision processes, and reinforcement learning.
Keywords
Markov processes; discrete event systems; learning (artificial intelligence); optimisation; perturbation techniques; stochastic systems; Markov chains; Markov decision processes; Markov systems; discrete event dynamic systems; performance derivatives; performance differences; performance optimization; performance potentials; performance sensitivity; perturbation analysis; perturbation realization factors; reinforcement learning; Approximation algorithms; Approximation methods; Learning; Markov processes; Optimization; Performance analysis; Q factor; Stochastic processes; Terminology; User-generated content;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 2003. Proceedings. 42nd IEEE Conference on
ISSN
0191-2216
Print_ISBN
0-7803-7924-1
Type
conf
DOI
10.1109/CDC.2003.1272203
Filename
1272203
Link To Document