• DocumentCode
    404182
  • Title

    Constructing performance sensitivities of Markov systems with potentials as building blocks

  • Author

    Cao, Xi-Ren

  • Author_Institution
    Hong Kong Univ. of Sci. & Technol., Kowloon, China
  • Volume
    5
  • fYear
    2003
  • fDate
    9-12 Dec. 2003
  • Firstpage
    4405
  • Abstract
    We study the structure of sample paths of Markov systems by using performance potentials as the fundamental units. With a sample path-based approach, we show that performance sensitivities of Markov systems can be constructed by using performance potentials (or equivalently, perturbation realization factors) as building blocks. We propose an intuitive approach to derive, by first principles, formulas for performance derivatives and performance differences for two Markov chains. These formulas are the basis for performance optimization of discrete event dynamic systems, including perturbation analysis, Markov decision processes, and reinforcement learning.
  • Keywords
    Markov processes; discrete event systems; learning (artificial intelligence); optimisation; perturbation techniques; stochastic systems; Markov chains; Markov decision processes; Markov systems; discrete event dynamic systems; performance derivatives; performance differences; performance optimization; performance potentials; performance sensitivity; perturbation analysis; perturbation realization factors; reinforcement learning; Approximation algorithms; Approximation methods; Learning; Markov processes; Optimization; Performance analysis; Q factor; Stochastic processes; Terminology; User-generated content;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2003. Proceedings. 42nd IEEE Conference on
  • ISSN
    0191-2216
  • Print_ISBN
    0-7803-7924-1
  • Type

    conf

  • DOI
    10.1109/CDC.2003.1272203
  • Filename
    1272203