• DocumentCode
    1341101
  • Title

    Approximating Ergodic Average Reward Continuous-Time Controlled Markov Chains

  • Author

    Prieto-Rumeau, Tomás ; Lorenzo, Jos éMaría

  • Author_Institution
    Dept. of Stat. & Oper. Res., UNED, Madrid, Spain
  • Volume
    55
  • Issue
    1
  • fYear
    2010
  • Firstpage
    201
  • Lastpage
    207
  • Abstract
    We study the approximation of an ergodic average reward continuous-time denumerable state Markov decision process (MDP) by means of a sequence of MDPs. Our results include the convergence of the corresponding optimal policies and the optimal gains. For a controlled upwardly skip-free process, we show some computational results to illustrate the convergence theorems.
  • Keywords
    Markov processes; continuous time systems; convergence; optimal control; Markov decision process; convergence theorems; ergodic average reward continuous-time control; optimal gains; optimal policies; Adaptive control; Convergence; Operations research; Optimal control; Process control; State-space methods; Statistics; Terminology; Approximation of control problems; Ergodic Markov decision processes (MDPs); policy iteration algorithm;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.2009.2033848
  • Filename
    5340606