• DocumentCode
    3482991
  • Title

    Computationally efficient adaptive control algorithms for Markov chains

  • Author

    Jalali, A. ; Ferguson, M.

  • Author_Institution
    Dept. of Electr. Eng., McGill Univ., Montreal, Que., Canada
  • fYear
    1989
  • fDate
    13-15 Dec 1989
  • Firstpage
    1283
  • Abstract
    Algorithms for adaptive control of unknown finite Markov chains are proposed. The algorithms consist of two parts: part one estimates the unknown parameters; part two computes the optimal policy. In this study the emphasis is on efficient online computation of the optimal policy. No a priori knowledge of the optimal policy is assumed. The optimal policy is computed recursively online. At each step a small amount of computation is required. At each transition of the chain, only the act corresponding to the present state of the chain is updated. The algorithms are easy to implement and converge to the optimal policy in finite time
  • Keywords
    Markov processes; adaptive control; parameter estimation; Markov chains; Markov processes; adaptive control algorithms; online computation; optimal policy; parameter estimation; Adaptive control; Automatic control; Convergence; Cost function; Heuristic algorithms; Learning automata; Maximum likelihood estimation; Optimal control; Parameter estimation; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 1989., Proceedings of the 28th IEEE Conference on
  • Conference_Location
    Tampa, FL
  • Type

    conf

  • DOI
    10.1109/CDC.1989.70344
  • Filename
    70344