• DocumentCode
    3082349
  • Title

    Adaptive control of Markov chains under the weak accessibility

  • Author

    Agrawal, Rajeev

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI, USA
  • fYear
    1990
  • fDate
    5-7 Dec 1990
  • Firstpage
    1426
  • Abstract
    The author considers the adaptive control of Markov chains under the weak accessibility condition with the objective of minimizing the learning loss. First, it is shown that, by using a stationary randomized control scheme, the maximum likelihood estimate of the unknown parameter converges exponentially fast to its true value. Then a certainty equivalence control with a forcing type scheme is constructed with alternative phases of forcing and certainty equivalence control. The stationary randomized control scheme for forcing is used in such a way that by cutting and pasting the resulting observations a single Markov chain is obtained. This in turn allows the rate of forcing to be chosen appropriately, giving a learning loss of O(f(n)log n) for any function f (n)→∞ as n→∞
  • Keywords
    Markov processes; adaptive control; probability; stochastic systems; Markov chains; adaptive control; certainty equivalence control; convergence; forcing; learning loss minimization; maximum likelihood estimate; stationary randomized control scheme; weak accessibility; Adaptive control; Arm; Costs; Optimal control; Parameter estimation; State-space methods; Stochastic processes; Stochastic systems; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 1990., Proceedings of the 29th IEEE Conference on
  • Conference_Location
    Honolulu, HI
  • Type

    conf

  • DOI
    10.1109/CDC.1990.203847
  • Filename
    203847