• DocumentCode
    2974479
  • Title

    Asymptotically efficient adaptive allocation schemes for controlled Markov chains: finite parameter space

  • Author

    Agrawal, Rajeev ; Teneketzis, Demosthenis ; Anantharam, Venkatachalam

  • Author_Institution
    Dept. of Electr. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
  • fYear
    1988
  • fDate
    7-9 Dec 1988
  • Firstpage
    1198
  • Abstract
    The authors consider a controlled Markov chain whose transition probabilities and initial distribution are parameterized by an unknown parameter θ to some known parameter space Θ. There is a one-step reward associated with each pair of control and following state of the process. The objective is to maximize the expected value of the sum of one-step rewards over an infinite horizon. By introducing the loss associated with a control scheme, the authors show that the problem is equivalent to minimizing the loss. They define uniformly good adaptive control schemes and restrict attention to these schemes. They develop a lower bound on the loss associated with any uniformly good control scheme. Finally, they construct an adaptive control scheme whose loss equals the lower bound and is therefore optimal
  • Keywords
    Markov processes; adaptive control; probability; adaptive allocation schemes; adaptive control; controlled Markov chains; infinite horizon; initial distribution; one-step reward; transition probabilities; Adaptive control; Adaptive signal processing; Character generation; Communication system control; Infinite horizon; Laboratories; Optimal control; Programmable control; Stochastic systems; Weight control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 1988., Proceedings of the 27th IEEE Conference on
  • Conference_Location
    Austin, TX
  • Type

    conf

  • DOI
    10.1109/CDC.1988.194511
  • Filename
    194511