• DocumentCode
    2409071
  • Title

    Adaptive control of i.i.d. processes and Markov chains on a compact control set

  • Author

    Agrawal, Rajeev

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI, USA
  • fYear
    1992
  • fDate
    1992
  • Firstpage
    2752
  • Abstract
    The author considers the multiarmed bandit problem and the adaptive control of Markov chains with continuous arms that are chosen from a compact subset of IRd. A learning scheme based on a kernel estimator is devised. Using this learning scheme, the author constructs a class of certainty equivalence control with forcing schemes and derives asymptotic upper bounds on their learning loss
  • Keywords
    Markov processes; adaptive control; estimation theory; game theory; learning systems; Markov chains; adaptive control; asymptotic upper bounds; certainty equivalence control; compact control set; forcing schemes; i.i.d. processes; kernel estimator; learning loss; learning scheme; multiarmed bandit problem; Adaptive control; Arm; Context modeling; Control systems; Kernel; Process control; Stochastic processes; Stochastic systems; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 1992., Proceedings of the 31st IEEE Conference on
  • Conference_Location
    Tucson, AZ
  • Print_ISBN
    0-7803-0872-7
  • Type

    conf

  • DOI
    10.1109/CDC.1992.371317
  • Filename
    371317