• DocumentCode
    2379902
  • Title
    Optimal trade-off between exploration and exploitation
  • Author
    Simpkins, Alex ; De Callafon, Raymond ; Todorov, Emanuel
  • Author_Institution
    Dept. of Mech. & Aerosp. Eng., Univ. of California, San Diego, La Jolla, CA
  • fYear
    2008
  • fDate
    11-13 June 2008
  • Firstpage
    33
  • Lastpage
    38
  • Abstract
    Control in an uncertain environment often involves a trade-off between exploratory actions, whose goal is to gather sensory information, and "regular" actions, which exploit the information gathered so far and pursue the task objectives. In principle, both types of action can be modeled by minimizing a single cost function within the framework of stochastic optimal control. In practice, however, this is difficult because the control law must be sensitive to estimation uncertainty, which violates the certainty-equivalence principle. In this paper we formalize the problem in a way that captures the essence of the exploration-exploitation trade-off and yet is amenable to numerical methods for optimal control. The key to our approach is augmenting the dynamics of the partially-observable plant with the Kalman filter dynamics, thus obtaining a higher-dimensional but fully-observable plant. The resulting control laws compare favorably to other, more ad hoc approaches. Our formalism is also suitable for modeling human behavior in tasks that benefit from active exploration.
  • Keywords
    numerical analysis; optimal control; uncertain systems; Kalman filter dynamics; certainty-equivalence principle; exploration-exploitation trade-off; partially-observable plant; sensory information; single cost function; stochastic optimal control; uncertain environment; Adaptive control; Biological system modeling; Cost function; Feedback; Fingers; Humans; Optimal control; Signal generators; Stochastic processes; Uncertainty
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    American Control Conference, 2008
  • Conference_Location
    Seattle, WA
  • ISSN
    0743-1619
  • Print_ISBN
    978-1-4244-2078-0
  • Electronic_ISBN
    0743-1619
  • Type
    conf
  • DOI
    10.1109/ACC.2008.4586462
  • Filename
    4586462