• DocumentCode
    3516280
  • Title

    Reinforcement learning with misspecified model classes

  • Author

    Joseph, Jayaraj ; Geramifard, Alborz ; Roberts, John W. ; How, Jonathan P. ; Roy, Nicholas

  • Author_Institution
    Massachusetts Inst. of Technol., Cambridge, MA, USA
  • fYear
    2013
  • fDate
    6-10 May 2013
  • Firstpage
    939
  • Lastpage
    946
  • Abstract
    Real-world robots commonly have to act in complex, poorly understood environments where the true world dynamics are unknown. To compensate for the unknown world dynamics, we often provide a class of models to a learner so it may select a model, typically using a minimum prediction error metric over a set of training data. Often in real-world domains the model class is unable to capture the true dynamics, due to either limited domain knowledge or a desire to use a small model. In these cases we call the model class misspecified, and an unfortunate consequence of misspecification is that even with unlimited data and computation there is no guarantee the model with minimum prediction error leads to the best performing policy. In this work, our approach improves upon the standard maximum likelihood model selection metric by explicitly selecting the model which achieves the highest expected reward, rather than the most likely model. We present an algorithm for which the highest performing model from the model class is guaranteed to be found given unlimited data and computation. Empirically, we demonstrate that our algorithm is often superior to the maximum likelihood learner in a batch learning setting for two common RL benchmark problems and a third real-world system, the hydrodynamic cart-pole, a domain whose complex dynamics cannot be known exactly.
  • Keywords
    learning (artificial intelligence); maximum likelihood estimation; robots; batch learning setting; hydrodynamic cart-pole; maximum likelihood model; misspecified model classes; real world robots; reinforcement learning; world dynamics; Computational modeling; Data models; Equations; Mathematical model; Measurement; Standards; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation (ICRA), 2013 IEEE International Conference on
  • Conference_Location
    Karlsruhe
  • ISSN
    1050-4729
  • Print_ISBN
    978-1-4673-5641-1
  • Type

    conf

  • DOI
    10.1109/ICRA.2013.6630686
  • Filename
    6630686