• DocumentCode
    3015089
  • Title

    Reinforcement learning of motor skills in high dimensions: A path integral approach

  • Author

    Theodorou, Evangelos ; Buchli, Jonas ; Schaal, Stefan

  • Author_Institution
    Comput. Learning & Motor Control Lab., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2010
  • fDate
    3-7 May 2010
  • Firstpage
    2397
  • Lastpage
    2403
  • Abstract
    Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far due to the computational difficulties that reinforcement learning encounters in high dimensional continuous state-action spaces. In this paper, we derive a novel approach to RL for parameterized control policies based on the framework of stochastic optimal control with path integrals. While solidly grounded in optimal control theory and estimation theory, the update equations for learning are surprisingly simple and have no danger of numerical instabilities as neither matrix inversions nor gradient learning rates are required. Empirical evaluations demonstrate significant performance improvements over gradient-based policy learning and scalability to high-dimensional control problems. Finally, a learning experiment on a robot dog illustrates the functionality of our algorithm in a real-world scenario. We believe that our new algorithm, Policy Improvement with Path Integrals (PI2), offers currently one of the most efficient, numerically robust, and easy to implement algorithms for RL in robotics.
  • Keywords
    control engineering computing; intelligent robots; learning (artificial intelligence); learning systems; optimal control; stochastic systems; complex motor system; continuous state-action spaces; estimation theory; gradient-based policy learning; high-dimensional control problem; learning control; motor skills; parameterized control policy; path integral approach; path integrals; policy improvement; reinforcement learning; robot dog; stochastic optimal control; Control systems; Function approximation; Inference algorithms; Integral equations; Learning systems; Optimal control; Robots; Scalability; Stochastic processes; Stochastic resonance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation (ICRA), 2010 IEEE International Conference on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1050-4729
  • Print_ISBN
    978-1-4244-5038-1
  • Electronic_ISBN
    1050-4729
  • Type

    conf

  • DOI
    10.1109/ROBOT.2010.5509336
  • Filename
    5509336