• DocumentCode
    3023556
  • Title

    Apprenticeship learning via soft local homomorphisms

  • Author

    Boularias, Abdeslam ; Chaib-Draa, Brahim

  • Author_Institution
    Comput. Sci. & Software Eng. Dept., Laval Univ., Quebec City, QC, Canada
  • fYear
    2010
  • fDate
    3-7 May 2010
  • Firstpage
    2971
  • Lastpage
    2976
  • Abstract
    We consider the problem of apprenticeship learning when the expert´s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IRL) provides an efficient solution to this problem based on the assumption that the expert is optimally acting in a Markov Decision Process (MDP). However, past work on IRL requires an accurate estimate of the frequency of encountering each feature of the states when the robot follows the expert´s policy. Given that the complete policy of the expert is unknown, the features frequencies can only be empirically estimated from the demonstrated trajectories. In this paper, we propose to use a transfer method, known as soft homomorphism, in order to generalize the expert´s policy to unvisited regions of the state space. The generalized policy can be used either as the robot´s final policy, or to calculate the features frequencies within an IRL algorithm. Empirical results show that our approach is able to learn good policies from a small number of demonstrations.
  • Keywords
    Markov processes; learning (artificial intelligence); robots; state-space methods; Markov decision process; apprenticeship learning; features frequency; inverse reinforcement learning; robot final policy; soft local homomorphism; state space; transfer method; Computer science; Frequency estimation; Learning; Orbital robotics; Robotics and automation; Robots; Software engineering; State estimation; State-space methods; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation (ICRA), 2010 IEEE International Conference on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1050-4729
  • Print_ISBN
    978-1-4244-5038-1
  • Electronic_ISBN
    1050-4729
  • Type

    conf

  • DOI
    10.1109/ROBOT.2010.5509717
  • Filename
    5509717