• DocumentCode
    1734812
  • Title

    Discriminative Apprenticeship Learning with Both Preference and Non-preference Behavior

  • Author

    Dingsheng Luo ; Yi Wang ; Xihong Wu

  • Author_Institution
    Key Lab. of Machine Perception (Minist. of Educ.), Peking Univ., Beijing, China
  • Volume
    1
  • fYear
    2013
  • Firstpage
    315
  • Lastpage
    320
  • Abstract
    Considering that expert´s demonstrations are usually sub optimal and failed demonstrations often have some useful guidance, in this paper, a Discriminative Apprenticeship Learning algorithm is proposed, where the apprentice is taught with the join of failed attempts to acquire the ability that could discriminate the preference and non-preference cases so that to actively take a corresponding action. Since robot usually encounters changing environments, generalization ability is taken into account in the algorithm through which the reward function is recovered under the evaluation of generalization error. The problem of the representation error is also analyzed and involved in the algorithm. To ensure performance of the algorithm, theoretical guarantee is presented. Experiments on a simple car-driving robot and the comparison with a variety of inverse reinforcement learning methods are performed, which illustrate the proposed method is an effective and promising alternative.
  • Keywords
    learning (artificial intelligence); robots; car-driving robot; changing environments; discriminative apprenticeship learning; generalization ability; generalization error; nonpreference behavior; preference behavior; reinforcement learning methods; reward function; Indium phosphide; Learning (artificial intelligence); Robots; Training; Training data; Trajectory; Vectors; Discriminative apprenticeship learning; Inverse reinforcement learning; Preference and non-preference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2013 12th International Conference on
  • Conference_Location
    Miami, FL
  • Type

    conf

  • DOI
    10.1109/ICMLA.2013.64
  • Filename
    6784634