• DocumentCode
    723902
  • Title

    Research on the control strategy of robot imitation learning based on KL divergence

  • Author

    Jianjun Yu ; Tao Liu ; Xiaogang Ruan ; Congchi Xu ; Yusen Men

  • Author_Institution
    Beijing Univ. of Technol., Beijing, China
  • fYear
    2015
  • fDate
    23-25 May 2015
  • Firstpage
    6175
  • Lastpage
    6180
  • Abstract
    Imitation learning is a significant means of human learning, but also the main research field in the mechanism of bionic robot. This paper focuses on the imitation learning strategies of robot in the framework of probabilistic model. Discrete teaching data are used as training samples of Gaussian process to acquire demonstration trajectory. RBF neural network is adopted to express imitation control strategy. The imitation trajectory with imitation control strategy which contains unknown parameters is modeled by Gaussian process. KL divergence is constructed with the probability distribution of demonstration and imitation trajectory, and gradient descent method is used to minimize the KL divergence so as to seek the optimal strategy of imitation. Then the imitation task is learned gradually by updating the optimal strategy to imitative robot. The swing behavior of the articulated robot arm is used as the simulation task of imitation learning, and the result of simulation experiments demonstrates the effectiveness of the robotic control strategy for imitation learning based on KL divergence and RBF neural network.
  • Keywords
    Gaussian processes; gradient methods; intelligent robots; learning systems; manipulators; neurocontrollers; probability; radial basis function networks; statistical distributions; Gaussian process; KL divergence minimization; RBF neural network; articulated robot arm swing behavior; bionic robot mechanism; demonstration trajectory; discrete teaching data; gradient descent method; intelligent robot; probabilistic model; probability distribution; robot imitation learning control strategy; training samples; Decision support systems; Robots; Bionic Robot; Demonstration Trajectory; Imitation Learning; KL Divergence; RBF Neural Network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control and Decision Conference (CCDC), 2015 27th Chinese
  • Conference_Location
    Qingdao
  • Print_ISBN
    978-1-4799-7016-2
  • Type

    conf

  • DOI
    10.1109/CCDC.2015.7161922
  • Filename
    7161922