DocumentCode
723902
Title
Research on the control strategy of robot imitation learning based on KL divergence
Author
Jianjun Yu ; Tao Liu ; Xiaogang Ruan ; Congchi Xu ; Yusen Men
Author_Institution
Beijing Univ. of Technol., Beijing, China
fYear
2015
fDate
23-25 May 2015
Firstpage
6175
Lastpage
6180
Abstract
Imitation learning is a significant means of human learning, but also the main research field in the mechanism of bionic robot. This paper focuses on the imitation learning strategies of robot in the framework of probabilistic model. Discrete teaching data are used as training samples of Gaussian process to acquire demonstration trajectory. RBF neural network is adopted to express imitation control strategy. The imitation trajectory with imitation control strategy which contains unknown parameters is modeled by Gaussian process. KL divergence is constructed with the probability distribution of demonstration and imitation trajectory, and gradient descent method is used to minimize the KL divergence so as to seek the optimal strategy of imitation. Then the imitation task is learned gradually by updating the optimal strategy to imitative robot. The swing behavior of the articulated robot arm is used as the simulation task of imitation learning, and the result of simulation experiments demonstrates the effectiveness of the robotic control strategy for imitation learning based on KL divergence and RBF neural network.
Keywords
Gaussian processes; gradient methods; intelligent robots; learning systems; manipulators; neurocontrollers; probability; radial basis function networks; statistical distributions; Gaussian process; KL divergence minimization; RBF neural network; articulated robot arm swing behavior; bionic robot mechanism; demonstration trajectory; discrete teaching data; gradient descent method; intelligent robot; probabilistic model; probability distribution; robot imitation learning control strategy; training samples; Decision support systems; Robots; Bionic Robot; Demonstration Trajectory; Imitation Learning; KL Divergence; RBF Neural Network;
fLanguage
English
Publisher
ieee
Conference_Titel
Control and Decision Conference (CCDC), 2015 27th Chinese
Conference_Location
Qingdao
Print_ISBN
978-1-4799-7016-2
Type
conf
DOI
10.1109/CCDC.2015.7161922
Filename
7161922
Link To Document