DocumentCode
2057456
Title
Improving imitated grasping motions through interactive expected deviation learning
Author
Gräve, Kathrin ; Stückler, Jörg ; Behnke, Sven
Author_Institution
Autonomous Intell. Syst. Group, Univ. of Bonn, Bonn, Germany
fYear
2010
fDate
6-8 Dec. 2010
Firstpage
397
Lastpage
404
Abstract
One of the major obstacles that hinders the application of robots to human day-to-day tasks is the current lack of flexible learning methods to endow the robots with the necessary skills and to allow them to adapt to new situations. In this work, we present a new intuitive method for teaching a robot anthropomorphic motion primitives. Our method combines the advantages of reinforcement and imitation learning in a single coherent framework. In contrast to existing approaches that use human demonstrations merely as an initialization for reinforcement learning, our method treats both ways of learning as homologous modules and chooses the most appropriate one in every situation. We apply Gaussian Process Regression to generalize a measure of value across the combined state-action-space. Based on the expected value, uncertainty, and expected deviation of generalized movements, our method decides whether to ask for a human demonstration or to improve its performance on its own, using reinforcement learning. The latter employs a probabilistic search strategy, based on expected deviation, that greatly accelerates learning while protecting the robot from unpredictable movements at the same time. To evaluate the performance of our approach, we conducted a series of experiments and successfully trained a robot to grasp an object at arbitrary positions on a table.
Keywords
Gaussian processes; anthropometry; learning (artificial intelligence); mobile robots; regression analysis; flexible learning method; gaussian process regression; homologous module; human demonstration; imitated grasping motion; imitation learning; interactive expected deviation learning; probabilistic search strategy; reinforcement learning; robot anthropomorphic motion primitive; state action space; unpredictable movement; Degradation; Gaussian processes; Humans; Learning; Robots; Training; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Humanoid Robots (Humanoids), 2010 10th IEEE-RAS International Conference on
Conference_Location
Nashville, TN
Print_ISBN
978-1-4244-8688-5
Electronic_ISBN
978-1-4244-8689-2
Type
conf
DOI
10.1109/ICHR.2010.5686846
Filename
5686846
Link To Document