DocumentCode
3090995
Title
Learning nonparametric policies by imitation
Author
Grimes, David B. ; Rao, Rajesh P N
Author_Institution
Dept. of Comput. Sci., Univ. of Washington, Seattle, WA
fYear
2008
fDate
22-26 Sept. 2008
Firstpage
2022
Lastpage
2028
Abstract
A long cherished goal in artificial intelligence has been the ability to endow a robot with the capacity to learn and generalize skills from watching a human teacher. Such an ability to learn by imitation has remained hard to achieve due to a number of factors, including the problem of learning in high-dimensional spaces and the problem of uncertainty. In this paper, we propose a new probabilistic approach to the problem of teaching a high degree-of-freedom robot (in particular, a humanoid robot) flexible and generalizable skills via imitation of a human teacher. The robot uses inference in a graphical model to learn sensor-based dynamics and infer a stable plan from a teacherpsilas demonstration of an action. The novel contribution of this work is a method for learning a nonparametric policy which generalizes a fixed action plan to operate over a continuous space of task variation. A notable feature of the approach is that it does not require any knowledge of the physics of the robot or the environment. By leveraging advances in probabilistic inference and Gaussian process regression, the method produces a nonparametric policy for sensor-based feedback control in continuous state and action spaces. We present experimental and simulation results using a Fujitsu HOAP-2 humanoid robot demonstrating imitation-based learning of a task involving lifting objects of different weights from a single human demonstration.
Keywords
Gaussian processes; feedback; graph theory; humanoid robots; regression analysis; sensors; Fujitsu HOAP-2 humanoid robot; Gaussian process regression; artificial intelligence; degree-of-freedom robot; graphical model; high-dimensional spaces; human teacher; nonparametric policies; probabilistic approach; probabilistic inference; sensor-based feedback control; History; Humanoid robots; Humans; Planning; Probabilistic logic; Robot sensing systems; Robots;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on
Conference_Location
Nice
Print_ISBN
978-1-4244-2057-5
Type
conf
DOI
10.1109/IROS.2008.4650778
Filename
4650778
Link To Document