Learning nonparametric policies by imitation

Author

Grimes, David B. ; Rao, Rajesh P N

Author_Institution

Dept. of Comput. Sci., Univ. of Washington, Seattle, WA

fYear

2008

fDate

22-26 Sept. 2008

Firstpage

2022

Lastpage

2028

Abstract

A long cherished goal in artificial intelligence has been the ability to endow a robot with the capacity to learn and generalize skills from watching a human teacher. Such an ability to learn by imitation has remained hard to achieve due to a number of factors, including the problem of learning in high-dimensional spaces and the problem of uncertainty. In this paper, we propose a new probabilistic approach to the problem of teaching a high degree-of-freedom robot (in particular, a humanoid robot) flexible and generalizable skills via imitation of a human teacher. The robot uses inference in a graphical model to learn sensor-based dynamics and infer a stable plan from a teacherpsilas demonstration of an action. The novel contribution of this work is a method for learning a nonparametric policy which generalizes a fixed action plan to operate over a continuous space of task variation. A notable feature of the approach is that it does not require any knowledge of the physics of the robot or the environment. By leveraging advances in probabilistic inference and Gaussian process regression, the method produces a nonparametric policy for sensor-based feedback control in continuous state and action spaces. We present experimental and simulation results using a Fujitsu HOAP-2 humanoid robot demonstrating imitation-based learning of a task involving lifting objects of different weights from a single human demonstration.

Keywords

Gaussian processes; feedback; graph theory; humanoid robots; regression analysis; sensors; Fujitsu HOAP-2 humanoid robot; Gaussian process regression; artificial intelligence; degree-of-freedom robot; graphical model; high-dimensional spaces; human teacher; nonparametric policies; probabilistic approach; probabilistic inference; sensor-based feedback control; History; Humanoid robots; Humans; Planning; Probabilistic logic; Robot sensing systems; Robots;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on

Conference_Location

Nice

Print_ISBN

978-1-4244-2057-5

Type

conf

DOI

10.1109/IROS.2008.4650778

Filename

4650778