DocumentCode :
2007013
Title :
Applying statistical generalization to determine search direction for reinforcement learning of movement primitives
Author :
Nemec, Bojan ; Forte, Domenic ; Vuga, Rok ; Tamosiunaite, Minijia ; Worgotter, Florentin ; Ude, Ales
Author_Institution :
Dept. of Automatics, Biocybernetics, & Robot., Jozef Stefan Inst., Ljubljana, Slovenia
fYear :
2012
fDate :
Nov. 29 2012-Dec. 1 2012
Firstpage :
65
Lastpage :
70
Abstract :
In this paper we present a new methodology for robot learning that combines ideas from statistical generalization and reinforcement learning. First we apply statistical generalization to compute an approximation for the optimal control policy as defined by training movements that solve the given task in a number of specific situations. This way we obtain a manifold of movements, which dimensionality is usually much smaller than the dimensionality of a full space of movement primitives. Next we refine the policy by means of reinforcement learning on the approximating manifold, which results in a learning problem constrained to the low dimensional manifold. We show that in some situations, learning on the low dimensional manifold can be implemented as an error learning algorithm. We apply golden section search to refine the control policy. Furthermore, we propose a reinforcement learning algorithm with an extended parameter set, which combines learning in constrained domain with learning in full space of parametric movement primitives, which makes it possible to explore actions outside of the initial approximating manifold. The proposed approach was tested for learning of pouring action both in simulation and on a real robot.
Keywords :
approximation theory; control engineering computing; intelligent robots; learning (artificial intelligence); mobile robots; optimal control; statistical analysis; approximating manifold; constrained domain; error learning algorithm; golden section search; low dimensional manifold; optimal control policy; parametric movement primitives; reinforcement learning; robot learning; search direction; statistical generalization; training movements; Aerospace electronics; Joints; Learning (artificial intelligence); Liquids; Manifolds; Robot sensing systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Humanoid Robots (Humanoids), 2012 12th IEEE-RAS International Conference on
Conference_Location :
Osaka
ISSN :
2164-0572
Type :
conf
DOI :
10.1109/HUMANOIDS.2012.6651500
Filename :
6651500
Link To Document :
بازگشت