DocumentCode :
2337375
Title :
An extended policy gradient algorithm for robot task learning
Author :
Cherubini, A. ; Giannone, F. ; Iocchi, L. ; Palamara, P.F.
Author_Institution :
Sapienza Univ. of Roma, Rome
fYear :
2007
fDate :
Oct. 29 2007-Nov. 2 2007
Firstpage :
4121
Lastpage :
4126
Abstract :
In real-world robotic applications, many factors, both at low-level (e.g., vision and motion control parameters) and at high-level (e.g., the behaviors) determine the quality of the robot performance. Thus, for many tasks, robots require fine tuning of the parameters, in the implementation of behaviors and basic control actions, as well as in strategic decisional processes. In recent years, machine learning techniques have been used to find optimal parameter sets for different behaviors. However, a drawback of learning techniques is time consumption: in practical applications, methods designed for physical robots must be effective with small amounts of data. In this paper, we present a method for concurrent learning of best strategy and optimal parameters, by extending the policy gradient reinforcement learning algorithm. The results of our experimental work in a simulated environment and on a real robot show a very high convergence rate.
Keywords :
gradient methods; learning (artificial intelligence); robots; extended policy gradient algorithm; learning techniques; optimal parameter sets; reinforcement learning algorithm; robot task learning; strategic decisional processes; Cognitive robotics; Design methodology; Genetic programming; Intelligent robots; Learning systems; Machine learning; Motion control; Robot kinematics; Robot sensing systems; Robot vision systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-0912-9
Electronic_ISBN :
978-1-4244-0912-9
Type :
conf
DOI :
10.1109/IROS.2007.4399219
Filename :
4399219
Link To Document :
بازگشت