DocumentCode :
399706
Title :
Using policy gradient reinforcement learning on autonomous robot controllers
Author :
Grudic, Gregory Z. ; Kumar, Vijay ; Ungar, Lyle
Author_Institution :
Dept. of Comput. Sci., Colorado Univ., Boulder, CO, USA
Volume :
1
fYear :
2003
fDate :
27-31 Oct. 2003
Firstpage :
406
Abstract :
Robot programmers can often quickly program a robot to approximately execute a task under specific environment conditions. However, achieving robust performance under more general conditions is significantly more difficult. We propose a framework that starts with an existing control system and uses reinforcement feedback from the environment to autonomously improve the controller´s performance. We use the policy gradient reinforcement learning (PGRL) framework, which estimates a gradient (in controller space) of improved reward, allowing the controller parameters to be incrementally updated to autonomously achieve locally optimal performance. Our approach is experimentally verified on a Cye robot executing a room entry and observation task, showing significant reduction in task execution time and robustness with respect to un-modelled changes in the environment.
Keywords :
control engineering computing; learning (artificial intelligence); mobile robots; robot programming; state feedback; Cye robot; autonomous robot controllers; controller parameters; gradient reinforcement learning; robot programmers; Computer science; Control systems; Feedback; Information science; Learning; Optimal control; Orbital robotics; Robot control; State-space methods; Switches;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 2003. (IROS 2003). Proceedings. 2003 IEEE/RSJ International Conference on
Print_ISBN :
0-7803-7860-1
Type :
conf
DOI :
10.1109/IROS.2003.1250662
Filename :
1250662
Link To Document :
بازگشت