مرکز منطقه ای اطلاع رساني علوم و فناوري - Using policy gradient reinforcement learning on autonomous robot controllers

DocumentCode :

399706

Title :

Using policy gradient reinforcement learning on autonomous robot controllers

Author :

Grudic, Gregory Z. ; Kumar, Vijay ; Ungar, Lyle

Author_Institution :

Dept. of Comput. Sci., Colorado Univ., Boulder, CO, USA

Volume :

fYear :

2003

fDate :

27-31 Oct. 2003

Firstpage :

406

Abstract :

Robot programmers can often quickly program a robot to approximately execute a task under specific environment conditions. However, achieving robust performance under more general conditions is significantly more difficult. We propose a framework that starts with an existing control system and uses reinforcement feedback from the environment to autonomously improve the controller´s performance. We use the policy gradient reinforcement learning (PGRL) framework, which estimates a gradient (in controller space) of improved reward, allowing the controller parameters to be incrementally updated to autonomously achieve locally optimal performance. Our approach is experimentally verified on a Cye robot executing a room entry and observation task, showing significant reduction in task execution time and robustness with respect to un-modelled changes in the environment.

Keywords :

control engineering computing; learning (artificial intelligence); mobile robots; robot programming; state feedback; Cye robot; autonomous robot controllers; controller parameters; gradient reinforcement learning; robot programmers; Computer science; Control systems; Feedback; Information science; Learning; Optimal control; Orbital robotics; Robot control; State-space methods; Switches;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Robots and Systems, 2003. (IROS 2003). Proceedings. 2003 IEEE/RSJ International Conference on

Print_ISBN :

0-7803-7860-1

Type :

conf

DOI :

10.1109/IROS.2003.1250662

Filename :

1250662

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=399706