مرکز منطقه ای اطلاع رساني علوم و فناوري - Continuous valued Q-learning for vision-guided behavior acquisition

DocumentCode :

3467060

Title :

Continuous valued Q-learning for vision-guided behavior acquisition

Author :

Takahashi, Yasutake ; Takeda, Masanori ; Asada, Minoru

Author_Institution :

Dept. of Adaptive Machine Syst., Osaka Univ., Japan

fYear :

1999

fDate :

1999

Firstpage :

255

Lastpage :

260

Abstract :

Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to real robot tasks because of poor performance of learned behavior and a further problem of state space construction. This paper proposes a continuous valued Q-learning for real robot applications, which calculates the contribution values for estimating a continuous action value in order to make motion smooth and effective. The proposed method obtained a better performance of desired behavior than the conventional real-valued Q-learning method, with roughly quantized state and action. To show the validity of the method, we applied the method to a vision-guided mobile robot of which the task is to chase a ball. Although the task was simple, the performance was quite impressive. A further improvement is discussed

Keywords :

learning (artificial intelligence); mobile robots; motion control; robot vision; continuous action value; continuous valued Q-learning; mobile robots; motion control; reinforcement learning; vision-guided control; Dynamic programming; Learning; Mobile robots; Orbital robotics; Robot vision systems; State-space methods;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multisensor Fusion and Integration for Intelligent Systems, 1999. MFI '99. Proceedings. 1999 IEEE/SICE/RSJ International Conference on

Conference_Location :

Taipei

Print_ISBN :

0-7803-5801-5

Type :

conf

DOI :

10.1109/MFI.1999.815999

Filename :

815999

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3467060