DocumentCode
3467060
Title
Continuous valued Q-learning for vision-guided behavior acquisition
Author
Takahashi, Yasutake ; Takeda, Masanori ; Asada, Minoru
Author_Institution
Dept. of Adaptive Machine Syst., Osaka Univ., Japan
fYear
1999
fDate
1999
Firstpage
255
Lastpage
260
Abstract
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to real robot tasks because of poor performance of learned behavior and a further problem of state space construction. This paper proposes a continuous valued Q-learning for real robot applications, which calculates the contribution values for estimating a continuous action value in order to make motion smooth and effective. The proposed method obtained a better performance of desired behavior than the conventional real-valued Q-learning method, with roughly quantized state and action. To show the validity of the method, we applied the method to a vision-guided mobile robot of which the task is to chase a ball. Although the task was simple, the performance was quite impressive. A further improvement is discussed
Keywords
learning (artificial intelligence); mobile robots; motion control; robot vision; continuous action value; continuous valued Q-learning; mobile robots; motion control; reinforcement learning; vision-guided control; Dynamic programming; Learning; Mobile robots; Orbital robotics; Robot vision systems; State-space methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Multisensor Fusion and Integration for Intelligent Systems, 1999. MFI '99. Proceedings. 1999 IEEE/SICE/RSJ International Conference on
Conference_Location
Taipei
Print_ISBN
0-7803-5801-5
Type
conf
DOI
10.1109/MFI.1999.815999
Filename
815999
Link To Document