DocumentCode :
1740162
Title :
Learning using multidimensional internal rewards
Author :
Kobayashi, Yuichi ; Yuasa, Hideo ; Arai, Tamio
Author_Institution :
Dept. of Precision Eng., Tokyo Univ., Japan
Volume :
1
fYear :
2000
fDate :
2000
Firstpage :
572
Abstract :
Complicated tasks are often difficult to be expressed as single reward systems. In the human learning process, the relation between sensory inputs and action out-puts can be understood to have been acquired before-hand using an internal multidimensional reward system. We introduce reinforcement learning under multidimensional evaluation. The internal reward system includes both immediate evaluation and delayed rewards. The proposed architecture of the learning system is as a two layered Q-Learning system, which is combined with dynamic cell structure. We assume in the pushing task by a manipulator that information from touch sensors and motion detector of the vision system are available. The simulation showed that the acquired knowledge in the lower layer greatly helps to learn the pushing task
Keywords :
image sensors; learning (artificial intelligence); robot programming; tactile sensors; action out-puts; complicated tasks; delayed rewards; dynamic cell structure; immediate evaluation; motion detector; multidimensional evaluation; multidimensional internal rewards; pushing task; reinforcement learning; sensory inputs; touch sensors; vision system; Delay; Detectors; Humans; Learning systems; Manipulators; Motion detection; Multidimensional systems; Precision engineering; Robot sensing systems; Tactile sensors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 2000. (IROS 2000). Proceedings. 2000 IEEE/RSJ International Conference on
Conference_Location :
Takamatsu
Print_ISBN :
0-7803-6348-5
Type :
conf
DOI :
10.1109/IROS.2000.894665
Filename :
894665
Link To Document :
بازگشت