DocumentCode :
2062053
Title :
Multiple rewards fuzzy reinforcement learning algorithm in RoboCup environment
Author :
Shi, Li ; Jinyi, Yao ; Zhen, Ye ; Zengqi, Sun
Author_Institution :
Dept. of Comput. Sci., Tsinghua Univ., Beijing, China
fYear :
2001
fDate :
2001
Firstpage :
317
Lastpage :
322
Abstract :
In order to achieve the competition tasks for multicooperating robots through learning, the paper discusses a kind of method that is designed for multi-agent systems (MAS), called the multi-reward fuzzy Q-learning algorithm (MRFQLA), which can be applied to the environment of the Robot World Cup Tournament (RoboCup). In MRFQLA., multiple reinforcement functions are established, based on the different characters of multi-agent systems. When the learning robot executes an action, these functions create multiple reinforcement signals that give the criteria of this action from different points of view. A Takagi-Sugeno (TS) model of a fuzzy inference system is built, which integrates these multiple rewards into one signal as the feedback of the learning robot. This method enhances the efficiency of learning because multiple rewards increase TD error and eliminates the conflict between the short-term target and the long-term one. Computer simulations in the RoboCup environment are shown and a discussion is given
Keywords :
fuzzy logic; inference mechanisms; learning (artificial intelligence); mobile robots; multi-agent systems; multi-robot systems; Q-Learning; RoboCup environment; Robot World Cup Tournament; Takagi-Sugeno model; competition tasks; fuzzy inference system; multi-agent systems; multiple rewards fuzzy reinforcement learning algorithm; Algorithm design and analysis; Computer errors; Computer simulation; Design methodology; Feedback; Fuzzy systems; Learning; Multiagent systems; Robots; Takagi-Sugeno model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Applications, 2001. (CCA '01). Proceedings of the 2001 IEEE International Conference on
Conference_Location :
Mexico City
Print_ISBN :
0-7803-6733-2
Type :
conf
DOI :
10.1109/CCA.2001.973884
Filename :
973884
Link To Document :
بازگشت