DocumentCode
1978752
Title
A fuzzy adaptive algorithm for expertness based cooperative learning, application to herding problem
Author
Akbarzadeh-T, M.-R. ; Rezaei-S, H. ; Naghibi-S, M.B.
Author_Institution
Dept. of Electr. Eng., Ferdowsi Univ. of Mashhad, Iran
fYear
2003
fDate
24-26 July 2003
Firstpage
317
Lastpage
322
Abstract
Cooperative learning in multi-agent systems is generally expected to improve both the quality and the speed of learning. This is particularly true when agents can recognize the expert agents among themselves and integrate their knowledge properly. Learning can be improved further when the reinforcement learning signals in each agent balance the search for unknown knowledge (exploration) against the use of knowledge already obtained (exploitation). This paper introduces a fuzzy dynamic cooperative learning method, based on weighted strategy sharing (WSS), that balances exploitation and exploration behaviors. In the weighted strategy sharing method, agents share their learned knowledge according to a measure of their expertness. When applied to the classic herding problem, the strategy shows further improvement in the quality and speed of learning when the parameters of the learning algorithm are determined dynamically by a fuzzy routine.
Keywords
expert systems; fuzzy logic; learning (artificial intelligence); multi-agent systems; WSS; expert agents; expertness based cooperative learning; exploitation behaviour; fuzzy adaptive algorithm; fuzzy dynamic cooperative learning method; learning algorithm; learning process; multiagent systems; reinforcement learning; weighted strategy sharing; Adaptive algorithm; Fuzzy logic; Heuristic algorithms; Humans; Intelligent agent; Learning systems; Mechatronics; Multiagent systems; Robot control; Signal processing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Fuzzy Information Processing Society, 2003. NAFIPS 2003. 22nd International Conference of the North American
Print_ISBN
0-7803-7918-7
Type
conf
DOI
10.1109/NAFIPS.2003.1226804
Filename
1226804