DocumentCode :
893946
Title :
Emerging Cooperation With Minimal Effort: Rewarding Over Mimicking
Author :
Yannakakis, Georgios N. ; Levine, John ; Hallam, John
Author_Institution :
Univ. of Southern Denmark, Odense
Volume :
11
Issue :
3
fYear :
2007
fDate :
6/1/2007 12:00:00 AM
Firstpage :
382
Lastpage :
396
Abstract :
This paper compares supervised and unsupervised learning mechanisms for the emergence of cooperative multiagent spatial coordination using a top-down approach. By observing the global performance of a group of homogeneous agents-supported by a nonglobal knowledge of their environment-we attempt to extract information about the minimum size of the agent neurocontroller and the type of learning mechanism that collectively generate high-performing and robust behaviors with minimal computational effort. Consequently, a methodology for obtaining controllers of minimal size is introduced and a comparative study between supervised and unsupervised learning mechanisms for the generation of successful collective behaviors is presented. We have developed a prototype simulated world for our studies. This case study is primarily a computer games inspired world but its main features are also biologically plausible. The two specific tasks that the agents are tested in are the competing strategies of obstacle-avoidance and target-achievement. We demonstrate that cooperative behavior among agents, which is supported only by limited communication, appears to be necessary for the problem´s efficient solution and that learning by rewarding the behavior of agent groups constitutes a more efficient and computationally preferred generic approach than supervised learning approaches in such complex multiagent worlds
Keywords :
genetic algorithms; multi-agent systems; unsupervised learning; agent neurocontroller; computer games; cooperative multiagent spatial coordination; supervised learning mechanisms; top-down approach; unsupervised learning mechanisms; Biological system modeling; Computational modeling; Data mining; High performance computing; Learning systems; Neurocontrollers; Robustness; Size control; Unsupervised learning; Virtual prototyping; Artificial world; genetic algorithms (GAs); machine learning; multiagent; spatial coordination;
fLanguage :
English
Journal_Title :
Evolutionary Computation, IEEE Transactions on
Publisher :
ieee
ISSN :
1089-778X
Type :
jour
DOI :
10.1109/TEVC.2006.882429
Filename :
4220689
Link To Document :
بازگشت