DocumentCode :
2743540
Title :
Reinforcement Distribution in a Team of Cooperative Q-learning Agents
Author :
Abbasi, Zahra ; Abbasi, Mohammad Ali
Author_Institution :
Islamic Azad Univ., Tehran
fYear :
2008
fDate :
6-8 Aug. 2008
Firstpage :
154
Lastpage :
160
Abstract :
In a Q-learning multi-agent group, agents cooperate each other to perform their assigned task during their learning for increasing the team performance. If the role of each agent clearly specified -which is a very hard task for a supervisor agent- the team will learn more efficiently. Indeed, in this case each agent reinforced according to its real effect on the team performance. Assuming an identical role for all agents is the most prevalent technique of current researchers to escape the modeling complexities. But we believe this is not the optimum method for reinforcement distribution. The main goal of this research is to find an indirect evaluation method which evaluates the role of each agent in the team and distributes the reinforcement signal accordingly. The expertness of each agent is used as a criterion to estimate the effect of each agentpsilas action on the team performance. Random and equal reinforcement signal distribution methods are also used in order to evaluate expertness-based reinforcement sharing. In addition, a new test bed, called EPIDEM, is developed to evaluate the proposed methods. The results show the distribution of the reinforcement signals based on the proposed method improves the team learning speed.
Keywords :
learning (artificial intelligence); multi-agent systems; cooperative Q-learning multi agent group; equal reinforcement signal distribution; indirect evaluation method; random reinforcement signal distribution; reinforcement distribution; team performance; Artificial intelligence; Cooperative systems; Distributed computing; Humans; Multiagent systems; Process design; Protocols; Software engineering; Testing; Uncertainty; Agent learning; Cooperative distributed problem; Coordination; Multiagent Systems; Multiagent learning; and adaptation; and teamwork; cooperation; evolution; solving;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008. SNPD '08. Ninth ACIS International Conference on
Conference_Location :
Phuket
Print_ISBN :
978-0-7695-3263-9
Type :
conf
DOI :
10.1109/SNPD.2008.154
Filename :
4617364
Link To Document :
بازگشت