Regional Information Center for Science and Technology (RICeST) - Reinforcement Distribution in a Team of Cooperative Q-learning Agents

DocumentCode :

2743540

Title :

Reinforcement Distribution in a Team of Cooperative Q-learning Agents

Author :

Abbasi, Zahra ; Abbasi, Mohammad Ali

Author_Institution :

Islamic Azad Univ., Tehran

fYear :

2008

fDate :

6-8 Aug. 2008

Firstpage :

154

Lastpage :

160

Abstract :

In a multi-agent Q-learning group, agents cooperate with each other while learning to perform their assigned tasks, with the aim of increasing team performance. If the role of each agent is clearly specified, which is a very hard task for a supervisor agent, the team will learn more efficiently, because each agent is then reinforced according to its real effect on team performance. Assuming an identical role for all agents is the most prevalent technique among current researchers for avoiding the modeling complexities, but we believe this is not the optimal method for reinforcement distribution. The main goal of this research is to find an indirect evaluation method that evaluates the role of each agent in the team and distributes the reinforcement signal accordingly. The expertness of each agent is used as a criterion to estimate the effect of each agent's action on team performance. Random and equal reinforcement signal distribution methods are also used as comparison points for evaluating expertness-based reinforcement sharing. In addition, a new test bed, called EPIDEM, is developed to evaluate the proposed methods. The results show that distributing the reinforcement signals with the proposed method improves the team's learning speed.
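The three distribution schemes named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not give the expertness measure or the sharing formula, so the sketch assumes each agent's expertness is a non-negative number and that the team reward is split in proportion to it. All function names (`distribute_equal`, `distribute_random`, `distribute_by_expertness`, `q_update`) are hypothetical; only the tabular Q-learning update itself is the standard rule.

```python
import random
from typing import List, Optional


def distribute_equal(team_reward: float, n_agents: int) -> List[float]:
    """Equal sharing: every agent receives the same part of the team reward."""
    return [team_reward / n_agents] * n_agents


def distribute_random(team_reward: float, n_agents: int,
                      rng: Optional[random.Random] = None) -> List[float]:
    """Random sharing: split the team reward using random normalized weights."""
    rng = rng or random.Random()
    weights = [rng.random() for _ in range(n_agents)]
    total = sum(weights)
    if total == 0:  # extremely unlikely, but avoids division by zero
        return distribute_equal(team_reward, n_agents)
    return [team_reward * w / total for w in weights]


def distribute_by_expertness(team_reward: float,
                             expertness: List[float]) -> List[float]:
    """Assumed rule: split the team reward in proportion to expertness.

    The proportional split is an assumption made for illustration; the
    abstract only says that expertness is used as the criterion.
    """
    if any(e < 0 for e in expertness):
        raise ValueError("this sketch assumes non-negative expertness values")
    total = sum(expertness)
    if total == 0:  # no agent has any expertness yet: fall back to equal shares
        return distribute_equal(team_reward, len(expertness))
    return [team_reward * e / total for e in expertness]


def q_update(q: List[List[float]], state: int, action: int, reward: float,
             next_state: int, alpha: float = 0.1, gamma: float = 0.9) -> None:
    """Standard tabular Q-learning update, applied with an agent's own share."""
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
```

In use, a supervisor would compute the shares once per team reward and pass each agent its own share as the `reward` argument of `q_update`; swapping the distribution function is then the only change needed to compare the three schemes.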

Keywords :

learning (artificial intelligence); multi-agent systems; cooperative Q-learning multi agent group; equal reinforcement signal distribution; indirect evaluation method; random reinforcement signal distribution; reinforcement distribution; team performance; Artificial intelligence; Cooperative systems; Distributed computing; Humans; Multiagent systems; Process design; Protocols; Software engineering; Testing; Uncertainty; Agent learning; Cooperative distributed problem solving; Coordination, cooperation, and teamwork; Multiagent Systems; Multiagent learning, evolution, and adaptation

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2008. SNPD '08. Ninth ACIS International Conference on

Conference_Location :

Phuket

Print_ISBN :

978-0-7695-3263-9

Type :

conf

DOI :

10.1109/SNPD.2008.154

Filename :

4617364

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2743540