DocumentCode
480792
Title
A Study of Reinforcement Learning in a New Multiagent Domain
Author
Min, Hua-Qing ; Zeng, Jia-An ; Chen, Jian ; Zhu, Jin-Hui
Author_Institution
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou
Volume
2
fYear
2008
fDate
9-12 Dec. 2008
Firstpage
154
Lastpage
161
Abstract
RoboCup Keepaway is one of the most challenging multiagent systems (MAS) where a team of keepers tries to keep the ball away from the team of takers. Most of current works concentrate on the learning of keeper, not the learning of taker, which is also a great challenge to the application of reinforcement learning (RL). In this paper, we propose a task named takeaway for takers and study the learning of them. We employ an initial learning algorithm called Update on Steps (UoS) for takers and demonstrate that this algorithm has two main faults including action oscillation and reliance on designer´s experience. Thereafter we present a novel RL algorithm called dynamic CMAC advantage learning (DCMAC-AL). It makes use of advantage(lambda) learning to calculate value function as well as CMAC to generalize state space, and creates novel features based on Bellman error to improve the precision of CMAC. Empirical results show that takers with DCMAC- AL can learn efficiently.
Keywords
learning (artificial intelligence); mobile robots; multi-agent systems; multi-robot systems; Bellman error; RoboCup Keepaway; dynamic CMAC advantage learning; multiagent systems; reinforcement learning; takeaway for takers; update-on steps algorithm; Algorithm design and analysis; Computer science; Heuristic algorithms; Intelligent agent; Learning systems; Multiagent systems; Real time systems; State-space methods; Testing; Tiles;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
978-0-7695-3496-1
Type
conf
DOI
10.1109/WIIAT.2008.114
Filename
4740616
Link To Document