DocumentCode
2953405
Title
Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks
Author
Igarashi, H. ; Nakamura, K. ; Ishihara, S.
Author_Institution
Dept. of Inf. Sci. & Eng., Sibaura Inst. of Technol., Tokyo
fYear
2008
fDate
1-8 June 2008
Firstpage
46
Lastpage
52
Abstract
The RoboCup Simulation League is recognized as a test bed for research on multi-agent learning. As an example of multi-agent learning in a soccer game, we dealt with a learning problem between a kicker and a receiver when a direct free kick is awarded just outside the opponentpsilas penalty area. In such a situation, to which point should the kicker kick the ball? We propose a function that expresses heuristics to evaluate an advantageous target point for safely sending/receiving a pass and scoring. The heuristics includes an interaction term between a kicker and a receiver to intensify their coordination. To calculate the interaction term, we let kicker/receiver agents have a receiver/kicker action decision model to predict his teammatepsilas action. The evaluation function makes it possible to handle a large space of states consisting of the positions of a kicker, a receiver, and their opponents. The target point of the free kick is selected by the kicker using Boltzmann selection with an evaluation function. Parameters in the function can be learned by a kind of reinforcement learning called the policy gradient method. The point to which a receiver should run to receive the ball is simultaneously learned in the same manner. The effectiveness of our solution was shown by experiments.
Keywords
gradient methods; learning (artificial intelligence); multi-agent systems; Boltzmann selection; RoboCup simulation league; free kicks; multiagent learning; policy gradient method; reinforcement learning; soccer player agents learning; Computer science; Gradient methods; Information science; Neural networks;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location
Hong Kong
ISSN
1098-7576
Print_ISBN
978-1-4244-1820-6
Electronic_ISBN
1098-7576
Type
conf
DOI
10.1109/IJCNN.2008.4633765
Filename
4633765
Link To Document