DocumentCode :
2953405
Title :
Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks
Author :
Igarashi, H. ; Nakamura, K. ; Ishihara, S.
Author_Institution :
Dept. of Inf. Sci. & Eng., Sibaura Inst. of Technol., Tokyo
fYear :
2008
fDate :
1-8 June 2008
Firstpage :
46
Lastpage :
52
Abstract :
The RoboCup Simulation League is recognized as a test bed for research on multi-agent learning. As an example of multi-agent learning in a soccer game, we dealt with a learning problem between a kicker and a receiver when a direct free kick is awarded just outside the opponentpsilas penalty area. In such a situation, to which point should the kicker kick the ball? We propose a function that expresses heuristics to evaluate an advantageous target point for safely sending/receiving a pass and scoring. The heuristics includes an interaction term between a kicker and a receiver to intensify their coordination. To calculate the interaction term, we let kicker/receiver agents have a receiver/kicker action decision model to predict his teammatepsilas action. The evaluation function makes it possible to handle a large space of states consisting of the positions of a kicker, a receiver, and their opponents. The target point of the free kick is selected by the kicker using Boltzmann selection with an evaluation function. Parameters in the function can be learned by a kind of reinforcement learning called the policy gradient method. The point to which a receiver should run to receive the ball is simultaneously learned in the same manner. The effectiveness of our solution was shown by experiments.
Keywords :
gradient methods; learning (artificial intelligence); multi-agent systems; Boltzmann selection; RoboCup simulation league; free kicks; multiagent learning; policy gradient method; reinforcement learning; soccer player agents learning; Computer science; Gradient methods; Information science; Neural networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
ISSN :
1098-7576
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
Type :
conf
DOI :
10.1109/IJCNN.2008.4633765
Filename :
4633765
Link To Document :
بازگشت