Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks

Author

Igarashi, H. ; Nakamura, K. ; Ishihara, S.

Author_Institution

Dept. of Inf. Sci. & Eng., Sibaura Inst. of Technol., Tokyo

fYear

2008

fDate

1-8 June 2008

Firstpage

46

Lastpage

52

Abstract

The RoboCup Simulation League is recognized as a test bed for research on multi-agent learning. As an example of multi-agent learning in a soccer game, we dealt with a learning problem between a kicker and a receiver when a direct free kick is awarded just outside the opponentpsilas penalty area. In such a situation, to which point should the kicker kick the ball? We propose a function that expresses heuristics to evaluate an advantageous target point for safely sending/receiving a pass and scoring. The heuristics includes an interaction term between a kicker and a receiver to intensify their coordination. To calculate the interaction term, we let kicker/receiver agents have a receiver/kicker action decision model to predict his teammatepsilas action. The evaluation function makes it possible to handle a large space of states consisting of the positions of a kicker, a receiver, and their opponents. The target point of the free kick is selected by the kicker using Boltzmann selection with an evaluation function. Parameters in the function can be learned by a kind of reinforcement learning called the policy gradient method. The point to which a receiver should run to receive the ball is simultaneously learned in the same manner. The effectiveness of our solution was shown by experiments.

Keywords

gradient methods; learning (artificial intelligence); multi-agent systems; Boltzmann selection; RoboCup simulation league; free kicks; multiagent learning; policy gradient method; reinforcement learning; soccer player agents learning; Computer science; Gradient methods; Information science; Neural networks;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on

Conference_Location

Hong Kong

ISSN

1098-7576

Print_ISBN

978-1-4244-1820-6

Electronic_ISBN

1098-7576

Type

conf

DOI

10.1109/IJCNN.2008.4633765

Filename

4633765