Title :
Learning the Point Gathering Task Using Shared Value Function In Mobile Robots
Author :
Raghavan, Sriram ; Raghavan, S.V.
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Madras, Chennai
Abstract :
Managing the actions of several agents to perform tasks which require coordination and cooperation pose significant research challenges. One such challenge is to synchronize the agentspsila view of the system to help them take the dasiarightpsila actions. In this paper, we propose an algorithm called MRCPG (Mobile Robot Coordination Point Gathering Algorithm) for coordinating the actions of a team of mobile robots. The aim is to gather these robots at a particular location in a 2-dimensional plane which is determined during execution. The robots are randomly deployed in the plane and they achieve the goal by communicating periodically. In addition, we impose a reinforcement learning framework and the robots learn a shared value function (SVF) based on scalar rewards received. The SVF is used to select the best possible action in each state until at least one robot reaches the goal. Then a reach-distance heuristic is used to direct the remaining robots to the goal. The algorithm was analyzed through simulations for up to 5 robots and the analysis indicates that communication helped robots perform significantly faster than when they acted independently - measured using the path-length of the first robot to reach the goal as the metric. We also observed that increasing team size enhances the effect of communication and hastens task completion.
Keywords :
learning (artificial intelligence); mobile robots; multi-robot systems; mobile robots; point gathering task; reach-distance heuristic; reinforcement learning; shared value function; Algorithm design and analysis; Computer science; Inference algorithms; Intelligent robots; Laboratories; Learning; Mobile robots; Performance analysis; Robot kinematics; Robotic assembly; Communication; Coordination; Multi Agent Robot Systems; Multi agent Reinforcement Learning; Point Gathering Task; Shared Value Function;
Conference_Titel :
Advanced Computer Control, 2009. ICACC '09. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-3330-8
DOI :
10.1109/ICACC.2009.49