DocumentCode :
2867431
Title :
Learning Individual Skills and Team Behaviors for Distributed Object Pushing
Author :
Aminaiee, Hossein ; Ahmadabadi, Majid Nili
Author_Institution :
Dept. of Electr. & Comput. Eng., Tehran Univ.
fYear :
2006
fDate :
25-28 June 2006
Firstpage :
1202
Lastpage :
1209
Abstract :
In this paper we discuss a learning approach to distributed object pushing. In the proposed approach, first the required individual skills for single-robot object pushing are learned using a fuzzy reinforcement learning method. Then, the robots learn how to coordinate their actions to push the object to the desired configuration cooperatively in a distributed manner. The proposed team-level learning benefits from the knowledge, which is in the form of a Q-table, that the agent has gained in its individual learning phase by a special design of reward signal and state-action representation. Each robot learns a threshold on its Q-value using a single state reinforcement learning method and pushes the object when the Q-value of its best action in the current state is above this threshold. The reward signal is designed based on the robots´ Q-tables and no external critic is needed for learning cooperation. Simulation results show that the robots learn their individual skills and a cooperation protocol to push the object cooperatively
Keywords :
distributed control; fuzzy systems; learning (artificial intelligence); mobile robots; multi-robot systems; path planning; distributed object pushing; fuzzy reinforcement learning; individual skills learning; single-robot object pushing; state-action representation; team behaviors; Intelligent control; Intelligent robots; Learning; Mechatronics; Process control; Protocols; Robot control; Robot kinematics; Robotics and automation; Signal design;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mechatronics and Automation, Proceedings of the 2006 IEEE International Conference on
Conference_Location :
Luoyang, Henan
Print_ISBN :
1-4244-0465-7
Electronic_ISBN :
1-4244-0466-5
Type :
conf
DOI :
10.1109/ICMA.2006.257797
Filename :
4026257
Link To Document :
بازگشت