Adaptive reinforcement Q-Learning algorithm for swarm-robot system using pheromone mechanism

Author

Zhiguo Shi ; Jun Tu ; Yuankai Li ; Zeying Wang

Author_Institution

Sch. of Comput. & Commun. Eng., Univ. of Sci. & Technol., Beijing, China

fYear

2013

fDate

12-14 Dec. 2013

Firstpage

952

Lastpage

957

Abstract

The states and actions of the robots in uncertain environments are continuous, which will easily lead to the problem of slow learning speed and the combinatorial explosion issue of the reinforcement learning. Ant colony optimization (ACO) is an evolution algorithm based on swarm mechanism that takes full advantage of the pheromone mechanism to simplify the information sharing and collaborative issues between the swarm individuals. Adaptive robust reinforcement Q-Learning algorithm based on ACO is proposed from two parts: adaptive discretization part and pheromone part. Firstly, adaptive discretization of the continuous input space is realized by the self-organizing neural network. Secondly, the pheromone mechanism of ACO is introduced to improve the traditional reinforcement learning process, which can improve the adaptive capabilities of the system and reduce the space complexity of accelerating the learning speed of the swarm robots. Player/Stage is used as the simulation platform to verify the proposed algorithm. The results show proposed algorithm has efficiency and adaptive capacity in the swarm robotic system.

Keywords

ant colony optimisation; computational complexity; learning (artificial intelligence); neural nets; robots; self-adjusting systems; swarm intelligence; ACO; adaptive capacity; adaptive discretization; adaptive robust reinforcement Q-Learning algorithm; ant colony optimization; continuous input space; evolution algorithm; information sharing; learning speed; pheromone mechanism; reinforcement learning process; self-organizing neural network; space complexity; swarm mechanism; swarm robotic system; Adaptation models; Adaptive systems; Cities and towns; Convergence; Learning (artificial intelligence); Neurons; Robots;

fLanguage

English

Publisher

ieee

Conference_Titel

Robotics and Biomimetics (ROBIO), 2013 IEEE International Conference on

Conference_Location

Shenzhen

Type

conf

DOI

10.1109/ROBIO.2013.6739586

Filename

6739586