• DocumentCode
    3291790
  • Title

    Adaptive reinforcement Q-Learning algorithm for swarm-robot system using pheromone mechanism

  • Author

    Zhiguo Shi ; Jun Tu ; Yuankai Li ; Zeying Wang

  • Author_Institution
    Sch. of Comput. & Commun. Eng., Univ. of Sci. & Technol., Beijing, China
  • fYear
    2013
  • fDate
    12-14 Dec. 2013
  • Firstpage
    952
  • Lastpage
    957
  • Abstract
    The states and actions of the robots in uncertain environments are continuous, which will easily lead to the problem of slow learning speed and the combinatorial explosion issue of the reinforcement learning. Ant colony optimization (ACO) is an evolution algorithm based on swarm mechanism that takes full advantage of the pheromone mechanism to simplify the information sharing and collaborative issues between the swarm individuals. Adaptive robust reinforcement Q-Learning algorithm based on ACO is proposed from two parts: adaptive discretization part and pheromone part. Firstly, adaptive discretization of the continuous input space is realized by the self-organizing neural network. Secondly, the pheromone mechanism of ACO is introduced to improve the traditional reinforcement learning process, which can improve the adaptive capabilities of the system and reduce the space complexity of accelerating the learning speed of the swarm robots. Player/Stage is used as the simulation platform to verify the proposed algorithm. The results show proposed algorithm has efficiency and adaptive capacity in the swarm robotic system.
  • Keywords
    ant colony optimisation; computational complexity; learning (artificial intelligence); neural nets; robots; self-adjusting systems; swarm intelligence; ACO; adaptive capacity; adaptive discretization; adaptive robust reinforcement Q-Learning algorithm; ant colony optimization; continuous input space; evolution algorithm; information sharing; learning speed; pheromone mechanism; reinforcement learning process; self-organizing neural network; space complexity; swarm mechanism; swarm robotic system; Adaptation models; Adaptive systems; Cities and towns; Convergence; Learning (artificial intelligence); Neurons; Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Biomimetics (ROBIO), 2013 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Type

    conf

  • DOI
    10.1109/ROBIO.2013.6739586
  • Filename
    6739586