• DocumentCode
    1699568
  • Title

    A new learning model for swarm intelligence based on Q-learning

  • Author

    Li, Fuming ; He, Xiaoxian ; Xu, Jingjing

  • Author_Institution
    Coll. of Econ. & Manage., YanShan Univ., Qinhuangdao, China
  • fYear
    2010
  • Firstpage
    2769
  • Lastpage
    2775
  • Abstract
    Inspired by the cooperative transport behavior of ants and building on Q-learning, this paper presents a new learning method, Neighbors' Discounted Information (NDI) learning. It is a swarm-based learning method that strictly complies with the principles of swarm intelligence. In NDI learning, the i-interval neighbor's information, namely its discounted reward, is referenced when an individual selects the next state, so that it can make the best decision within a computable local neighborhood. In applications, different NDI learning policies are recommended, obtained by controlling the parameters according to the time-relativity of the concrete task. The cooperative transport of ants is simulated using this learning method. Experimental results show that the simulated transport process is very similar to the phenomenon observed in the natural world, demonstrating the rationality of the designed learning mechanism.
  • Keywords
    cooperative systems; learning (artificial intelligence); Q-learning; ants cooperative transport behaviors; learning model; neighbors discounted information learning method; swarm intelligence; Biological system modeling; Computational modeling; Educational institutions; Information science; Learning systems; Markov processes; Particle swarm optimization; Neighbors' Discounted Information learning (NDI learning); discounted reward; i-interval neighbor
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Control and Automation (WCICA), 2010 8th World Congress on
  • Conference_Location
    Jinan
  • Print_ISBN
    978-1-4244-6712-9
  • Type

    conf

  • DOI
    10.1109/WCICA.2010.5554902
  • Filename
    5554902