A new learning model for swarm intelligence based on Q-learning

Author

Li, Fuming ; He, Xiaoxian ; Xu, Jingjing

Author_Institution

Coll. of Econ. & Manage., YanShan Univ., Qinhuangdao, China

fYear

2010

Firstpage

2769

Lastpage

2775

Abstract

Inspired by cooperative transport behaviors of ants, on the basis of Q-learning, a new learning method, Neighbors´ Discounted Information (NDI) learning method, is present in the paper. This is a swarm-based learning method, in which principles of swarm intelligence are strictly complied with. In NDI learning, the i-interval neighbor´s information, namely its discounted reward, is referenced when an individual selects the next state, so that it can make the best decision in a computable local neighborhood. In application, different policies of NDI learning are recommended by controlling the parameters according to time-relativity of concrete tasks. By applying this learning method, the cooperative transport of ants is simulated. Experiment results show that the transport process in simulation is very similar to the phenomenon in natural world, which proves the designed learning mechanism´s rationality.

Keywords

cooperative systems; learning (artificial intelligence); Q-learning; ants cooperative transport behaviors; learning model; neighbors discounted information learning method; swarm intelligence; Biological system modeling; Computational modeling; Educational institutions; Information science; Learning systems; Markov processes; Particle swarm optimization; Neighbors´ Discounted Information learning (NDI learning); Q-learning; discounted reward; i-interval neighbor; swarm intelligence;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Control and Automation (WCICA), 2010 8th World Congress on

Conference_Location

Jinan

Print_ISBN

978-1-4244-6712-9

Type

conf

DOI

10.1109/WCICA.2010.5554902

Filename

5554902