Title :
An ant system based exploration-exploitation for reinforcement learning
Author :
Chang, Hyeong Soo
Author_Institution :
Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea
Abstract :
In this paper, we develop a novel exploration-exploitation strategy for reinforcement learning based on ant colony system. Most of the exploration-exploitation strategies use some statistics extracted from a single simulated trajectory. The novel strategy uses some statistics extracted from multiple simulated trajectories obtained from a swarm of ants. We show that the strategy preserves the convergence property of Q-learning.
Keywords :
combinatorial mathematics; learning (artificial intelligence); optimisation; Q-learning; ant colony system; convergence property; exploration-exploitation strategy; reinforcement learning; single simulated trajectory; Ant colony optimization; Biological information theory; Biological system modeling; Birds; Casting; Computational modeling; Computer science; Learning; Marine animals; Statistics;
Conference_Titel :
Systems, Man and Cybernetics, 2004 IEEE International Conference on
Print_ISBN :
0-7803-8566-7
DOI :
10.1109/ICSMC.2004.1400937