Title :
An adaptation of particle swarm optimization for Markov decision processes
Author :
Chang, Hyeong Soo
Author_Institution :
Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea
Abstract :
In this paper, we adapt the metaheuristic of particle swarm optimization (PSO) for solving nonstochastic optimization problems into a novel convergent algorithm for solving Markov decision processes (MDP) with infinite horizon discounted cost criterion. We show that the algorithm converges to an optimal policy with probability one. We further study how to adapt PSO to develop PSO-based reinforcement learning for the case where transition and cost dynamics of a given MDP are unknown to the decision maker.
Keywords :
Markov processes; decision theory; evolutionary computation; learning (artificial intelligence); Markov decision process; convergent algorithm; infinite horizon discounted cost criterion; nonstochastic optimization problem; particle swarm optimization metaheuristic; reinforcement learning; Birds; Computer science; Convergence; Cost function; Infinite horizon; Intelligent robots; Learning; Particle swarm optimization; Statistics; Stochastic processes;
Conference_Titel :
Systems, Man and Cybernetics, 2004 IEEE International Conference on
Print_ISBN :
0-7803-8566-7
DOI :
10.1109/ICSMC.2004.1399867