Title :
Parallel reinforcement learning systems using exploration agents and dyna-Q algorithm
Author :
Tateyama, Takeshi ; Kawata, Seiichi ; Shimomura, Yoshiki
Author_Institution :
Tokyo Metropolitan Univ., Tokyo
Abstract :
We propose a new strategy for parallel reinforcement learning; using this strategy, the optimal value function and policy can be constructed more quickly than by using traditional strategies. We define two types of agents: exploitation agents and exploration agents. The exploitation agents select actions mainly for the purpose of exploitation, and the exploration agents concentrate on exploration by using the extended k-certainty exploration method. These agents learn in the same environment in parallel, combine each value function periodically and execute Dyna-Q. The use of this strategy, make it possible to expect the construction of the optimal value function , and enables the exploration agents to quickly select the optimal actions. The experimental results of the mobile robot simulation showed the applicability of our method.
Keywords :
learning (artificial intelligence); multi-agent systems; Dyna-Q algorithm; exploitation agent; exploration agent; extended k-certainty exploration method; parallel reinforcement learning system; Learning; Dyna-Q; exploitation; exploration; extended ¿-certainty exploration method; parallel reinforcement learning;
Conference_Titel :
SICE, 2007 Annual Conference
Conference_Location :
Takamatsu
Print_ISBN :
978-4-907764-27-2
Electronic_ISBN :
978-4-907764-27-2
DOI :
10.1109/SICE.2007.4421460