Regional Information Center for Science and Technology - Parallel reinforcement learning systems using exploration agents and Dyna-Q algorithm

DocumentCode :

2644503

Title :

Parallel reinforcement learning systems using exploration agents and Dyna-Q algorithm

Author :

Tateyama, Takeshi ; Kawata, Seiichi ; Shimomura, Yoshiki

Author_Institution :

Tokyo Metropolitan Univ., Tokyo

fYear :

2007

fDate :

17-20 Sept. 2007

Firstpage :

2774

Lastpage :

2778

Abstract :

We propose a new strategy for parallel reinforcement learning; using this strategy, the optimal value function and policy can be constructed more quickly than with traditional strategies. We define two types of agents: exploitation agents and exploration agents. The exploitation agents select actions mainly for the purpose of exploitation, while the exploration agents concentrate on exploration by using the extended k-certainty exploration method. These agents learn in the same environment in parallel, periodically combine their value functions, and execute Dyna-Q. This strategy is expected to yield the optimal value function and enables the exploration agents to quickly select the optimal actions. Experimental results from a mobile robot simulation demonstrate the applicability of our method.
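
The loop described in the abstract (two kinds of agents learning in parallel, periodically combining value functions, then running Dyna-Q) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the 1-D corridor environment, the names `Agent`, `merge_and_plan`, `train` and `greedy_policy`, the element-wise maximum used to combine Q-tables, and the least-tried-action rule, which is only a simple count-based stand-in for the extended k-certainty exploration method. Parallel execution is simulated by interleaving the agents' steps.

```python
import random

N_STATES, GOAL = 6, 5            # hypothetical 1-D corridor, goal at the right end
ACTIONS = (-1, +1)               # move left / move right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.1

def step(state, action):
    """Deterministic toy environment: reward 1 on reaching GOAL, else 0."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    return nxt, (1.0 if nxt == GOAL else 0.0)

def q_update(q, s, a, r, s2):
    """One-step Q-learning backup (GOAL is terminal)."""
    best_next = 0.0 if s2 == GOAL else max(q[(s2, b)] for b in ACTIONS)
    q[(s, a)] += ALPHA * (r + GAMMA * best_next - q[(s, a)])

class Agent:
    def __init__(self, explorer, rng):
        self.explorer, self.rng = explorer, rng
        self.q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
        self.counts = {k: 0 for k in self.q}   # how often each pair was tried
        self.model = {}                        # (s, a) -> (next state, reward)
        self.state = 0

    def choose(self):
        s = self.state
        if self.explorer:
            # Assumption: least-tried action, a stand-in for k-certainty exploration.
            return min(ACTIONS, key=lambda a: self.counts[(s, a)])
        if self.rng.random() < EPSILON:
            return self.rng.choice(ACTIONS)
        best = max(self.q[(s, a)] for a in ACTIONS)
        return self.rng.choice([a for a in ACTIONS if self.q[(s, a)] == best])

    def act(self):
        s, a = self.state, self.choose()
        s2, r = step(s, a)
        self.counts[(s, a)] += 1
        self.model[(s, a)] = (s2, r)           # record the observed transition
        q_update(self.q, s, a, r, s2)
        self.state = 0 if s2 == GOAL else s2   # restart the episode at the goal

def merge_and_plan(agents, rng, planning_steps=50):
    """Combine Q-tables and models, then run Dyna-Q planning backups."""
    # Assumption: element-wise max as the combination rule.
    merged_q = {k: max(ag.q[k] for ag in agents) for k in agents[0].q}
    merged_model = {}
    for ag in agents:
        merged_model.update(ag.model)
    keys = sorted(merged_model)
    for _ in range(planning_steps):            # replay simulated experience
        s, a = rng.choice(keys)
        s2, r = merged_model[(s, a)]
        q_update(merged_q, s, a, r, s2)
    for ag in agents:                          # every agent continues from the merged result
        ag.q, ag.model = dict(merged_q), dict(merged_model)

def train(total_steps=400, merge_every=20, seed=0):
    rng = random.Random(seed)
    agents = [Agent(False, rng), Agent(True, rng)]  # one exploiter, one explorer
    for t in range(1, total_steps + 1):
        for ag in agents:                      # interleaving simulates parallel learning
            ag.act()
        if t % merge_every == 0:
            merge_and_plan(agents, rng)
    return agents[0].q

def greedy_policy(q):
    """Greedy action for every non-goal state."""
    return [max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(N_STATES - 1)]
```

Calling `greedy_policy(train())` returns the greedy action for each non-goal state of the toy corridor. The split of roles is the point of the sketch: the explorer fills in the shared model systematically, while the planning backups after each merge propagate what it found to the exploiter.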

Keywords :

learning (artificial intelligence); multi-agent systems; Dyna-Q algorithm; exploitation agent; exploration agent; extended k-certainty exploration method; parallel reinforcement learning system; Learning; Dyna-Q; exploitation; exploration; extended k-certainty exploration method; parallel reinforcement learning

fLanguage :

English

Publisher :

IEEE

Conference_Title :

SICE, 2007 Annual Conference

Conference_Location :

Takamatsu

Print_ISBN :

978-4-907764-27-2

Electronic_ISBN :

978-4-907764-27-2

Type :

conf

DOI :

10.1109/SICE.2007.4421460

Filename :

4421460

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2644503