مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2808103

Title :

A State-Cluster Based Q-Learning

Author :

Jin, Zhao ; Liu, Weiyi ; Jin, Jian

Author_Institution :

Sch. of Inf. Sci. & Eng., Yunnan Univ., Kunming, China

Volume :

fYear :

2009

fDate :

14-16 Aug. 2009

Firstpage :

Lastpage :

Abstract :

When apply Q-learning to complex real-world problems, the learning process is long enough to make this method unpractical. The major cause is Q-learning requires the agent to visit every state-action transition infinitely often for making Q value convergent. We propose a state-cluster based Q-learning method to accelerate convergence and shorten learning process. This method creates the state-cluster for each state the agent reached according to the state trajectory that the agent wandered. By our algorithm, the state-cluster of a state would hold these acyclic shortest state paths from other states to this state. When a state´s Q value is refined in one step of the agent, the refined Q value can be propagated immediately back to all these states in its State-Cluster along the state paths between them, instead of requiring the agent to visit these states again. With the State-Cluster, more Q value can be refined in one step of the agent, which speeds up the convergence of Q value. The experiments compared with Q-learning demonstrate this method is extraordinarily more effective. This method is aimed Q-learning, but it is also applicable for most other reinforcement learning methods based value function iteration.

Keywords :

learning (artificial intelligence); software agents; Q value convergent; Q-learning; reinforcement learning; state trajectory; state-action transition; state-cluster; value function iteration; Accelerated aging; Acceleration; Autonomous agents; Convergence; Costs; Information science; Intelligent agent; Learning; Space exploration; State-space methods;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Natural Computation, 2009. ICNC '09. Fifth International Conference on

Conference_Location :

Tianjin

Print_ISBN :

978-0-7695-3736-8

Type :

conf

DOI :

10.1109/ICNC.2009.405

Filename :

5362836

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2808103