Title :
A State-Cluster Based Q-Learning
Author :
Jin, Zhao ; Liu, Weiyi ; Jin, Jian
Author_Institution :
Sch. of Inf. Sci. & Eng., Yunnan Univ., Kunming, China
Abstract :
When apply Q-learning to complex real-world problems, the learning process is long enough to make this method unpractical. The major cause is Q-learning requires the agent to visit every state-action transition infinitely often for making Q value convergent. We propose a state-cluster based Q-learning method to accelerate convergence and shorten learning process. This method creates the state-cluster for each state the agent reached according to the state trajectory that the agent wandered. By our algorithm, the state-cluster of a state would hold these acyclic shortest state paths from other states to this state. When a state´s Q value is refined in one step of the agent, the refined Q value can be propagated immediately back to all these states in its State-Cluster along the state paths between them, instead of requiring the agent to visit these states again. With the State-Cluster, more Q value can be refined in one step of the agent, which speeds up the convergence of Q value. The experiments compared with Q-learning demonstrate this method is extraordinarily more effective. This method is aimed Q-learning, but it is also applicable for most other reinforcement learning methods based value function iteration.
Keywords :
learning (artificial intelligence); software agents; Q value convergent; Q-learning; reinforcement learning; state trajectory; state-action transition; state-cluster; value function iteration; Accelerated aging; Acceleration; Autonomous agents; Convergence; Costs; Information science; Intelligent agent; Learning; Space exploration; State-space methods;
Conference_Titel :
Natural Computation, 2009. ICNC '09. Fifth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3736-8
DOI :
10.1109/ICNC.2009.405