• DocumentCode
    2808103
  • Title

    A State-Cluster Based Q-Learning

  • Author

    Jin, Zhao ; Liu, Weiyi ; Jin, Jian

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Yunnan Univ., Kunming, China
  • Volume
    1
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    44
  • Lastpage
    49
  • Abstract
    When apply Q-learning to complex real-world problems, the learning process is long enough to make this method unpractical. The major cause is Q-learning requires the agent to visit every state-action transition infinitely often for making Q value convergent. We propose a state-cluster based Q-learning method to accelerate convergence and shorten learning process. This method creates the state-cluster for each state the agent reached according to the state trajectory that the agent wandered. By our algorithm, the state-cluster of a state would hold these acyclic shortest state paths from other states to this state. When a state´s Q value is refined in one step of the agent, the refined Q value can be propagated immediately back to all these states in its State-Cluster along the state paths between them, instead of requiring the agent to visit these states again. With the State-Cluster, more Q value can be refined in one step of the agent, which speeds up the convergence of Q value. The experiments compared with Q-learning demonstrate this method is extraordinarily more effective. This method is aimed Q-learning, but it is also applicable for most other reinforcement learning methods based value function iteration.
  • Keywords
    learning (artificial intelligence); software agents; Q value convergent; Q-learning; reinforcement learning; state trajectory; state-action transition; state-cluster; value function iteration; Accelerated aging; Acceleration; Autonomous agents; Convergence; Costs; Information science; Intelligent agent; Learning; Space exploration; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation, 2009. ICNC '09. Fifth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3736-8
  • Type

    conf

  • DOI
    10.1109/ICNC.2009.405
  • Filename
    5362836