• DocumentCode
    2022404
  • Title

    An Optimized Q-Learning Algorithm Based on the Thinking of Tabu Search

  • Author

    Zhang, Xiaogang ; Liu, Zhijing

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Xidian Univ., Xian
  • Volume
    1
  • fYear
    2008
  • fDate
    17-18 Oct. 2008
  • Firstpage
    533
  • Lastpage
    536
  • Abstract
    One core issue in reinforcement learning is the balance between exploration and exploitation. Pure exploitation makes the agent reach the partial optimal solution quickly. Exploration avoids the partial optimal solution but too much exploration will reduce the performance of the Q -learning algorithm. How to avoid the partial optimal solution and find the global optimum solution is one of key goals of action selection in Q-learning. In this paper, the thinking of tabu search algorithm is introduced in order to balance exploration and exploitation of Q-learning. The optimized algorithms called T-Q-learning is proved to have a faster convergence rate and avoid the partial optimal solution in the experiments.
  • Keywords
    convergence; learning (artificial intelligence); search problems; convergence rate; optimized Q-learning algorithm; partial optimal solution; reinforcement learning; tabu search; Accelerated aging; Algorithm design and analysis; Approximation algorithms; Computational intelligence; Computer science; Convergence; Design optimization; Learning systems; Q-learning; reinforcement learning; tabu search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Design, 2008. ISCID '08. International Symposium on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3311-7
  • Type

    conf

  • DOI
    10.1109/ISCID.2008.179
  • Filename
    4725666