• DocumentCode
    2964233
  • Title

    Using the Q-learning algorithm in the constructive phase of the GRASP and reactive GRASP metaheuristics

  • Author

    de Lima, Francisco Chagas, Jr. ; De Melo, Jorge Dantas ; Neto, Adrião Duarte Doria

  • Author_Institution
    Dept. of Comput., State Univ. of Rio Grande do Norte, Rio Grande
  • fYear
    2008
  • fDate
    1-8 June 2008
  • Firstpage
    4169
  • Lastpage
    4176
  • Abstract
    Currently many non-tractable considered problems have been solved satisfactorily through methods of approximate optimization called metaheuristic. These methods use non-deterministic approaches that find good solutions which, however, do not guarantee the determination of the global optimum. The success of a metaheuristic is conditioned by capacity to adequately alternate between exploration and exploitation of the solution space. A way to guide such algorithms while searching for better solutions is supplying them with more knowledge of the solution space (environment of the problem). This can to be made in terms of a mapping of such environment in states and actions using reinforcement learning. This paper proposes the use of a technique of reinforcement learning - Q-learning algorithm - for the constructive phase of GRASP and reactive GRASP metaheuristic. The proposed methods will be applied to the symmetrical traveling salesman problem.
  • Keywords
    greedy algorithms; iterative methods; learning (artificial intelligence); optimisation; randomised algorithms; search problems; Q-learning algorithm; approximate optimization; greedy randomized adaptive search procedure; reactive GRASP metaheuristics; reinforcement learning; traveling salesman problem; Circuit testing; Cost function; Learning; Minimization methods; Optimization methods; Robustness; Statistical analysis; Traveling salesman problems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-1820-6
  • Electronic_ISBN
    1098-7576
  • Type

    conf

  • DOI
    10.1109/IJCNN.2008.4634399
  • Filename
    4634399