Using the Q-learning algorithm in the constructive phase of the GRASP and reactive GRASP metaheuristics

Author

de Lima, Francisco Chagas, Jr. ; De Melo, Jorge Dantas ; Neto, Adrião Duarte Doria

Author_Institution

Dept. of Comput., State Univ. of Rio Grande do Norte, Rio Grande

fYear

2008

fDate

1-8 June 2008

Firstpage

4169

Lastpage

4176

Abstract

Currently many non-tractable considered problems have been solved satisfactorily through methods of approximate optimization called metaheuristic. These methods use non-deterministic approaches that find good solutions which, however, do not guarantee the determination of the global optimum. The success of a metaheuristic is conditioned by capacity to adequately alternate between exploration and exploitation of the solution space. A way to guide such algorithms while searching for better solutions is supplying them with more knowledge of the solution space (environment of the problem). This can to be made in terms of a mapping of such environment in states and actions using reinforcement learning. This paper proposes the use of a technique of reinforcement learning - Q-learning algorithm - for the constructive phase of GRASP and reactive GRASP metaheuristic. The proposed methods will be applied to the symmetrical traveling salesman problem.

Keywords

greedy algorithms; iterative methods; learning (artificial intelligence); optimisation; randomised algorithms; search problems; Q-learning algorithm; approximate optimization; greedy randomized adaptive search procedure; reactive GRASP metaheuristics; reinforcement learning; traveling salesman problem; Circuit testing; Cost function; Learning; Minimization methods; Optimization methods; Robustness; Statistical analysis; Traveling salesman problems;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on

Conference_Location

Hong Kong

ISSN

1098-7576

Print_ISBN

978-1-4244-1820-6

Electronic_ISBN

1098-7576

Type

conf

DOI

10.1109/IJCNN.2008.4634399

Filename

4634399