Title :
Using the Q-learning algorithm in the constructive phase of the GRASP and reactive GRASP metaheuristics
Author :
de Lima, Francisco Chagas, Jr. ; De Melo, Jorge Dantas ; Neto, Adrião Duarte Doria
Author_Institution :
Dept. of Comput., State Univ. of Rio Grande do Norte, Rio Grande
Abstract :
Currently many non-tractable considered problems have been solved satisfactorily through methods of approximate optimization called metaheuristic. These methods use non-deterministic approaches that find good solutions which, however, do not guarantee the determination of the global optimum. The success of a metaheuristic is conditioned by capacity to adequately alternate between exploration and exploitation of the solution space. A way to guide such algorithms while searching for better solutions is supplying them with more knowledge of the solution space (environment of the problem). This can to be made in terms of a mapping of such environment in states and actions using reinforcement learning. This paper proposes the use of a technique of reinforcement learning - Q-learning algorithm - for the constructive phase of GRASP and reactive GRASP metaheuristic. The proposed methods will be applied to the symmetrical traveling salesman problem.
Keywords :
greedy algorithms; iterative methods; learning (artificial intelligence); optimisation; randomised algorithms; search problems; Q-learning algorithm; approximate optimization; greedy randomized adaptive search procedure; reactive GRASP metaheuristics; reinforcement learning; traveling salesman problem; Circuit testing; Cost function; Learning; Minimization methods; Optimization methods; Robustness; Statistical analysis; Traveling salesman problems;
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4634399