DocumentCode
2730099
Title
A Population Based Rewarding for Reinforcement Learning to Control Genetic Algorithms
Author
Sakurai, Yasushi ; Tsuruta, Setsuo
Author_Institution
Sch. of Inf. Environ., Tokyo Denki Univ., Chiba, Japan
fYear
2012
fDate
25-29 Nov. 2012
Firstpage
686
Lastpage
691
Abstract
The effectiveness of Genetic Algorithms (GA) heavily depends on the appropriate setting of its parameters. Moreover, optimal values for these parameters depend on both the type of GA and the application problem pattern and must be developed for each particular setting one by one. Therefore it requires special expertise and many experiments to validate the parameter setting. In order to solve this problem, a new method called "adaptive parameter control" was proposed, which adaptively controls parameters of an evolutionary algorithm. However, since this method just increases the selection probability of a search operator that generated a well evaluated individual, this is apt to be a shortsighted optimization method. On the contrary, a method is proposed to realize longsighted optimal parameter control of GA using Reinforcement Learning (RL). However, this method does neither consider the calculation cost of search operators nor population search characteristics of GA. Here, we propose a refined RL method for parameter control, in which (1) the reward decision rules are elaborately incorporated under the consideration of GA\´s population search characteristics and (2) the calculation cost of the search operator is taken into account. It is expected that this method can efficiently learn parameters to optimally select search operators of GA for approximately solving Traveling Salesman Problems (TSPs).
Keywords
adaptive control; genetic algorithms; learning systems; probability; search problems; travelling salesman problems; GA population search characteristics; TSP; adaptive parameter control; application problem pattern; evolutionary algorithm; genetic algorithms; longsighted optimal parameter control; population based rewarding; refined RL method; reinforcement learning; search operator; selection probability; shortsighted optimization method; traveling salesman problems; Equations; Genetic algorithms; Genetics; Learning; Mathematical model; Sociology; Statistics; Genetic Algorithm (GA); Parameter Control; Reinforcement Learning; Traveling Salesman Problems (TSP);
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Image Technology and Internet Based Systems (SITIS), 2012 Eighth International Conference on
Conference_Location
Naples
Print_ISBN
978-1-4673-5152-2
Type
conf
DOI
10.1109/SITIS.2012.104
Filename
6395158
Link To Document