DocumentCode
2460299
Title
Instance-Based Policy Search using Binomial Distribution Crossover and Iterated Refreshment
Author
Tsuchiya, Chikara ; Ikeda, Ken-ichi ; Jun Sakuma ; Ono, Isao ; Kobayashi, S.
Author_Institution
Tokyo Institute of Technology, Yokohama-city, Japan, Email: tsuchiya@fe.dis.titech.ac.jp
fYear
0
fDate
0-0 0
Firstpage
378
Lastpage
385
Abstract
This paper describes a GA based lazy approach toward reinforcement learning. This approach employs data-driven policy, which is composed of an instance set and an instance-based action selector. This feature provides a number of advantages. However some difficulties remain uninvestigated. One of them is the huge and complicated search space. We have an idea that preserving characteristics of the GA population and introducing new characteristics can overcome these difficulties. On the basis of this idea, we propose two genetic operators; Binomial Distribution Crossover (BDX) and iterated refreshment. The BDX generates the descendants inheriting the parents´ characteristics and the iterated refreshment introduces new characteristics greedily. The GA powered by these operators was applied to the benchmark tasks to demonstrate the ability. Each operator also was investigated and discussed from the various perspectives. Finally, we provide the preferable parameter settings for our method.
Keywords
binomial distribution; genetic algorithms; learning (artificial intelligence); binomial distribution crossover; data-driven policy; genetic operators; instance set; instance-based action selector; instance-based policy search; iterated refreshment; lazy approach; reinforcement learning; Character generation; Computational complexity; Convergence; Costs; Data structures; Genetics; Learning; Pareto optimization; Robustness;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation, 2006. CEC 2006. IEEE Congress on
Conference_Location
Vancouver, BC
Print_ISBN
0-7803-9487-9
Type
conf
DOI
10.1109/CEC.2006.1688333
Filename
1688333
Link To Document