Instance-Based Policy Search using Binomial Distribution Crossover and Iterated Refreshment

Author

Tsuchiya, Chikara ; Ikeda, Ken-ichi ; Jun Sakuma ; Ono, Isao ; Kobayashi, S.

Author_Institution

Tokyo Institute of Technology, Yokohama-city, Japan, Email: tsuchiya@fe.dis.titech.ac.jp

fYear

0

fDate

0-0 0

Firstpage

378

Lastpage

385

Abstract

This paper describes a GA based lazy approach toward reinforcement learning. This approach employs data-driven policy, which is composed of an instance set and an instance-based action selector. This feature provides a number of advantages. However some difficulties remain uninvestigated. One of them is the huge and complicated search space. We have an idea that preserving characteristics of the GA population and introducing new characteristics can overcome these difficulties. On the basis of this idea, we propose two genetic operators; Binomial Distribution Crossover (BDX) and iterated refreshment. The BDX generates the descendants inheriting the parents´ characteristics and the iterated refreshment introduces new characteristics greedily. The GA powered by these operators was applied to the benchmark tasks to demonstrate the ability. Each operator also was investigated and discussed from the various perspectives. Finally, we provide the preferable parameter settings for our method.

Keywords

binomial distribution; genetic algorithms; learning (artificial intelligence); binomial distribution crossover; data-driven policy; genetic operators; instance set; instance-based action selector; instance-based policy search; iterated refreshment; lazy approach; reinforcement learning; Character generation; Computational complexity; Convergence; Costs; Data structures; Genetics; Learning; Pareto optimization; Robustness;

fLanguage

English

Publisher

ieee

Conference_Titel

Evolutionary Computation, 2006. CEC 2006. IEEE Congress on

Conference_Location

Vancouver, BC

Print_ISBN

0-7803-9487-9

Type

conf

DOI

10.1109/CEC.2006.1688333

Filename

1688333