• DocumentCode
    2460299
  • Title

    Instance-Based Policy Search using Binomial Distribution Crossover and Iterated Refreshment

  • Author

    Tsuchiya, Chikara ; Ikeda, Ken-ichi ; Jun Sakuma ; Ono, Isao ; Kobayashi, S.

  • Author_Institution
    Tokyo Institute of Technology, Yokohama-city, Japan, Email: tsuchiya@fe.dis.titech.ac.jp
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    378
  • Lastpage
    385
  • Abstract
    This paper describes a GA based lazy approach toward reinforcement learning. This approach employs data-driven policy, which is composed of an instance set and an instance-based action selector. This feature provides a number of advantages. However some difficulties remain uninvestigated. One of them is the huge and complicated search space. We have an idea that preserving characteristics of the GA population and introducing new characteristics can overcome these difficulties. On the basis of this idea, we propose two genetic operators; Binomial Distribution Crossover (BDX) and iterated refreshment. The BDX generates the descendants inheriting the parents´ characteristics and the iterated refreshment introduces new characteristics greedily. The GA powered by these operators was applied to the benchmark tasks to demonstrate the ability. Each operator also was investigated and discussed from the various perspectives. Finally, we provide the preferable parameter settings for our method.
  • Keywords
    binomial distribution; genetic algorithms; learning (artificial intelligence); binomial distribution crossover; data-driven policy; genetic operators; instance set; instance-based action selector; instance-based policy search; iterated refreshment; lazy approach; reinforcement learning; Character generation; Computational complexity; Convergence; Costs; Data structures; Genetics; Learning; Pareto optimization; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2006. CEC 2006. IEEE Congress on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    0-7803-9487-9
  • Type

    conf

  • DOI
    10.1109/CEC.2006.1688333
  • Filename
    1688333