• DocumentCode
    2632348
  • Title

    Evolutionary approach of reward function for reinforcement learning using genetic programming

  • Author

    Sumino, Shota ; Mutoh, Atsuko ; Kato, Shohei

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya, Japan
  • fYear
    2011
  • fDate
    6-9 Nov. 2011
  • Firstpage
    385
  • Lastpage
    390
  • Abstract
    In recent year, reinforcement learning, which acquires a behavior of robots has been drawing attention. A suitable behavior is autonomously acquired by using this system. Robots learn the suitable behavior by iterating action and receiving the evaluated value of that action. The evaluated value is calculated by reward function. In general reinforcement learning, we acquire a suitable behavior by setting the suitable reward function for each problem. However in previous research of reinforcement learning, most reward functions are constructed based on human´s heuristics. To construct reward functions, trial-and-error is needed, and it imposes an enormous drain on humans. Therefore we propose an approach, which automatically generate reward functions, using Genetic Programming. In this approach, we create a method evaluating reward functions. Reward functions are generated by Genetic Programming, and are evaluated by evaluating method. A suitable reward function is generated by evolution of these reward functions. In this paper, we conducted an experiment to confirm the effectiveness of proposed method. In the experiment, we generate a suitable reward function of a problem, which a route searching problem in a tile-world. Through the experiment, we confirm that the proposed approach can generate a suitable reward function, and the generated reward function can acquire a more suitable behavior in comparison with a reward function by constructed based on human´s heuristics.
  • Keywords
    genetic algorithms; learning systems; mobile robots; search problems; evolutionary approach; general reinforcement learning; genetic programming; human heuristics; reward function; robot behavior; route searching problem; trial-and-error; Genetics; Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Micro-NanoMechatronics and Human Science (MHS), 2011 International Symposium on
  • Conference_Location
    Nagoya
  • ISSN
    Pending
  • Print_ISBN
    978-1-4577-1360-6
  • Type

    conf

  • DOI
    10.1109/MHS.2011.6102214
  • Filename
    6102214