Evolutionary approach of reward function for reinforcement learning using genetic programming

Author

Sumino, Shota ; Mutoh, Atsuko ; Kato, Shohei

Author_Institution

Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya, Japan

fYear

2011

fDate

6-9 Nov. 2011

Firstpage

385

Lastpage

390

Abstract

In recent year, reinforcement learning, which acquires a behavior of robots has been drawing attention. A suitable behavior is autonomously acquired by using this system. Robots learn the suitable behavior by iterating action and receiving the evaluated value of that action. The evaluated value is calculated by reward function. In general reinforcement learning, we acquire a suitable behavior by setting the suitable reward function for each problem. However in previous research of reinforcement learning, most reward functions are constructed based on human´s heuristics. To construct reward functions, trial-and-error is needed, and it imposes an enormous drain on humans. Therefore we propose an approach, which automatically generate reward functions, using Genetic Programming. In this approach, we create a method evaluating reward functions. Reward functions are generated by Genetic Programming, and are evaluated by evaluating method. A suitable reward function is generated by evolution of these reward functions. In this paper, we conducted an experiment to confirm the effectiveness of proposed method. In the experiment, we generate a suitable reward function of a problem, which a route searching problem in a tile-world. Through the experiment, we confirm that the proposed approach can generate a suitable reward function, and the generated reward function can acquire a more suitable behavior in comparison with a reward function by constructed based on human´s heuristics.

Keywords

genetic algorithms; learning systems; mobile robots; search problems; evolutionary approach; general reinforcement learning; genetic programming; human heuristics; reward function; robot behavior; route searching problem; trial-and-error; Genetics; Robots;

fLanguage

English

Publisher

ieee

Conference_Titel

Micro-NanoMechatronics and Human Science (MHS), 2011 International Symposium on

Conference_Location

Nagoya

ISSN

Pending

Print_ISBN

978-1-4577-1360-6

Type

conf

DOI

10.1109/MHS.2011.6102214

Filename

6102214