DocumentCode
2632348
Title
Evolutionary approach of reward function for reinforcement learning using genetic programming
Author
Sumino, Shota ; Mutoh, Atsuko ; Kato, Shohei
Author_Institution
Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya, Japan
fYear
2011
fDate
6-9 Nov. 2011
Firstpage
385
Lastpage
390
Abstract
In recent year, reinforcement learning, which acquires a behavior of robots has been drawing attention. A suitable behavior is autonomously acquired by using this system. Robots learn the suitable behavior by iterating action and receiving the evaluated value of that action. The evaluated value is calculated by reward function. In general reinforcement learning, we acquire a suitable behavior by setting the suitable reward function for each problem. However in previous research of reinforcement learning, most reward functions are constructed based on human´s heuristics. To construct reward functions, trial-and-error is needed, and it imposes an enormous drain on humans. Therefore we propose an approach, which automatically generate reward functions, using Genetic Programming. In this approach, we create a method evaluating reward functions. Reward functions are generated by Genetic Programming, and are evaluated by evaluating method. A suitable reward function is generated by evolution of these reward functions. In this paper, we conducted an experiment to confirm the effectiveness of proposed method. In the experiment, we generate a suitable reward function of a problem, which a route searching problem in a tile-world. Through the experiment, we confirm that the proposed approach can generate a suitable reward function, and the generated reward function can acquire a more suitable behavior in comparison with a reward function by constructed based on human´s heuristics.
Keywords
genetic algorithms; learning systems; mobile robots; search problems; evolutionary approach; general reinforcement learning; genetic programming; human heuristics; reward function; robot behavior; route searching problem; trial-and-error; Genetics; Robots;
fLanguage
English
Publisher
ieee
Conference_Titel
Micro-NanoMechatronics and Human Science (MHS), 2011 International Symposium on
Conference_Location
Nagoya
ISSN
Pending
Print_ISBN
978-1-4577-1360-6
Type
conf
DOI
10.1109/MHS.2011.6102214
Filename
6102214
Link To Document