Regional Information Center for Science and Technology - Genetic Programming for Reward Function Search

DocumentCode :

1503968

Title :

Genetic Programming for Reward Function Search

Author :

Niekum, Scott ; Barto, Andrew G. ; Spector, Lee

Author_Institution :

Dept. of Comput. Sci., Univ. of Massachusetts, Amherst, MA, USA

Volume :

Issue :

fYear :

2010

fDate :

6/1/2010

Firstpage :

Lastpage :

Abstract :

Reward functions in reinforcement learning have largely been assumed given as part of the problem being solved by the agent. However, the psychological notion of intrinsic motivation has recently inspired inquiry into whether there exist alternate reward functions that enable an agent to learn a task more easily than the natural task-based reward function allows. This paper presents a genetic programming algorithm to search for alternate reward functions that improve agent learning performance. We present experiments that show the superiority of these reward functions, demonstrate the possible scalability of our method, and define three classes of problems where reward function search might be particularly useful: distributions of environments, nonstationary environments, and problems with short agent lifetimes.

Keywords :

genetic algorithms; learning (artificial intelligence); agent learning performance; genetic programming algorithm; intrinsic motivation; nonstationary environment; psychological notion; reinforcement learning; task-based reward function; genetic programming

fLanguage :

English

Journal_Title :

Autonomous Mental Development, IEEE Transactions on

Publisher :

IEEE

ISSN :

1943-0604

Type :

jour

DOI :

10.1109/TAMD.2010.2051436

Filename :

5473118

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1503968