Title :
Genetic Programming for Reward Function Search
Author :
Niekum, Scott ; Barto, Andrew G. ; Spector, Lee
Author_Institution :
Dept. of Comput. Sci., Univ. of Massachusetts, Amherst, MA, USA
fDate :
6/1/2010 12:00:00 AM
Abstract :
Reward functions in reinforcement learning have largely been assumed given as part of the problem being solved by the agent. However, the psychological notion of intrinsic motivation has recently inspired inquiry into whether there exist alternate reward functions that enable an agent to learn a task more easily than the natural task-based reward function allows. This paper presents a genetic programming algorithm to search for alternate reward functions that improve agent learning performance. We present experiments that show the superiority of these reward functions, demonstrate the possible scalability of our method, and define three classes of problems where reward function search might be particularly useful: distributions of environments, nonstationary environments, and problems with short agent lifetimes.
Keywords :
genetic algorithms; learning (artificial intelligence); agent learning performance; genetic programming algorithm; intrinsic motivation; nonstationary environment; psychological notion; reinforcement learning; task based reward function; Genetic programming; intrinsic motivation; reinforcement learning;
Journal_Title :
Autonomous Mental Development, IEEE Transactions on
DOI :
10.1109/TAMD.2010.2051436