DocumentCode :
2842774
Title :
Evolutionary Computation on Multitask Reinforcement Learning Problems
Author :
Handa, Hisashi
Author_Institution :
Okayama Univ., Okayama
fYear :
2007
fDate :
15-17 April 2007
Firstpage :
685
Lastpage :
688
Abstract :
Recently, Multitask learning, which can cope with several tasks, has attracted much attention. Multitask Reinforcement Learning introduced by Tanaka et al is a problem class where number of problem instances of Markov Decision Processes sampled from the same probability distributions is sequentially given to reinforcement learning agents. The purpose of solving this problem is to realize adaptive agents for newly given environments by using knowledge acquired from past experience. Evolutionary Algorithms are often used to solve reinforcement learning problems if problem classes are quite different with Markov Decision Processes or state-action space is quite huge. From the viewpoint of Evolutionary Algorithms studies, the Multitask Reinforcement Learning problems are regarded as dynamic problems whose fitness landscape has changed temporally. In this paper, a memory-based Evolutionary Programming which is suitable for Multitask Reinforcement Learning problems is proposed.
Keywords :
Markov processes; evolutionary computation; learning (artificial intelligence); Markov decision processes; evolutionary algorithms; evolutionary computation; memory-based evolutionary programming; multitask reinforcement learning; probability distributions; state-action space; Computational modeling; Evolutionary computation; Gaussian distribution; Genetic mutations; Genetic programming; Learning systems; Noise robustness; Probability distribution; Random variables; Working environment noise; Dynamic Environments; Evolutionary Algorithms; Multitask Reinforcement Learning Problems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Networking, Sensing and Control, 2007 IEEE International Conference on
Conference_Location :
London
Print_ISBN :
1-4244-1076-2
Electronic_ISBN :
1-4244-1076-2
Type :
conf
DOI :
10.1109/ICNSC.2007.372862
Filename :
4239075
Link To Document :
بازگشت