DocumentCode :
2526082
Title :
Realizing Undelayed N-step TD prediction with neural networks
Author :
Zuters, Janis
Author_Institution :
Fac. of Comput., Univ. of Latvia, Riga, Latvia
fYear :
2010
fDate :
26-28 April 2010
Firstpage :
102
Lastpage :
106
Abstract :
There exist various techniques to extend reinforcement learning algorithms, e.g., eligibility traces and planning. In this paper, an approach is proposed, which combines several extension techniques, such as using eligibility-like traces, using approximators as value functions and exploiting the model of the environment. The obtained method, `Undelayed n-step TD prediction´ (TD-P), has produced competitive results when put in conditions of not fully observable environment.
Keywords :
learning (artificial intelligence); neural nets; eligibility planning; eligibility traces; neural networks; realizing undelayed N-step TD prediction; reinforcement learning algorithms; Computer networks; Delay; Dynamic programming; Machine learning; Multi-layer neural network; Multilayer perceptrons; Neural networks; Predictive models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
MELECON 2010 - 2010 15th IEEE Mediterranean Electrotechnical Conference
Conference_Location :
Valletta
Print_ISBN :
978-1-4244-5793-9
Type :
conf
DOI :
10.1109/MELCON.2010.5476332
Filename :
5476332
Link To Document :
بازگشت