DocumentCode :
3500312
Title :
A neural architecture to address Reinforcement Learning problems
Author :
De Arruda, Rodrigo L S ; Zuben, Fernando J Von
Author_Institution :
Dept. of Comput. Eng. & Ind. Autom., Univ. of Campinas (Unicamp), Campinas, Brazil
fYear :
2011
fDate :
July 31 2011-Aug. 5 2011
Firstpage :
2930
Lastpage :
2935
Abstract :
In this paper, the Reinforcement Learning problem is formulated equivalently to a Markov Decision Process. We address the solution of such problem using a novel Adaptive Dynamic Programming algorithm which is based on a Multilayer Perceptron Neural Network composed of a parameterized function approximator called Wire-Fitting. Extending such established model, this work makes use of concepts of eligibility to conceive faster learning algorithms. The advantage of the proposed approach is founded on the capability to handle continuous environments and to learn a better policy while following another. Simulation results involving the automatic control of an inverted pendulum are presented to indicate the effectiveness of the proposed algorithm.
Keywords :
Markov processes; dynamic programming; learning (artificial intelligence); multilayer perceptrons; neural net architecture; Markov decision process; adaptive dynamic programming; automatic control; inverted pendulum; learning algorithm; multilayer perceptron neural network; neural architecture; parameterized function approximator; reinforcement learning; wire-fitting; Approximation methods; Dynamic programming; Equations; Heuristic algorithms; Markov processes; Mathematical model; Monte Carlo methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2011 International Joint Conference on
Conference_Location :
San Jose, CA
ISSN :
2161-4393
Print_ISBN :
978-1-4244-9635-8
Type :
conf
DOI :
10.1109/IJCNN.2011.6033606
Filename :
6033606
Link To Document :
بازگشت