DocumentCode
3500312
Title
A neural architecture to address Reinforcement Learning problems
Author
De Arruda, Rodrigo L S ; Zuben, Fernando J Von
Author_Institution
Dept. of Comput. Eng. & Ind. Autom., Univ. of Campinas (Unicamp), Campinas, Brazil
fYear
2011
fDate
July 31 2011-Aug. 5 2011
Firstpage
2930
Lastpage
2935
Abstract
In this paper, the Reinforcement Learning problem is formulated equivalently to a Markov Decision Process. We address the solution of such problem using a novel Adaptive Dynamic Programming algorithm which is based on a Multilayer Perceptron Neural Network composed of a parameterized function approximator called Wire-Fitting. Extending such established model, this work makes use of concepts of eligibility to conceive faster learning algorithms. The advantage of the proposed approach is founded on the capability to handle continuous environments and to learn a better policy while following another. Simulation results involving the automatic control of an inverted pendulum are presented to indicate the effectiveness of the proposed algorithm.
Keywords
Markov processes; dynamic programming; learning (artificial intelligence); multilayer perceptrons; neural net architecture; Markov decision process; adaptive dynamic programming; automatic control; inverted pendulum; learning algorithm; multilayer perceptron neural network; neural architecture; parameterized function approximator; reinforcement learning; wire-fitting; Approximation methods; Dynamic programming; Equations; Heuristic algorithms; Markov processes; Mathematical model; Monte Carlo methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), The 2011 International Joint Conference on
Conference_Location
San Jose, CA
ISSN
2161-4393
Print_ISBN
978-1-4244-9635-8
Type
conf
DOI
10.1109/IJCNN.2011.6033606
Filename
6033606
Link To Document