مرکز منطقه ای اطلاع رساني علوم و فناوري - A neural architecture to address Reinforcement Learning problems

DocumentCode :

3500312

Title :

A neural architecture to address Reinforcement Learning problems

Author :

De Arruda, Rodrigo L S ; Zuben, Fernando J Von

Author_Institution :

Dept. of Comput. Eng. & Ind. Autom., Univ. of Campinas (Unicamp), Campinas, Brazil

fYear :

2011

fDate :

July 31 2011-Aug. 5 2011

Firstpage :

2930

Lastpage :

2935

Abstract :

In this paper, the Reinforcement Learning problem is formulated equivalently to a Markov Decision Process. We address the solution of such problem using a novel Adaptive Dynamic Programming algorithm which is based on a Multilayer Perceptron Neural Network composed of a parameterized function approximator called Wire-Fitting. Extending such established model, this work makes use of concepts of eligibility to conceive faster learning algorithms. The advantage of the proposed approach is founded on the capability to handle continuous environments and to learn a better policy while following another. Simulation results involving the automatic control of an inverted pendulum are presented to indicate the effectiveness of the proposed algorithm.

Keywords :

Markov processes; dynamic programming; learning (artificial intelligence); multilayer perceptrons; neural net architecture; Markov decision process; adaptive dynamic programming; automatic control; inverted pendulum; learning algorithm; multilayer perceptron neural network; neural architecture; parameterized function approximator; reinforcement learning; wire-fitting; Approximation methods; Dynamic programming; Equations; Heuristic algorithms; Markov processes; Mathematical model; Monte Carlo methods;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), The 2011 International Joint Conference on

Conference_Location :

San Jose, CA

ISSN :

2161-4393

Print_ISBN :

978-1-4244-9635-8

Type :

conf

DOI :

10.1109/IJCNN.2011.6033606

Filename :

6033606

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3500312