• DocumentCode
    3500312
  • Title

    A neural architecture to address Reinforcement Learning problems

  • Author

    De Arruda, Rodrigo L S ; Zuben, Fernando J Von

  • Author_Institution
    Dept. of Comput. Eng. & Ind. Autom., Univ. of Campinas (Unicamp), Campinas, Brazil
  • fYear
    2011
  • fDate
    July 31 2011-Aug. 5 2011
  • Firstpage
    2930
  • Lastpage
    2935
  • Abstract
    In this paper, the Reinforcement Learning problem is formulated equivalently to a Markov Decision Process. We address the solution of such problem using a novel Adaptive Dynamic Programming algorithm which is based on a Multilayer Perceptron Neural Network composed of a parameterized function approximator called Wire-Fitting. Extending such established model, this work makes use of concepts of eligibility to conceive faster learning algorithms. The advantage of the proposed approach is founded on the capability to handle continuous environments and to learn a better policy while following another. Simulation results involving the automatic control of an inverted pendulum are presented to indicate the effectiveness of the proposed algorithm.
  • Keywords
    Markov processes; dynamic programming; learning (artificial intelligence); multilayer perceptrons; neural net architecture; Markov decision process; adaptive dynamic programming; automatic control; inverted pendulum; learning algorithm; multilayer perceptron neural network; neural architecture; parameterized function approximator; reinforcement learning; wire-fitting; Approximation methods; Dynamic programming; Equations; Heuristic algorithms; Markov processes; Mathematical model; Monte Carlo methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2011 International Joint Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4244-9635-8
  • Type

    conf

  • DOI
    10.1109/IJCNN.2011.6033606
  • Filename
    6033606