• DocumentCode
    2498452
  • Title

    Adaptive dynamic programming with balanced weights seeking strategy

  • Author

    Fu, Jian ; He, Haibo ; Ni, Zhen

  • Author_Institution
    Sch. of Autom., Wuhan Univ. of Technol., Wuhan, China
  • fYear
    2011
  • fDate
    11-15 April 2011
  • Firstpage
    210
  • Lastpage
    217
  • Abstract
    In this paper we propose to integrate the recursive Levenberg-Marquardt method into the adaptive dynamic programming (ADP) design for improved learning and adaptive control performance. Our key motivation is to consider a balanced weight updating strategy with the consideration of both robustness and convergence during the online learning process. Specifically, a modified recursive Levenberg-Marquardt (LM) method is integrated into both the action network and critic network of the ADP design, and a detailed learning algorithm is proposed to implement this approach. We test the performance of our approach based on the triple link inverted pendulum, a popular benchmark in the community, to demonstrate online learning and control strategy. Experimental results and comparative study under different noise conditions demonstrate the effectiveness of this approach.
  • Keywords
    adaptive control; dynamic programming; learning systems; nonlinear control systems; ADP design; action network; adaptive control performance; adaptive dynamic programming design; balanced weights seeking strategy; critic network; learning control performance; recursive Levenberg-Marquardt method; triple link inverted pendulum; Algorithm design and analysis; Artificial neural networks; Convergence; Damping; Equations; Jacobian matrices; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4244-9887-1
  • Type

    conf

  • DOI
    10.1109/ADPRL.2011.5967373
  • Filename
    5967373