Title :
Adaptive dynamic programming with balanced weights seeking strategy
Author :
Fu, Jian ; He, Haibo ; Ni, Zhen
Author_Institution :
Sch. of Autom., Wuhan Univ. of Technol., Wuhan, China
Abstract :
In this paper we propose to integrate the recursive Levenberg-Marquardt method into the adaptive dynamic programming (ADP) design for improved learning and adaptive control performance. Our key motivation is to consider a balanced weight updating strategy with the consideration of both robustness and convergence during the online learning process. Specifically, a modified recursive Levenberg-Marquardt (LM) method is integrated into both the action network and critic network of the ADP design, and a detailed learning algorithm is proposed to implement this approach. We test the performance of our approach based on the triple link inverted pendulum, a popular benchmark in the community, to demonstrate online learning and control strategy. Experimental results and comparative study under different noise conditions demonstrate the effectiveness of this approach.
Keywords :
adaptive control; dynamic programming; learning systems; nonlinear control systems; ADP design; action network; adaptive control performance; adaptive dynamic programming design; balanced weights seeking strategy; critic network; learning control performance; recursive Levenberg-Marquardt method; triple link inverted pendulum; Algorithm design and analysis; Artificial neural networks; Convergence; Damping; Equations; Jacobian matrices; Robustness;
Conference_Titel :
Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9887-1
DOI :
10.1109/ADPRL.2011.5967373