DocumentCode :
583216
Title :
Reinforecement learning-based optimal tracking control for wheeled mobile robot
Author :
Luy, Nguyen Tan
Author_Institution :
Div. of Autom. Electron., Ho Chi Minh Univ. of Ind., Ho Chi Minh City, Vietnam
fYear :
2012
fDate :
27-31 May 2012
Firstpage :
371
Lastpage :
376
Abstract :
This paper proposes a new method to design a reinforcement learning-based integrated kinematic and dynamic tracking control scheme for a nonholonomic wheeled mobile robot. The scheme uses just only one neural network to design an online adaptive synchronous policy iteration algorithm implemented as an actor critic structure. Our tuning law for the single neural network not only learns online a tracking-HJB equation to approximate both the optimal cost and the optimal control law but also guarantees closed-loop stability in real-time. The convergence and stability of the overall system are proven by Lyapunov theory. The simulation results for wheeled mobile robot verify the effectiveness of the proposed controller.
Keywords :
Lyapunov methods; adaptive control; approximation theory; closed loop systems; control system synthesis; iterative methods; learning (artificial intelligence); mobile robots; neurocontrollers; position control; robot kinematics; stability; Lyapunov theory; actor critic structure; approximation; closed-loop stability; control design; kinematic scheme; neural network; nonholonomic wheeled mobile robot; online adaptive synchronous policy iteration algorithm; optimal tracking control; reinforcement learning; tuning law; wheeled mobile robot; Automation; Conferences; Control systems; Decision support systems; Intelligent systems; Adaptive critic; actor critic; mobile robot; neural network; policy iteration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cyber Technology in Automation, Control, and Intelligent Systems (CYBER), 2012 IEEE International Conference on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4673-1420-6
Type :
conf
DOI :
10.1109/CYBER.2012.6392582
Filename :
6392582
Link To Document :
بازگشت