DocumentCode
2856757
Title
Finite-horizon input-constrained nonlinear optimal control using single network adaptive critics
Author
Heydari, A. ; Balakrishnan, S.N.
Author_Institution
Mech. & Aerosp. Eng. Dept., Missouri Univ. of Sci. & Technol., Rolla, MO, USA
fYear
2011
fDate
June 29 2011-July 1 2011
Firstpage
3047
Lastpage
3052
Abstract
A single neural network based controller called the Finite-SNAC is developed in this study to synthesize finite-horizon optimal controllers for nonlinear control-affine systems. For satisfying the constraint on the input, a non-quadratic cost function is used. Inputs to the neural network are the current system states and the time-to-go and the network outputs are the costates which are used to compute the feedback control. Convergence of the reinforcement learning based training method to the optimal solution, the training error and the network weights are provided. The resulting controller is shown to solve the associated time-varying Hamilton-Jacobi-Bellman (HJB) equation and provide the fixed-final-time optimal solution. Performance of the new synthesis technique is demonstrated through an attitude control problem wherein a rigid spacecraft performs a finite time attitude maneuver subject to control bounds. The new formulation has a great potential for implementation since it consists of only one neural network with single set of weights and it provides comprehensive feedback solutions online though it is trained offline.
Keywords
feedback; learning (artificial intelligence); neurocontrollers; nonlinear control systems; optimal control; time-varying systems; HJB equation; attitude control problem; feedback control; finite time attitude maneuver; finite-SNAC; finite-horizon input-constrained control; finite-horizon optimal controller; fixed-final-time optimal solution; neural network based controller; nonlinear control-affine system; nonlinear optimal control; nonquadratic cost function; reinforcement learning based training; single network adaptive critics; spacecraft; time-varying Hamilton-Jacobi-Bellman equation; Convergence; Cost function; Equations; Mathematical model; Optimal control; Satellites; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
American Control Conference (ACC), 2011
Conference_Location
San Francisco, CA
ISSN
0743-1619
Print_ISBN
978-1-4577-0080-4
Type
conf
DOI
10.1109/ACC.2011.5991378
Filename
5991378
Link To Document