Title :
Optimal Control for Unknown Discrete-Time Nonlinear Markov Jump Systems Using Adaptive Dynamic Programming
Author :
Xiangnan Zhong ; Haibo He ; Huaguang Zhang ; Zhanshan Wang
Author_Institution :
Dept. of Electr., Univ. of Rhode Island, Kingston, RI, USA
Abstract :
In this paper, we develop and analyze an optimal control method for a class of discrete-time nonlinear Markov jump systems (MJSs) with unknown system dynamics. Specifically, an identifier is established for the unknown systems to approximate system states, and an optimal control approach for nonlinear MJSs is developed to solve the Hamilton-Jacobi-Bellman equation based on the adaptive dynamic programming technique. We also develop detailed stability analysis of the control approach, including the convergence of the performance index function for nonlinear MJSs and the existence of the corresponding admissible control. Neural network techniques are used to approximate the proposed performance index function and the control law. To demonstrate the effectiveness of our approach, three simulation studies, one linear case, one nonlinear case, and one single link robot arm case, are used to validate the performance of the proposed optimal control method.
Keywords :
Markov processes; approximation theory; discrete time systems; dynamic programming; neurocontrollers; optimal control; performance index; stability; time-varying systems; Hamilton-Jacobi-Bellman equation; adaptive dynamic programming; admissible control; control law; neural network techniques; nonlinear MJSs; one single link robot arm; optimal control; performance index function convergence; stability analysis; system state approximation; unknown discrete-time nonlinear Markov jump systems; unknown system dynamics; Convergence; Equations; Markov processes; Neural networks; Nonlinear systems; Optimal control; Performance analysis; Adaptive dynamic programming (ADP); Markov jump systems (MJSs); neural network; optimal control; state identifier; state identifier.;
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2014.2305841