مرکز منطقه ای اطلاع رساني علوم و فناوري - An integrated design for intensified direct heuristic dynamic programming

DocumentCode :

3269788

Title :

An integrated design for intensified direct heuristic dynamic programming

Author :

Xiong Luo ; Si, Jennie ; Yuchao Zhou

Author_Institution :

Sch. of Comput. & Commun. Eng., Univ. of Sci. & Technol. Beijing (USTB), Beijing, China

fYear :

2013

fDate :

16-19 April 2013

Firstpage :

183

Lastpage :

190

Abstract :

There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of machine intelligence system. As one of the ADP algorithms based on adaptive critic neural networks (NNs), the direct heuristic dynamic programming (direct HDP) has demonstrated some successful applications in solving realistic engineering control problems. In this study, based on a three-network architecture in which the reinforcement signal is approximated by an additional NN, a novel integrated design method for intensified direct HDP is developed. The new design approach is implemented by using multiple PID neural networks (PIDNNs), which effectively takes into account structural knowledge of system states and control that are usually present in a physical system. By using a Lyapunov stability approach, a uniformly ultimately boundedness (UUB) result is proved for our PIDNNs-based intensified direct HDP learning controller. Furthermore, the learning and control performances of the proposed design is tested using the popular cart-pole example to illustrate the key ideas of this paper.

Keywords :

Lyapunov methods; control system analysis; dynamic programming; neurocontrollers; stability; three-term control; ADP technique; Lyapunov stability approach; PIDNNs-based intensified direct HDP learning controller; UUB; adaptive critic neural networks; approximate dynamic programming; integrated design method; intensified direct heuristic dynamic programming; machine intelligence system; multiple PID neural networks; reinforcement signal; three-network architecture; uniformly ultimately boundedness; Algorithm design and analysis; Convergence; Dynamic programming; Educational institutions; Learning (artificial intelligence); Lyapunov methods; Neural networks; Direct heuristic dynamic programming; PID neural network; neural network; stability;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2013 IEEE Symposium on

Conference_Location :

Singapore

ISSN :

2325-1824

Type :

conf

DOI :

10.1109/ADPRL.2013.6615006

Filename :

6615006

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3269788