DocumentCode
2249948
Title
A kernel-based reinforcement learning approach to stochastic pole balancing control systems
Author
Xu, Xin ; Chengzhang, Peng ; Dai, Bin ; He, Han-gen
Author_Institution
Inst. of Autom., Nat. Univ. of Defense Technol., Changsha, China
fYear
2010
fDate
6-9 July 2010
Firstpage
1329
Lastpage
1334
Abstract
As a benchmark control problem with nonlinearity and instability, the controller design for inverted pendulums becomes more difficult when there are model uncertainties and unknown disturbances in the plant dynamics. In this paper, a kernel-based reinforcement learning controller is developed for inverted pendulums with unknown dynamics and stochastic disturbances. The learning controller makes use of approximate policy iteration with kernel-based least-squares temporal difference learning for policy evaluation. Due to the nonlinear approximation ability of kernel methods, good convergence property and learning efficiency can be realized in the approximate policy iteration process so that the controller performance can be optimized in a few iterations. Simulation results demonstrate that the proposed learning controller for stochastic inverted pendulums can achieve much better performance than previous learning control approaches such as Q-learning with function approximation and least-squares policy iteration (LSPI).
Keywords
approximation theory; control system synthesis; convergence; iterative methods; learning (artificial intelligence); learning systems; nonlinear control systems; pendulums; poles and zeros; stochastic systems; approximate policy iteration; controller design; convergence property; kernel-based least-squares temporal difference learning; kernel-based reinforcement learning controller; nonlinear approximation; stochastic inverted pendulums; stochastic pole balancing control systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Intelligent Mechatronics (AIM), 2010 IEEE/ASME International Conference on
Conference_Location
Montreal, ON
Print_ISBN
978-1-4244-8031-9
Type
conf
DOI
10.1109/AIM.2010.5695878
Filename
5695878
Link To Document