DocumentCode
3027041
Title
A stable Lyapunov constrained reinforcement learning based neural controller for non linear systems
Author
Kumar, Abhishek ; Sharma, Rajneesh
Author_Institution
ICE Div., NSIT, New Delhi, India
fYear
2015
fDate
15-16 May 2015
Firstpage
185
Lastpage
189
Abstract
This paper proposes a Lyapunov constrained neural network based reinforcement learning (RL) controller with guaranteed stability for non linear systems. Neural networks have been used as universal function approximators to deal with one of the core problem faced in RL commonly known as `The Curse of Dimensionality´. We propose to constrain controller action set to the one dictated by the Lyapunov stability theory to produce a controller with guaranteed stability. We prove that when the controller action set is constrained the cost function turns out to be a Lyapunov candidate function thereby guaranteeing stability of the controller. Proposed methodology has been applied to the benchmark inverted pendulum (IP) balancing problem to validate its effectiveness. Simulation results and comparison against baseline neural Q learning control brings out the effectiveness and viability of the proposed control scheme.
Keywords
Lyapunov methods; function approximation; learning (artificial intelligence); neurocontrollers; nonlinear control systems; stability; IP balancing problem; Lyapunov function; Lyapunov stability theory; RL controller; controller action set; cost function; curse-of-dimensionality; guaranteed stability; inverted pendulum balancing problem; nonlinear systems; stable Lyapunov constrained reinforcement learning based neural controller; universal function approximators; Artificial neural networks; Automation; Cost function; Learning (artificial intelligence); Linear systems; Torque; Inverted Pendulum; Lyapunov Theory; Neural Networks; Reinforcement Learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing, Communication & Automation (ICCCA), 2015 International Conference on
Conference_Location
Noida
Print_ISBN
978-1-4799-8889-1
Type
conf
DOI
10.1109/CCAA.2015.7148402
Filename
7148402
Link To Document