Title :
Improving reinforcement learning control via online bilinear action interpolation
Author :
Ribeiro, Carlos H C ; Hemerly, Elder M.
Author_Institution :
Div. de Engenharia Eletronica, Inst. Tecnologico de Aeronaut., Sao Paulo, Brazil
Abstract :
Reinforcement learning has been used as a reasonably successful method for the problem of model-free learning of action policies for some control problems. However, it is usually assumed that the process to be controlled is either open loop stable or of slow dynamics, when frequency of failures before acceptable performance or input-output processing time are not issues of primary importance. We consider the problem of model-free regulation for an unstable plant. As in many cases the need for state quantisation is an algorithmic storage requirement rather than a sensor limitation, we propose a modification of a standard reinforcement learning method that uses as additional information the distance between sampled and represented states, embedded in actions that are a result of a distance-wise local interpolation scheme. We obtained faster learning under minimal disturbance of the original learning scheme, and the modification is computationally modest enough to allow for real-time implementation
Keywords :
bilinear systems; control system synthesis; interpolation; learning (artificial intelligence); I/O processing time; action policies; algorithmic storage requirement; distance-wise local interpolation; input-output processing time; model-free learning; model-free regulation; online bilinear action interpolation; reinforcement learning control; state quantisation; unstable plant; Aerodynamics; Computational modeling; Computer networks; Heuristic algorithms; Interpolation; Learning systems; Navigation; Open loop systems; Process control; Quantization;
Conference_Titel :
Neural Networks, 1998. Proceedings. Vth Brazilian Symposium on
Conference_Location :
Belo Horizonte
Print_ISBN :
0-8186-8629-4
DOI :
10.1109/SBRN.1998.731002