Title :
An approach to exploit non-optimized data for efficient control of unknown systems through neural and kernel models
Author :
Cervellera, Cristiano ; Gaggero, Mauro ; Maccio, Danilo ; Marcialis, Roberto
Author_Institution :
Inst. of Intell. Syst. for Autom., Genoa, Italy
Abstract :
In this paper, efficient real-time control strategies are devised for systems with an unknown state equation, based only on a set of data inherited from non-optimized, possibly inefficient, operation of the system, for the case in which experimenting online with the system is impossible or costly. Neural networks and kernel smoothing models are employed as architectures for learning the system dynamics. The former require an offline training phase to learn the state equation, whereas the latter exploit the available data directly, which makes the proposed approach directly applicable online and able to exploit newly available data without the need for offline training. Convergence properties of the proposed algorithm for generating the control strategies are provided under suitable hypotheses. Simulation results on classic benchmark systems are reported for performance evaluation, including a comparison with the SARSA reinforcement learning algorithm.
Keywords :
convergence; learning (artificial intelligence); neural nets; smoothing methods; SARSA reinforcement learning algorithm; convergence properties; kernel models; kernel smoothing models; neural models; neural networks; nonoptimized data; offline training phase; performance evaluation; real time control strategies; system dynamics; system operation; unknown state equation; unknown systems control; Approximation methods; Data models; Equations; Kernel; Mathematical model; Trajectory; Vectors
Conference_Title :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
DOI :
10.1109/IJCNN.2014.6889794