Title :
On-line learning optimal control using successive approximation techniques
Author :
Levine, M.D. ; Vilis, T.
Author_Institution :
McGill University, Montreal, PQ, Canada
fDate :
6/1/1973 12:00:00 AM
Abstract :
The application of learning theory to on-line optimization of unknown or poorly defined plants is discussed. An on-line optimization procedure is achieved by means of a learning algorithm which alters a trainable controller on the basis of an instantaneous performance criterion or subgoal. The subgoal is related to the over-all goal, the integral cost, by means of successive approximations to the Hamilton-Jacobi equation. The resulting piecewise linear controller is implemented by means of an encoder consisting of threshold logic units and a classifier consisting of a set of logic switching functions. The classifier is determined by means of an algorithm developed by Arkadev and Braverman. Features of the learning algorithm are illustrated by minimum-time and minimum-time-fuel problems.
Keywords :
Learning control systems; Optimal control; Costs; DC motors; Gaussian processes; Integral equations; Logic; Optimal control; Piecewise linear approximation; Piecewise linear techniques; Regulators; State feedback;
Journal_Title :
Automatic Control, IEEE Transactions on
DOI :
10.1109/TAC.1973.1100315