Regional Information Center for Science and Technology (RICeST) - Reinforcement learning in continuous time: advantage updating

DocumentCode :

2437796

Title :

Reinforcement learning in continuous time: advantage updating

Author :

Baird, Leemon C., III

Author_Institution :

Wright Lab., Wright-Patterson AFB, OH, USA

Volume :

fYear :

1994

fDate :

27 Jun-2 Jul 1994

Firstpage :

2448

Abstract :

A new algorithm for reinforcement learning, advantage updating, is described. Advantage updating is a direct learning technique; it does not require a model to be given or learned. It is incremental, requiring only a constant amount of calculation per time step, independent of the number of possible actions, possible outcomes from a given action, or number of states. Analysis and simulation indicate that advantage updating is applicable to reinforcement learning systems working in continuous time (or discrete time with small time steps) for which standard algorithms such as Q-learning are not applicable. Simulation results are presented indicating that for a simple linear quadratic regulator (LQR) problem, advantage updating learns more quickly than Q-learning by a factor of 100,000 when the time step is small. Even for large time steps, advantage updating is never slower than Q-learning, and advantage updating is more resistant to noise than is Q-learning. Convergence properties are discussed. It is proved that the learning rule for advantage updating converges to the optimal policy with probability one.
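
The abstract's remark about small time steps can be illustrated with a minimal sketch. It is not taken from the paper: the toy problem (dynamics dx/dt = u, cost rate x^2 + u^2, no discounting, optimal cost-to-go V(x) = x^2), the first-order approximation of the step cost, and the function names are all assumptions made for illustration. The sketch suggests that the gap between the Q-values of two actions shrinks roughly in proportion to the time step, while the same gap divided by the time step stays of order one.

```python
# Illustrative sketch only; not the paper's algorithm or experiment.
# Assumed toy problem: dx/dt = u, cost rate x**2 + u**2, no discounting,
# for which the optimal cost-to-go is V(x) = x**2.


def optimal_cost_to_go(x):
    """Optimal cost-to-go of the assumed toy problem."""
    return x * x


def q_value(x, u, dt):
    """First-order approximation: cost of holding u for dt, then acting optimally."""
    return (x * x + u * u) * dt + optimal_cost_to_go(x + u * dt)


def action_gaps(x, actions, dt):
    """Return (raw Q gap, gap divided by dt) between the worst and best action."""
    values = [q_value(x, u, dt) for u in actions]
    gap = max(values) - min(values)
    return gap, gap / dt


if __name__ == "__main__":
    # Compare the optimal action u = -x with doing nothing, at x = 1.
    for dt in (1e-1, 1e-2, 1e-3):
        gap, scaled = action_gaps(1.0, [-1.0, 0.0], dt)
        print(f"dt={dt:g}  Q gap={gap:.6f}  gap/dt={scaled:.6f}")
```

In this toy setting the Q-values themselves stay near 1 while their differences fall toward zero, so small approximation errors can swamp the information about which action is better; rescaling by the time step is one way to see why a learner that represents relative action values directly might be less sensitive to this.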

Keywords :

continuous time systems; intelligent control; learning (artificial intelligence); learning systems; linear quadratic control; neural nets; Q-learning; advantage updating; continuous time system; convergence; learning rule; linear quadratic regulator; probability; reinforcement learning; Aerospace electronics; Algorithm design and analysis; Analytical models; Control systems; Cost function; Learning; Optimal control; Regulators;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on

Conference_Location :

Orlando, FL

Print_ISBN :

0-7803-1901-X

Type :

conf

DOI :

10.1109/ICNN.1994.374604

Filename :

374604

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2437796