مرکز منطقه ای اطلاع رساني علوم و فناوري - Reinforcement learning is direct adaptive optimal control

DocumentCode :

904737

Title :

Reinforcement learning is direct adaptive optimal control

Author :

Sutton, Richard S. ; Barto, Andrew G. ; Williams, Ronald J.

Author_Institution :

GTE Lab. Inc., Waltham, MA, USA

Volume :

Issue :

fYear :

1992

fDate :

4/1/1992 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

Neural network reinforcement learning methods are described and considered as a direct approach to adaptive optimal control of nonlinear systems. These methods have their roots in studies of animal learning and in early learning control work. An emerging deeper understanding of these methods is summarized that is obtained by viewing them as a synthesis of dynamic programming and stochastic approximation methods. The focus is on Q-learning systems, which maintain estimates of utilities for all state-action pairs and make use of these estimates to select actions. The use of hybrid direct/indirect methods is briefly discussed.<>

Keywords :

adaptive control; approximation theory; dynamic programming; learning systems; neural nets; nonlinear control systems; optimal control; Q-learning systems; direct adaptive optimal control; dynamic programming; hybrid direct/indirect methods; neural network reinforcement learning; nonlinear systems; state-action pair estimates; stochastic approximation; Adaptive control; Animals; Control system synthesis; Dynamic programming; Learning; Neural networks; Nonlinear systems; Optimal control; Programmable control; State estimation;

fLanguage :

English

Journal_Title :

Control Systems, IEEE

Publisher :

ieee

ISSN :

1066-033X

Type :

jour

DOI :

10.1109/37.126844

Filename :

126844

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=904737