مرکز منطقه ای اطلاع رساني علوم و فناوري - Efficient reinforcement learning: model-based Acrobot control

DocumentCode :

2091461

Title :

Efficient reinforcement learning: model-based Acrobot control

Author :

Boone, Gary

Author_Institution :

Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA

Volume :

fYear :

1997

fDate :

20-25 Apr 1997

Firstpage :

229

Abstract :

Several methods have been proposed in the reinforcement learning literature for learning optimal policies for sequential decision tasks. Q-learning is a model-free algorithm that has previously been applied to the Acrobot, a two-link arm with a single actuator at the elbow that learns to swing its free endpoint above a target height. However, applying Q-learning to a real Acrobot may be impractical due to the large number of required movements of the real robot as the controller learns. This paper explores the planning speed and data efficiency of explicitly learning models, as well as using heuristic knowledge to aid the search for solutions and reduce the amount of data required from the real robot

Keywords :

learning (artificial intelligence); manipulators; nonlinear dynamical systems; planning (artificial intelligence); search problems; Q-learning; data efficiency; heuristic knowledge; model-based Acrobot control; optimal policies; planning speed; reinforcement learning; sequential decision tasks; two-link arm; Actuators; Algorithm design and analysis; Control systems; Ear; Educational institutions; Elbow; Learning systems; Optimal control; Real time systems; Robot control;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation, 1997. Proceedings., 1997 IEEE International Conference on

Conference_Location :

Albuquerque, NM

Print_ISBN :

0-7803-3612-7

Type :

conf

DOI :

10.1109/ROBOT.1997.620043

Filename :

620043

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2091461