مرکز منطقه ای اطلاع رساني علوم و فناوري - A survey of reinforcement learning research and its application for multi-robot systems

DocumentCode :

582061

Title :

A survey of reinforcement learning research and its application for multi-robot systems

Author :

Yuequan, Yang ; Lu, Jin ; Zhiqiang, Cao ; Hongru, Tang ; Yang, Xia ; Chunbo, Ni

Author_Institution :

Coll. of Inf. Eng., Yangzhou Univ., Yangzhou, China

fYear :

2012

fDate :

25-27 July 2012

Firstpage :

3068

Lastpage :

3074

Abstract :

Reinforcement learning aims to obtain optimal/suboptimal strategy through trial-and-error and interaction with dynamic environment. After an introduction of basic knowledge of reinforcement learning, TD algorithm, Q-learning algorithm, Dyna algorithm and Sarsa algorithm base on Markov decision model are discussed, respectively. Moreover, reinforcement learning based on partially observable Markov decision process and semi-Markov decision model for uncertain environment are analyzed, respectively. The research status of Q learning in the field of multi-robot systems is also presented. Finally, the main challenges and further research work are given.

Keywords :

Markov processes; learning (artificial intelligence); multi-robot systems; uncertain systems; Dyna algorithm; Q-learning algorithm; Sarsa algorithm; TD algorithm; dynamic environment interaction; multirobot systems; partially observable Markov decision process; reinforcement learning research; semiMarkov decision model; suboptimal strategy; trial-and-error; uncertain environment; Educational institutions; Heuristic algorithms; Laboratories; Learning; Markov processes; Multirobot systems; Nickel; Markov Decision; Multi-robot System; Reinforcement Learning; Semi-Markov Decision;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Control Conference (CCC), 2012 31st Chinese

Conference_Location :

Hefei

ISSN :

1934-1768

Print_ISBN :

978-1-4673-2581-3

Type :

conf

Filename :

6390450

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=582061