Title :
A survey of reinforcement learning research and its application for multi-robot systems
Author :
Yuequan, Yang ; Lu, Jin ; Zhiqiang, Cao ; Hongru, Tang ; Yang, Xia ; Chunbo, Ni
Author_Institution :
Coll. of Inf. Eng., Yangzhou Univ., Yangzhou, China
Abstract :
Reinforcement learning aims to obtain optimal/suboptimal strategy through trial-and-error and interaction with dynamic environment. After an introduction of basic knowledge of reinforcement learning, TD algorithm, Q-learning algorithm, Dyna algorithm and Sarsa algorithm base on Markov decision model are discussed, respectively. Moreover, reinforcement learning based on partially observable Markov decision process and semi-Markov decision model for uncertain environment are analyzed, respectively. The research status of Q learning in the field of multi-robot systems is also presented. Finally, the main challenges and further research work are given.
Keywords :
Markov processes; learning (artificial intelligence); multi-robot systems; uncertain systems; Dyna algorithm; Q-learning algorithm; Sarsa algorithm; TD algorithm; dynamic environment interaction; multirobot systems; partially observable Markov decision process; reinforcement learning research; semiMarkov decision model; suboptimal strategy; trial-and-error; uncertain environment; Educational institutions; Heuristic algorithms; Laboratories; Learning; Markov processes; Multirobot systems; Nickel; Markov Decision; Multi-robot System; Reinforcement Learning; Semi-Markov Decision;
Conference_Titel :
Control Conference (CCC), 2012 31st Chinese
Conference_Location :
Hefei
Print_ISBN :
978-1-4673-2581-3