• DocumentCode
    582061
  • Title

    A survey of reinforcement learning research and its application for multi-robot systems

  • Author

    Yuequan, Yang ; Lu, Jin ; Zhiqiang, Cao ; Hongru, Tang ; Yang, Xia ; Chunbo, Ni

  • Author_Institution
    Coll. of Inf. Eng., Yangzhou Univ., Yangzhou, China
  • fYear
    2012
  • fDate
    25-27 July 2012
  • Firstpage
    3068
  • Lastpage
    3074
  • Abstract
    Reinforcement learning aims to obtain optimal/suboptimal strategy through trial-and-error and interaction with dynamic environment. After an introduction of basic knowledge of reinforcement learning, TD algorithm, Q-learning algorithm, Dyna algorithm and Sarsa algorithm base on Markov decision model are discussed, respectively. Moreover, reinforcement learning based on partially observable Markov decision process and semi-Markov decision model for uncertain environment are analyzed, respectively. The research status of Q learning in the field of multi-robot systems is also presented. Finally, the main challenges and further research work are given.
  • Keywords
    Markov processes; learning (artificial intelligence); multi-robot systems; uncertain systems; Dyna algorithm; Q-learning algorithm; Sarsa algorithm; TD algorithm; dynamic environment interaction; multirobot systems; partially observable Markov decision process; reinforcement learning research; semiMarkov decision model; suboptimal strategy; trial-and-error; uncertain environment; Educational institutions; Heuristic algorithms; Laboratories; Learning; Markov processes; Multirobot systems; Nickel; Markov Decision; Multi-robot System; Reinforcement Learning; Semi-Markov Decision;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Conference (CCC), 2012 31st Chinese
  • Conference_Location
    Hefei
  • ISSN
    1934-1768
  • Print_ISBN
    978-1-4673-2581-3
  • Type

    conf

  • Filename
    6390450