• DocumentCode
    3021606
  • Title

    A game-theoretic procedure for learning hierarchically structured strategies

  • Author

    Rosman, Benjamin ; Ramamoorthy, Subramanian

  • Author_Institution
    Sch. of Inf., Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2010
  • fDate
    3-7 May 2010
  • Firstpage
    2977
  • Lastpage
    2983
  • Abstract
    This paper addresses the problem of acquiring a hierarchically structured robotic skill in a nonstationary environment. This is achieved through a combination of learning primitive strategies from observation of an expert, and autonomously synthesising composite strategies from that basis. Both aspects of this problem are approached from a game theoretic viewpoint, building on prior work in the area of multiplicative weights learning algorithms. The utility of this procedure is demonstrated through simulation experiments motivated by the problem of autonomous driving. We show that this procedure allows the agent to come to terms with two forms of uncertainty in the world - continually varying goals (due to oncoming traffic) and nonstationarity of optimisation criteria (e.g., driven by changing navigability of the road). We argue that this type of factored task specification and learning is a necessary ingredient for robust autonomous behaviour in a “large-world” setting.
  • Keywords
    game theory; learning (artificial intelligence); game-theoretic procedure; hierarchically structured strategies; primitive strategies learning; robust autonomous behaviour; weights learning algorithms; Buildings; Game theory; Hidden Markov models; Informatics; Learning; Noise robustness; Robotics and automation; Traffic control; USA Councils; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation (ICRA), 2010 IEEE International Conference on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1050-4729
  • Print_ISBN
    978-1-4244-5038-1
  • Electronic_ISBN
    1050-4729
  • Type

    conf

  • DOI
    10.1109/ROBOT.2010.5509632
  • Filename
    5509632