• DocumentCode
    680752
  • Title

    Generating Memoryless Policies Faster Using Automatic Temporal Abstractions for Reinforcement Learning with Hidden State

  • Author

    Cilden, Erkin ; Polat, Faruk

  • Author_Institution
    Dept. of Comput. Eng., Middle East Tech. Univ., Ankara, Turkey
  • fYear
    2013
  • fDate
    4-6 Nov. 2013
  • Firstpage
    719
  • Lastpage
    726
  • Abstract
    Reinforcement learning with eligibility traces has been an effective way to solve problems with hidden state. Under certain conditions, it succeeds to build up a memoryless optimal policy over observations. Automatic generation of temporal abstractions, on the other hand, provides ways to extract and make use of useful sub-policies during reinforcement learning for a fully observable problem setting, so that the agent shall not need to repeatedly learn the same skill. One of the recent automatic abstraction techniques is the extended sequence tree method. We propose a novel way to bring together the extended sequence tree method and reinforcement learning for problems with hidden state. We expand the extended sequence tree method with a mechanism that helps the abstraction procedure to get rid of adverse effects of perceptual aliasing, letting the agent to make use of the remaining useful abstractions. Effectiveness of the method is shown empirically via experimentation on some benchmark problems.
  • Keywords
    learning (artificial intelligence); trees (mathematics); automatic temporal abstractions; eligibility traces; extended sequence tree method; hidden state; memoryless policies; perceptual aliasing; reinforcement learning; Heuristic algorithms; History; Learning (artificial intelligence); Learning systems; Markov processes; Tree data structures; extended sequence tree; hidden state; learning abstractions; reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2013 IEEE 25th International Conference on
  • Conference_Location
    Herndon, VA
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4799-2971-9
  • Type

    conf

  • DOI
    10.1109/ICTAI.2013.111
  • Filename
    6735322