DocumentCode
680752
Title
Generating Memoryless Policies Faster Using Automatic Temporal Abstractions for Reinforcement Learning with Hidden State
Author
Cilden, Erkin ; Polat, Faruk
Author_Institution
Dept. of Comput. Eng., Middle East Tech. Univ., Ankara, Turkey
fYear
2013
fDate
4-6 Nov. 2013
Firstpage
719
Lastpage
726
Abstract
Reinforcement learning with eligibility traces has been an effective way to solve problems with hidden state. Under certain conditions, it succeeds to build up a memoryless optimal policy over observations. Automatic generation of temporal abstractions, on the other hand, provides ways to extract and make use of useful sub-policies during reinforcement learning for a fully observable problem setting, so that the agent shall not need to repeatedly learn the same skill. One of the recent automatic abstraction techniques is the extended sequence tree method. We propose a novel way to bring together the extended sequence tree method and reinforcement learning for problems with hidden state. We expand the extended sequence tree method with a mechanism that helps the abstraction procedure to get rid of adverse effects of perceptual aliasing, letting the agent to make use of the remaining useful abstractions. Effectiveness of the method is shown empirically via experimentation on some benchmark problems.
Keywords
learning (artificial intelligence); trees (mathematics); automatic temporal abstractions; eligibility traces; extended sequence tree method; hidden state; memoryless policies; perceptual aliasing; reinforcement learning; Heuristic algorithms; History; Learning (artificial intelligence); Learning systems; Markov processes; Tree data structures; extended sequence tree; hidden state; learning abstractions; reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence (ICTAI), 2013 IEEE 25th International Conference on
Conference_Location
Herndon, VA
ISSN
1082-3409
Print_ISBN
978-1-4799-2971-9
Type
conf
DOI
10.1109/ICTAI.2013.111
Filename
6735322
Link To Document