DocumentCode :
680752
Title :
Generating Memoryless Policies Faster Using Automatic Temporal Abstractions for Reinforcement Learning with Hidden State
Author :
Cilden, Erkin ; Polat, Faruk
Author_Institution :
Dept. of Comput. Eng., Middle East Tech. Univ., Ankara, Turkey
fYear :
2013
fDate :
4-6 Nov. 2013
Firstpage :
719
Lastpage :
726
Abstract :
Reinforcement learning with eligibility traces has been an effective way to solve problems with hidden state. Under certain conditions, it succeeds to build up a memoryless optimal policy over observations. Automatic generation of temporal abstractions, on the other hand, provides ways to extract and make use of useful sub-policies during reinforcement learning for a fully observable problem setting, so that the agent shall not need to repeatedly learn the same skill. One of the recent automatic abstraction techniques is the extended sequence tree method. We propose a novel way to bring together the extended sequence tree method and reinforcement learning for problems with hidden state. We expand the extended sequence tree method with a mechanism that helps the abstraction procedure to get rid of adverse effects of perceptual aliasing, letting the agent to make use of the remaining useful abstractions. Effectiveness of the method is shown empirically via experimentation on some benchmark problems.
Keywords :
learning (artificial intelligence); trees (mathematics); automatic temporal abstractions; eligibility traces; extended sequence tree method; hidden state; memoryless policies; perceptual aliasing; reinforcement learning; Heuristic algorithms; History; Learning (artificial intelligence); Learning systems; Markov processes; Tree data structures; extended sequence tree; hidden state; learning abstractions; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2013 IEEE 25th International Conference on
Conference_Location :
Herndon, VA
ISSN :
1082-3409
Print_ISBN :
978-1-4799-2971-9
Type :
conf
DOI :
10.1109/ICTAI.2013.111
Filename :
6735322
Link To Document :
بازگشت