DocumentCode :
3500833
Title :
Cascading Decomposition and State Abstractions for Reinforcement Learning
Author :
Chiu, Chung-Cheng ; Soo, Von-Wun
Author_Institution :
Dept. of Comput. Sci., Nat. TsingHua Univ., Hsinchu
fYear :
2008
fDate :
27-31 Oct. 2008
Firstpage :
82
Lastpage :
87
Abstract :
Problem decomposition and state abstractions applied in the hierarchical problem solving often requires manual construction of a hierarchy structure in advance. This work is to provide some automatic algorithms for dimension reduction in problem solving. We propose cascading decomposition algorithm based on the spectral analysis on a normalized graph Laplacian to decompose the problem into several sub-problems and conduct parameter relevance analysis on each sub-problem to perform dynamic state abstraction. In each decomposed sub-problem, only parameters in the projected state space related to its sub-goal are reserved, and identical sub-problems are integrated into one through feature comparison. The whole problem is transformed into a combination of projected sub-problems, and problem solving in the abstracted space is more efficient. The paper demonstrates the performance improvement on reinforcement learning based on the proposed state space decomposition and abstraction methods using a capture-the-flag scenario.
Keywords :
abstracting; learning (artificial intelligence); problem solving; capture-the-flag scenario; cascading decomposition; hierarchical problem solving; normalized graph Laplacian; parameter relevance analysis; problem decomposition; reinforcement learning; spectral analysis; state abstractions; Algorithm design and analysis; Computer science; Frequency; Learning; Performance analysis; Problem-solving; Sampling methods; Spectral analysis; State-space methods; Testing; reinforcement learning; spectral graph theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Artificial Intelligence, 2008. MICAI '08. Seventh Mexican International Conference on
Conference_Location :
Atizapan de Zaragoza
Print_ISBN :
978-0-7695-3441-1
Type :
conf
DOI :
10.1109/MICAI.2008.65
Filename :
4682447
Link To Document :
بازگشت