DocumentCode
3500833
Title
Cascading Decomposition and State Abstractions for Reinforcement Learning
Author
Chiu, Chung-Cheng ; Soo, Von-Wun
Author_Institution
Dept. of Comput. Sci., Nat. TsingHua Univ., Hsinchu
fYear
2008
fDate
27-31 Oct. 2008
Firstpage
82
Lastpage
87
Abstract
Problem decomposition and state abstractions applied in the hierarchical problem solving often requires manual construction of a hierarchy structure in advance. This work is to provide some automatic algorithms for dimension reduction in problem solving. We propose cascading decomposition algorithm based on the spectral analysis on a normalized graph Laplacian to decompose the problem into several sub-problems and conduct parameter relevance analysis on each sub-problem to perform dynamic state abstraction. In each decomposed sub-problem, only parameters in the projected state space related to its sub-goal are reserved, and identical sub-problems are integrated into one through feature comparison. The whole problem is transformed into a combination of projected sub-problems, and problem solving in the abstracted space is more efficient. The paper demonstrates the performance improvement on reinforcement learning based on the proposed state space decomposition and abstraction methods using a capture-the-flag scenario.
Keywords
abstracting; learning (artificial intelligence); problem solving; capture-the-flag scenario; cascading decomposition; hierarchical problem solving; normalized graph Laplacian; parameter relevance analysis; problem decomposition; reinforcement learning; spectral analysis; state abstractions; Algorithm design and analysis; Computer science; Frequency; Learning; Performance analysis; Problem-solving; Sampling methods; Spectral analysis; State-space methods; Testing; reinforcement learning; spectral graph theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Artificial Intelligence, 2008. MICAI '08. Seventh Mexican International Conference on
Conference_Location
Atizapan de Zaragoza
Print_ISBN
978-0-7695-3441-1
Type
conf
DOI
10.1109/MICAI.2008.65
Filename
4682447
Link To Document