DocumentCode :
1269654
Title :
Improving POMDP Tractability via Belief Compression and Clustering
Author :
Li, Xin ; Cheung, William K. ; Liu, Jiming
Author_Institution :
Dept. of Comput. Sci., Hong Kong Baptist Univ., Hong Kong, China
Volume :
40
Issue :
1
fYear :
2010
Firstpage :
125
Lastpage :
136
Abstract :
Partially observable Markov decision process (POMDP) is a commonly adopted mathematical framework for solving planning problems in stochastic environments. However, computing the optimal policy of POMDP for large-scale problems is known to be intractable, where the high dimensionality of the underlying belief space is one of the major causes. In this paper, we propose a hybrid approach that integrates two different approaches for reducing the dimensionality of the belief space: 1) belief compression and 2) value-directed compression. In particular, a novel orthogonal nonnegative matrix factorization is derived for the belief compression, which is then integrated in a value-directed framework for computing the policy. In addition, with the conjecture that a properly partitioned belief space can have its per-cluster intrinsic dimension further reduced, we propose to apply a k-means-like clustering technique to partition the belief space to form a set of sub-POMDPs before applying the dimension reduction techniques to each of them. We have evaluated the proposed belief compression and clustering approaches based on a set of benchmark problems and demonstrated their effectiveness in reducing the cost for computing policies, with the quality of the policies being retained.
Keywords :
Markov processes; belief maintenance; decision making; matrix decomposition; pattern clustering; belief clustering; belief compression; dimension reduction technique; k-means-like clustering technique; orthogonal nonnegative matrix factorization; partially observable Markov decision process; stochastic environment; value-directed compression; Belief clustering; belief compression; nonnegative matrix factorization (NMF); partially observable Markov decision process (POMDP);
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2009.2021573
Filename :
5184876
Link To Document :
بازگشت