DocumentCode :
3652069
Title :
Profit Sharing That Can Learn Deterministic Policy for POMDPs Environments by Kohonen Feature Map Associative Memory
Author :
Daichi Koma;Yuko Osana
Author_Institution :
Sch. of Comput. Sci., Tokyo Univ. of Technol., Tokyo, Japan
fYear :
2013
Firstpage :
2651
Lastpage :
2658
Abstract :
In this paper, we propose a Profit Sharing that can learn deterministic policy for POMDPs environments by Kohonen Feature Map (KFM) associative memory. The proposed method is based on the conventional Profit Sharing that can learn deterministic policy for POMDPs environments which can obtain the deterministic policy by using the history of observations and the variable-sized Kohonen Feature Map Probabilistic Associative Memory (for short, KFM associative memory). In the conventional Profit Sharing that can learn deterministic policy for POMDPs environments, robustness for noise does not guaranteed. In the proposed method, rules which are obtained in the Profit Sharing that can learn deterministic policy for POMDPs environments are trained by the KFM associative memory and appropriate action selection is realized in noisy environment. We carried out a series of computer experiments, and confirmed that the proposed method can learn rules for deterministic action selection as similar as the conventional Profit Sharing that can learn deterministic policy for POMDPs environments, and appropriate action selection is realized in noisy environment.
Keywords :
"Neurons","Associative memory","Vectors","Probabilistic logic","Educational institutions","History","Noise measurement"
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2013 IEEE International Conference on
ISSN :
1062-922X
Type :
conf
DOI :
10.1109/SMC.2013.453
Filename :
6722206
Link To Document :
بازگشت