DocumentCode
424298
Title
Policy gradient fuzzy reinforcement learning
Author
Wang, Xue-ning ; Xu, Xin ; He, Han-gen
Author_Institution
Inst. of Autom., Nat. Univ. of Defence Technol., Changsha, China
Volume
2
fYear
2004
fDate
26-29 Aug. 2004
Firstpage
992
Abstract
This work presents a new approach for tuning conclusions of fuzzy rules based on reinforcement learning. Unlike the most of existing fuzzy reinforcement learning algorithms, which are based on value function, while our approach called policy gradient fuzzy reinforcement learning (PGFRL) bases on gradient estimate. In PGFRL, the algorithm GPOMDP is employed to estimate the performance gradient with respect to the parameters of fuzzy rules. In our work we prove the convergence of fuzzy rules´ parameters to a local optimum given necessary conditions. The experiment results show the effectiveness of PGFRL.
Keywords
fuzzy control; gradient methods; learning (artificial intelligence); fuzzy control; fuzzy rules; gradient estimate; policy gradient fuzzy reinforcement learning; Computational modeling; Control systems; Convergence; Equations; Fuzzy control; Fuzzy neural networks; Fuzzy systems; Helium; Learning; State estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN
0-7803-8403-2
Type
conf
DOI
10.1109/ICMLC.2004.1382332
Filename
1382332
Link To Document