• DocumentCode
    424298
  • Title

    Policy gradient fuzzy reinforcement learning

  • Author

    Wang, Xue-ning ; Xu, Xin ; He, Han-gen

  • Author_Institution
    Inst. of Autom., Nat. Univ. of Defence Technol., Changsha, China
  • Volume
    2
  • fYear
    2004
  • fDate
    26-29 Aug. 2004
  • Firstpage
    992
  • Abstract
    This work presents a new approach for tuning conclusions of fuzzy rules based on reinforcement learning. Unlike the most of existing fuzzy reinforcement learning algorithms, which are based on value function, while our approach called policy gradient fuzzy reinforcement learning (PGFRL) bases on gradient estimate. In PGFRL, the algorithm GPOMDP is employed to estimate the performance gradient with respect to the parameters of fuzzy rules. In our work we prove the convergence of fuzzy rules´ parameters to a local optimum given necessary conditions. The experiment results show the effectiveness of PGFRL.
  • Keywords
    fuzzy control; gradient methods; learning (artificial intelligence); fuzzy control; fuzzy rules; gradient estimate; policy gradient fuzzy reinforcement learning; Computational modeling; Control systems; Convergence; Equations; Fuzzy control; Fuzzy neural networks; Fuzzy systems; Helium; Learning; State estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
  • Print_ISBN
    0-7803-8403-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2004.1382332
  • Filename
    1382332