Title :
A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
Author :
Berenji, Hamid R. ; Vengerov, David
Author_Institution :
Comput. Sci. Div., Intelligent Inference Syst. Corp., Mountain View, CA, USA
Abstract :
This paper provides the first convergence proof for fuzzy reinforcement learning (FRL) as well as experimental results supporting our analysis. We extend the work of Konda and Tsitsiklis, who presented a convergent actor-critic (AC) algorithm for a general parameterized actor. In our work we prove that a fuzzy rulebase actor satisfies the necessary conditions that guarantee the convergence of its parameters to a local optimum. Our fuzzy rulebase uses Takagi-Sugeno-Kang rules, Gaussian membership functions, and product inference. As an application domain, we chose the difficult task of power control in wireless transmitters, which is characterized by delayed rewards and a high degree of stochasticity. To the best of our knowledge, no reinforcement learning algorithms have previously been applied to this task. Our simulation results show that the ACFRL algorithm consistently converges in this domain to a locally optimal policy.
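The actor described in the abstract combines Takagi-Sugeno-Kang rules, Gaussian membership functions, and product inference. A minimal sketch of such a rulebase (zero-order TSK, with hypothetical parameter names; the paper's actual actor parameterization may differ) could look like:

```python
import numpy as np

def tsk_actor(x, centers, widths, consequents):
    """Sketch of a zero-order TSK fuzzy rulebase actor.

    x           : (n_inputs,) state vector
    centers     : (n_rules, n_inputs) Gaussian membership centers
    widths      : (n_rules, n_inputs) Gaussian membership widths
    consequents : (n_rules,) rule outputs -- the tunable actor parameters
    """
    # Gaussian membership degree of each input under each rule's antecedent
    mu = np.exp(-((x - centers) ** 2) / (2.0 * widths ** 2))
    # Product inference: a rule's firing strength is the product of its
    # membership degrees across input dimensions
    w = np.prod(mu, axis=1)
    # Output is the firing-strength-weighted average of rule consequents
    return float(np.dot(w, consequents) / np.sum(w))
```

In an actor-critic scheme of the Konda-Tsitsiklis type, the critic's temporal-difference estimates would drive gradient updates of `consequents` (and possibly the membership parameters), which is the parameter vector whose convergence to a local optimum the paper establishes.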
Keywords :
fuzzy logic; learning (artificial intelligence); neurocontrollers; power control; FRL; convergence; fuzzy reinforcement learning; reinforcement learning; wireless transmitters; Algorithm design and analysis; Convergence; Delay; Energy management; Fuzzy control; Inference algorithms; Learning; NASA; Power control; Transmitters;
Journal_Title :
Fuzzy Systems, IEEE Transactions on
DOI :
10.1109/TFUZZ.2003.814834