مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

466107

Title :

Complex-Valued Reinforcement Learning

Author :

Hamagami, Tomoki ; Shibuya, Takashi ; Shimada, Shingo

Author_Institution :

Yokohama Nat. Univ., Yokohama

Volume :

fYear :

2006

fDate :

8-11 Oct. 2006

Firstpage :

4175

Lastpage :

4179

Abstract :

A new reinforcement learning algorithm with complex-valued functions is proposed. The algorithm is inspired by complex-valued neural networks introducing complex numbers representing phase and amplitude into a conventional neural network. The strong advantage of using complex values in reinforcement learning is that the state-action function in a time series can be easily extended. In particular, considering the coherence of each complex value, the proposed learning algorithm can represent the context of agent behavior. This extension allows compensating for the perceptual aliasing problem and provides for the intelligent behavior of mobile robots in the real world. The complex-valued functions are applied to the conventional reinforcement learning algorithms: Q-learning and profit sharing. These algorithms are evaluated by simple maze problems and a bar-carrying task involving perceptual aliasings. Simulation experiments show that the new algorithm can efficiently solve perceptual aliasing.

Keywords :

learning (artificial intelligence); neural nets; time series; Q-learning; complex-valued functions; complex-valued neural networks; complex-valued reinforcement learning; mobile robots; perceptual aliasing problem; profit sharing; reinforcement learning algorithm; state-action function; time series; Control systems; Cybernetics; Intelligent agent; Intelligent robots; Learning; Mobile robots; Multiagent systems; Neural networks; Resonance light scattering; Robot control;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on

Conference_Location :

Taipei

Print_ISBN :

1-4244-0099-6

Electronic_ISBN :

1-4244-0100-3

Type :

conf

DOI :

10.1109/ICSMC.2006.384789

Filename :

4274554

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=466107