DocumentCode :
466107
Title :
Complex-Valued Reinforcement Learning
Author :
Hamagami, Tomoki ; Shibuya, Takashi ; Shimada, Shingo
Author_Institution :
Yokohama Nat. Univ., Yokohama
Volume :
5
fYear :
2006
fDate :
8-11 Oct. 2006
Firstpage :
4175
Lastpage :
4179
Abstract :
A new reinforcement learning algorithm with complex-valued functions is proposed. The algorithm is inspired by complex-valued neural networks introducing complex numbers representing phase and amplitude into a conventional neural network. The strong advantage of using complex values in reinforcement learning is that the state-action function in a time series can be easily extended. In particular, considering the coherence of each complex value, the proposed learning algorithm can represent the context of agent behavior. This extension allows compensating for the perceptual aliasing problem and provides for the intelligent behavior of mobile robots in the real world. The complex-valued functions are applied to the conventional reinforcement learning algorithms: Q-learning and profit sharing. These algorithms are evaluated by simple maze problems and a bar-carrying task involving perceptual aliasings. Simulation experiments show that the new algorithm can efficiently solve perceptual aliasing.
Keywords :
learning (artificial intelligence); neural nets; time series; Q-learning; complex-valued functions; complex-valued neural networks; complex-valued reinforcement learning; mobile robots; perceptual aliasing problem; profit sharing; reinforcement learning algorithm; state-action function; time series; Control systems; Cybernetics; Intelligent agent; Intelligent robots; Learning; Mobile robots; Multiagent systems; Neural networks; Resonance light scattering; Robot control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
1-4244-0099-6
Electronic_ISBN :
1-4244-0100-3
Type :
conf
DOI :
10.1109/ICSMC.2006.384789
Filename :
4274554
Link To Document :
بازگشت