Title :
Reinforcement learning with knowledge by using a stochastic gradient method on a Bayesian network
Author :
Yamamura, M. ; Onozuka, Takashi
Author_Institution :
Tokyo Inst. of Technol., Japan
Abstract :
For real applications of reinforcement learning, it is necessary to reduce the number of trial-and-errors. The paper proposes a method to use knowledge in reinforcement learning. We have regarded a Bayesian network as a stochastic policy, and adapted a rigid propagation procedure for a stochastic gradient method. We made preliminary experiments to demonstrate our method in a robot navigation task
Keywords :
directed graphs; learning (artificial intelligence); mobile robots; path planning; probability; Bayesian network; reinforcement learning; rigid propagation procedure; robot navigation task; stochastic gradient method; stochastic policy; trial-and-error; Bayesian methods; Data mining; Delay; Gradient methods; Knowledge acquisition; Learning; Navigation; Robots; Stochastic processes; Uncertainty;
Conference_Titel :
Neural Networks Proceedings, 1998. IEEE World Congress on Computational Intelligence. The 1998 IEEE International Joint Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
0-7803-4859-1
DOI :
10.1109/IJCNN.1998.687174