Title :
Model-based learning with Bayesian and MAXQ value function decomposition for hierarchical task
Author :
Dai, Zhaohui ; Chen, Xin ; Cao, Weihua ; Wu, Min
Author_Institution :
Sch. of Inf. Sci. & Eng., Central South Univ., Changsha, China
Abstract :
How to improve efficiency of learning is always the key issue for implementation of reinforcement learning. This paper makes use of advantages of both hierarchical learning and model-based learning, so that an improved algorithm, named Bayesian-MAXQ learning, is introduced, in which several modifications are developed to solve the value update of hierarchy, while the possible performance damages brought by prioritized sweeping is reduced to trivial. The simulation results show that, Bayesian-MAXQ learning performs with high efficiency, and it can serve as a good framework for further study on hierarchical model-based learning.
Keywords :
belief networks; learning (artificial intelligence); Bayesian-MAXQ learning; MAXQ value function decomposition; hierarchical task; model-based learning; reinforcement learning; Bayesian methods; Computational modeling; Dynamic programming; Indexes; Mathematical model; Pediatrics; Robots; Bayesian; MAXQ value function decomposition; prioritized sweeping; reinforcement learning;
Conference_Titel :
Intelligent Control and Automation (WCICA), 2010 8th World Congress on
Conference_Location :
Jinan
Print_ISBN :
978-1-4244-6712-9
DOI :
10.1109/WCICA.2010.5554020