Title :
An average-reward reinforcement learning algorithm based on Schweitzer´s Transformation
Author :
Jianjun, Li ; Jiangong, Ren ; Yanjie, Li
Author_Institution :
Shenzhen Grad. Sch., Harbin Inst. of Technol., Shenzhen, China
Abstract :
In this paper, we propose a relative value iteration reinforcement learning(RVI-RL) algorithm based on Schweitzer´s Transformation for Markov decision processes (MDP) with average reward. An equivalent average reward optimality equation and a new form of action-value function are presented via Schweitzer´s Transformation. Then, combined with the theory of relative value iteration, this RVI-RL algorithm doesn´t only omit the estimation of the average reward in the learning, but also improves the convergence rate. Finally, a simulation experiment for the navigation of autonomous mobile robot is considered, which illustrates the effectiveness and applicability of the algorithm.
Keywords :
Markov processes; convergence of numerical methods; iterative methods; learning (artificial intelligence); mobile robots; path planning; Markov decision processes; RVI-RL algorithm; Schweitzer transformation; action-value function; autonomous mobile robot navigation; average reward optimality equation; average-reward reinforcement learning algorithm; convergence rate improvement; relative value iteration reinforcement learning algorithm; Electronic mail; Equations; Learning; Machine learning; Markov processes; Mathematical model; Optimization; Average reward; Reinforcement Learning; Relative value iteration; Robotic navigation;
Conference_Titel :
Control Conference (CCC), 2012 31st Chinese
Conference_Location :
Hefei
Print_ISBN :
978-1-4673-2581-3