مرکز منطقه ای اطلاع رساني علوم و فناوري - An average-reward reinforcement learning algorithm based on Schweitzer´s Transformation

DocumentCode :

582041

Title :

An average-reward reinforcement learning algorithm based on Schweitzer´s Transformation

Author :

Jianjun, Li ; Jiangong, Ren ; Yanjie, Li

Author_Institution :

Shenzhen Grad. Sch., Harbin Inst. of Technol., Shenzhen, China

fYear :

2012

fDate :

25-27 July 2012

Firstpage :

2966

Lastpage :

2970

Abstract :

In this paper, we propose a relative value iteration reinforcement learning(RVI-RL) algorithm based on Schweitzer´s Transformation for Markov decision processes (MDP) with average reward. An equivalent average reward optimality equation and a new form of action-value function are presented via Schweitzer´s Transformation. Then, combined with the theory of relative value iteration, this RVI-RL algorithm doesn´t only omit the estimation of the average reward in the learning, but also improves the convergence rate. Finally, a simulation experiment for the navigation of autonomous mobile robot is considered, which illustrates the effectiveness and applicability of the algorithm.

Keywords :

Markov processes; convergence of numerical methods; iterative methods; learning (artificial intelligence); mobile robots; path planning; Markov decision processes; RVI-RL algorithm; Schweitzer transformation; action-value function; autonomous mobile robot navigation; average reward optimality equation; average-reward reinforcement learning algorithm; convergence rate improvement; relative value iteration reinforcement learning algorithm; Electronic mail; Equations; Learning; Machine learning; Markov processes; Mathematical model; Optimization; Average reward; Reinforcement Learning; Relative value iteration; Robotic navigation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Control Conference (CCC), 2012 31st Chinese

Conference_Location :

Hefei

ISSN :

1934-1768

Print_ISBN :

978-1-4673-2581-3

Type :

conf

Filename :

6390430

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=582041