مرکز منطقه ای اطلاع رساني علوم و فناوري - State thresholding to accelerate reinforcement learning

DocumentCode :

3590282

Title :

State thresholding to accelerate reinforcement learning

Author :

Sari, Safreni Candra ; Prihatmanto, Ary Setijadi ; Adiprawita, Widyawardana ; Kuspriyanto

Author_Institution :

Dept. of Electr. Eng., Gen. Achmad Yani Univ. (UNJANI), Cimahi, Indonesia

Volume :

fYear :

2014

Firstpage :

Lastpage :

Abstract :

Along with the learning convergence and nonstationary equilibria, important thing to be solved in Reinforcement Learning is the slow-learning problem. In highly dynamic and stochastic systems, Markov Decision Processes is often used to model the situation, and RL is used to produce optimum control values which are expressed by optimum policy. However, approaches to solving MDP´s using RL depend on storing the optimal value function or Q-value function and action models as tables do not scale to large state-spaces. The number of states grow exponentially along with the number of agents, the state and action space, which cause the learning process become very slow since it needs a very large amount of computer memory, and yet it needs more computation than the most computer performance nowadays have offered. This paper addressed curse of dimensionality problem by reducing the states in the MDP iteratively using a novel algorithm called State Thresholding in Reinforcement Learning (STRL). STRL accelerate the learning process and empirically proven to outperformed Q learning algorithm.

Keywords :

Markov processes; convergence of numerical methods; iterative methods; learning (artificial intelligence); Markov decision processes; Q-value function; STRL; action models; dimensionality problem; dynamic systems; learning convergence; nonstationary equilibria; optimal value function; optimum control values; reinforcement learning; slow-learning problem; state reduction; state thresholding; stochastic systems; Acceleration; Computational modeling; Convergence; Learning (artificial intelligence); Markov processes; Navigation; Robots; Markov Decision Process; Reinforcement Learning; gridworld robot navigation; learning acceleration;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

System Engineering and Technology (ICSET), 2014 IEEE 4th International Conference on

Print_ISBN :

978-1-4799-7188-6

Type :

conf

DOI :

10.1109/ICSEngT.2014.7111787

Filename :

7111787

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3590282