DocumentCode :
3590282
Title :
State thresholding to accelerate reinforcement learning
Author :
Sari, Safreni Candra ; Prihatmanto, Ary Setijadi ; Adiprawita, Widyawardana ; Kuspriyanto
Author_Institution :
Dept. of Electr. Eng., Gen. Achmad Yani Univ. (UNJANI), Cimahi, Indonesia
Volume :
4
fYear :
2014
Firstpage :
1
Lastpage :
6
Abstract :
Along with the learning convergence and nonstationary equilibria, important thing to be solved in Reinforcement Learning is the slow-learning problem. In highly dynamic and stochastic systems, Markov Decision Processes is often used to model the situation, and RL is used to produce optimum control values which are expressed by optimum policy. However, approaches to solving MDP´s using RL depend on storing the optimal value function or Q-value function and action models as tables do not scale to large state-spaces. The number of states grow exponentially along with the number of agents, the state and action space, which cause the learning process become very slow since it needs a very large amount of computer memory, and yet it needs more computation than the most computer performance nowadays have offered. This paper addressed curse of dimensionality problem by reducing the states in the MDP iteratively using a novel algorithm called State Thresholding in Reinforcement Learning (STRL). STRL accelerate the learning process and empirically proven to outperformed Q learning algorithm.
Keywords :
Markov processes; convergence of numerical methods; iterative methods; learning (artificial intelligence); Markov decision processes; Q-value function; STRL; action models; dimensionality problem; dynamic systems; learning convergence; nonstationary equilibria; optimal value function; optimum control values; reinforcement learning; slow-learning problem; state reduction; state thresholding; stochastic systems; Acceleration; Computational modeling; Convergence; Learning (artificial intelligence); Markov processes; Navigation; Robots; Markov Decision Process; Reinforcement Learning; gridworld robot navigation; learning acceleration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Engineering and Technology (ICSET), 2014 IEEE 4th International Conference on
Print_ISBN :
978-1-4799-7188-6
Type :
conf
DOI :
10.1109/ICSEngT.2014.7111787
Filename :
7111787
Link To Document :
بازگشت