Accelerated Q-learning for fail state and action spaces

Author

Park, In-Won ; Kim, Jong-Hwan ; Park, Kui-Hong

Author_Institution

Dept. of Electr. Eng. & Comput. Sci., KAIST, Daejeon

fYear

2008

fDate

12-15 Oct. 2008

Firstpage

763

Lastpage

767

Abstract

Accelerated Q-learning algorithm is proposed for environment having both goal and fail states. It extends Q-learning, a well-known scheme in reinforcement learning. Unlike this conventional Q-learning, the proposed algorithm keeps track of the past failure experiences as a separate fail state-action value, Q_F. Agent uses this value along with a goal state-action value, Q_N, which is calculated and updated using conventional Q-learning, to modify the exploratory behavior during learning phase. Effectiveness of the proposed accelerated Q-learning algorithm is verified in a grid world environment. The proposed algorithm significantly reduces a convergence speed to find out the optimal path from start state to goal state while maximizing its receiving rewards.

Keywords

grid computing; learning (artificial intelligence); accelerated Q-learning; fail state spaces; grid world environment; reinforcement learning; Acceleration; Computer science; Convergence; Humanoid robots; Humans; Machine learning algorithms; Negative feedback; Space technology; Telecommunications; Thin film transistors; Accelerated Q-learning algorithm; success and failure experiences of the agent;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on

Conference_Location

Singapore

ISSN

1062-922X

Print_ISBN

978-1-4244-2383-5

Electronic_ISBN

1062-922X

Type

conf

DOI

10.1109/ICSMC.2008.4811370

Filename

4811370