مرکز منطقه ای اطلاع رساني علوم و فناوري - Q-Learning with Continuous State Spaces and Finite Decision Set

DocumentCode :

2717800

Title :

Q-Learning with Continuous State Spaces and Finite Decision Set

Author :

Barty, Kengy ; Girardeau, Pierre ; Roy, Jean-Sébastien ; Strugarek, Cyrille

Author_Institution :

Electricite de France R&D, Clamart

fYear :

2007

fDate :

1-5 April 2007

Firstpage :

346

Lastpage :

351

Abstract :

This paper aims to present an original technique in order to compute the optimal policy of a Markov decision problem with continuous state space and discrete decision variables. We propose an extension of the Q-learning algorithm introduced in 1989 by Watkins for discrete Markov decision problems. Our algorithm relies on stochastic approximation and functional estimation, and uses kernels to locally update the Q-functions. We state under mild assumptions a converge theorem for this algorithm. Finally, we illustrate our algorithm by solving two classical problems: the mountain car task and the puddle world task

Keywords :

Markov processes; learning (artificial intelligence); Markov decision problem; Q-functions; Q-learning; continuous state spaces; discrete decision variables; finite decision set; functional estimation; mountain car task; puddle world task; stochastic approximation; Approximation algorithms; Costs; Dynamic programming; Kernel; Learning; Random variables; Recursive estimation; State-space methods; Stochastic processes; Uncertainty;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on

Conference_Location :

Honolulu, HI

Print_ISBN :

1-4244-0706-0

Type :

conf

DOI :

10.1109/ADPRL.2007.368209

Filename :

4220854

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2717800