Title of article :
Boundedness of iterates in Q-Learning
Author/Authors :
Abhijit Gosavi, author
Issue Information :
Monthly, with serial issue number, 2006
Keywords :
Q-learning, boundedness, stochastic control
Journal title :
Systems and Control Letters