DocumentCode :
480818
Title :
An Information-Theoretic Class of Stochastic Decision Processes
Author :
Iwata, Kazunori
Author_Institution :
Grad. Sch. of Inf. Sci., Hiroshima City Univ., Hiroshima
Volume :
2
fYear :
2008
fDate :
9-12 Dec. 2008
Firstpage :
340
Lastpage :
344
Abstract :
Stochastic decision processes in reinforcement learning are usually formulated as Markov decision processes which are stationary and ergodic. However, in fact, some of the stochastic decision processes are not necessarily Markov, stationary, and/or ergodic. In this paper, using an information-theoretic property, we show a class of stochastic decision processes in reinforcement learning in which return maximization occurs with a positive probability. The class would be useful in considering reinforcement learning applications.
Keywords :
decision theory; learning (artificial intelligence); probability; stochastic processes; information-theoretic property; positive probability; reinforcement learning; stochastic decision processes; Convergence; Intelligent agent; Learning; Stochastic processes; information theory; reinforcement learning; stochastic decision processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
Type :
conf
DOI :
10.1109/WIIAT.2008.32
Filename :
4740646
Link To Document :
بازگشت