Title :
An Information-Theoretic Class of Stochastic Decision Processes
Author_Institution :
Grad. Sch. of Inf. Sci., Hiroshima City Univ., Hiroshima
Abstract :
Stochastic decision processes in reinforcement learning are usually formulated as Markov decision processes which are stationary and ergodic. However, in fact, some of the stochastic decision processes are not necessarily Markov, stationary, and/or ergodic. In this paper, using an information-theoretic property, we show a class of stochastic decision processes in reinforcement learning in which return maximization occurs with a positive probability. The class would be useful in considering reinforcement learning applications.
Keywords :
decision theory; learning (artificial intelligence); probability; stochastic processes; information-theoretic property; positive probability; reinforcement learning; stochastic decision processes; Convergence; Intelligent agent; Learning; Stochastic processes; information theory; reinforcement learning; stochastic decision processes;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
DOI :
10.1109/WIIAT.2008.32