Title :
Approximate dynamic programming with correlated Bayesian beliefs
Author :
Ryzhov, Ilya O. ; Powell, Warren B.
Author_Institution :
Dept. of Oper. Res. & Financial Eng., Princeton Univ., Princeton, NJ, USA
fDate :
Sept. 29 2010-Oct. 1 2010
Abstract :
In approximate dynamic programming, we can represent our uncertainty about the value function using a Bayesian model with correlated beliefs. Thus, a decision made at a single state can provide us with information about many states, making each individual observation much more powerful. We propose a new exploration strategy based on the knowledge gradient concept from the optimal learning literature, which is currently the only method capable of handling correlated belief structures. The proposed method outperforms several other heuristics in numerical experiments conducted on two broad problem classes.
Keywords :
Bayes methods; belief networks; dynamic programming; gradient methods; learning (artificial intelligence); Bayesian model; approximate dynamic programming; correlated Bayesian belief; correlated belief structure; exploration strategy; knowledge gradient concept; optimal learning literature; value function; Bayesian methods; Dynamic programming; Equations; Function approximation; Mathematical model; Tin;
Conference_Titel :
Communication, Control, and Computing (Allerton), 2010 48th Annual Allerton Conference on
Conference_Location :
Allerton, IL
Print_ISBN :
978-1-4244-8215-3
DOI :
10.1109/ALLERTON.2010.5707072