DocumentCode
2423109
Title
Approximate dynamic programming with correlated Bayesian beliefs
Author
Ryzhov, Ilya O. ; Powell, Warren B.
Author_Institution
Dept. of Oper. Res. & Financial Eng., Princeton Univ., Princeton, NJ, USA
fYear
2010
fDate
Sept. 29 2010-Oct. 1 2010
Firstpage
1360
Lastpage
1367
Abstract
In approximate dynamic programming, we can represent our uncertainty about the value function using a Bayesian model with correlated beliefs. Thus, a decision made at a single state can provide us with information about many states, making each individual observation much more powerful. We propose a new exploration strategy based on the knowledge gradient concept from the optimal learning literature, which is currently the only method capable of handling correlated belief structures. The proposed method outperforms several other heuristics in numerical experiments conducted on two broad problem classes.
Keywords
Bayes methods; belief networks; dynamic programming; gradient methods; learning (artificial intelligence); Bayesian model; approximate dynamic programming; correlated Bayesian belief; correlated belief structure; exploration strategy; knowledge gradient concept; optimal learning literature; value function; Bayesian methods; Dynamic programming; Equations; Function approximation; Mathematical model; Tin;
fLanguage
English
Publisher
ieee
Conference_Titel
Communication, Control, and Computing (Allerton), 2010 48th Annual Allerton Conference on
Conference_Location
Allerton, IL
Print_ISBN
978-1-4244-8215-3
Type
conf
DOI
10.1109/ALLERTON.2010.5707072
Filename
5707072
Link To Document