مرکز منطقه ای اطلاع رساني علوم و فناوري - Approximate dynamic programming with correlated Bayesian beliefs

DocumentCode :

2423109

Title :

Approximate dynamic programming with correlated Bayesian beliefs

Author :

Ryzhov, Ilya O. ; Powell, Warren B.

Author_Institution :

Dept. of Oper. Res. & Financial Eng., Princeton Univ., Princeton, NJ, USA

fYear :

2010

fDate :

Sept. 29 2010-Oct. 1 2010

Firstpage :

1360

Lastpage :

1367

Abstract :

In approximate dynamic programming, we can represent our uncertainty about the value function using a Bayesian model with correlated beliefs. Thus, a decision made at a single state can provide us with information about many states, making each individual observation much more powerful. We propose a new exploration strategy based on the knowledge gradient concept from the optimal learning literature, which is currently the only method capable of handling correlated belief structures. The proposed method outperforms several other heuristics in numerical experiments conducted on two broad problem classes.

Keywords :

Bayes methods; belief networks; dynamic programming; gradient methods; learning (artificial intelligence); Bayesian model; approximate dynamic programming; correlated Bayesian belief; correlated belief structure; exploration strategy; knowledge gradient concept; optimal learning literature; value function; Bayesian methods; Dynamic programming; Equations; Function approximation; Mathematical model; Tin;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Communication, Control, and Computing (Allerton), 2010 48th Annual Allerton Conference on

Conference_Location :

Allerton, IL

Print_ISBN :

978-1-4244-8215-3

Type :

conf

DOI :

10.1109/ALLERTON.2010.5707072

Filename :

5707072

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2423109