• DocumentCode
    2423109
  • Title

    Approximate dynamic programming with correlated Bayesian beliefs

  • Author

    Ryzhov, Ilya O. ; Powell, Warren B.

  • Author_Institution
    Dept. of Oper. Res. & Financial Eng., Princeton Univ., Princeton, NJ, USA
  • fYear
    2010
  • fDate
    Sept. 29 2010-Oct. 1 2010
  • Firstpage
    1360
  • Lastpage
    1367
  • Abstract
    In approximate dynamic programming, we can represent our uncertainty about the value function using a Bayesian model with correlated beliefs. Thus, a decision made at a single state can provide us with information about many states, making each individual observation much more powerful. We propose a new exploration strategy based on the knowledge gradient concept from the optimal learning literature, which is currently the only method capable of handling correlated belief structures. The proposed method outperforms several other heuristics in numerical experiments conducted on two broad problem classes.
  • Keywords
    Bayes methods; belief networks; dynamic programming; gradient methods; learning (artificial intelligence); Bayesian model; approximate dynamic programming; correlated Bayesian belief; correlated belief structure; exploration strategy; knowledge gradient concept; optimal learning literature; value function; Bayesian methods; Dynamic programming; Equations; Function approximation; Mathematical model; Tin;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication, Control, and Computing (Allerton), 2010 48th Annual Allerton Conference on
  • Conference_Location
    Allerton, IL
  • Print_ISBN
    978-1-4244-8215-3
  • Type

    conf

  • DOI
    10.1109/ALLERTON.2010.5707072
  • Filename
    5707072