• DocumentCode
    3119008
  • Title

    Stochastic Optimal Control with Neural Networks and Application to a Retailer Inventory Problem

  • Author

    Huang, Zhongwu ; Wang, Xiaohua ; Balakrishnan, S.N.

  • Author_Institution
    PhD from the department of mechanical and aerospace engineering, University of Missouri Rolla, Rolla, Mo, 65401, USA (email: huang@umr.edu)
  • fYear
    2005
  • fDate
    12-15 Dec. 2005
  • Firstpage
    4518
  • Lastpage
    4523
  • Abstract
    Overwhelming computational requirements of classical dynamic programming algorithms render them inapplicable to most practical stochastic problems. To overcome this problem a neural network based Dynamic Programming (DP) approach is described in this study. The cost function which is critical in a dynamic programming formulation is approximated by a neural network according to some designed weight-update rule based on Temporal Difference(TD)learning. A Lyapunov based theory is developed to guarantee an upper error bound between the output of the cost neural network and the true cost. We illustrate this approach through a retailer inventory problem.
  • Keywords
    Aerospace engineering; Convergence; Cost function; Dynamic programming; Function approximation; Multi-layer neural network; Neural networks; Optimal control; Process control; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2005 and 2005 European Control Conference. CDC-ECC '05. 44th IEEE Conference on
  • Print_ISBN
    0-7803-9567-0
  • Type

    conf

  • DOI
    10.1109/CDC.2005.1582874
  • Filename
    1582874