• DocumentCode
    1925590
  • Title

    Reduction of temporal complexity in Markov decision processes

  • Author

    Garcia-Hernandez, M.G. ; Ruiz-Pinales, J. ; Ledesma-Orozco, S. ; Avina-Cervantes, G.

  • Author_Institution
    Univ. de Guanajuato, Salamanca, Mexico
  • fYear
    2012
  • fDate
    27-29 Feb. 2012
  • Firstpage
    311
  • Lastpage
    316
  • Abstract
    In this paper we propose a new algorithm in order to reduce temporal complexity in Markov decision processes. Value iteration is a classical algorithm for solving them, but this algorithm and its variants are quite slow for discount factors close to one and their convergence properties depend to a great extent on a good state update order. It has been shown that improved topological value iteration presents a good convergence speed thanks to the use of an improved topological ordering. Nevertheless, its drawback is due to high memory requirements. So, our algorithm obtains the optimal state backup order with less memory requirements. Experimental results on stochastic shortest-path problems (highly cyclic) are presented. Our approach obtained a considerable reduction in temporal complexity with respect to other variants of value iteration. For instance, the experiments showed in one test a reduction of six times with respect to asynchronous value iteration.
  • Keywords
    Markov processes; back-up procedures; computational complexity; convergence of numerical methods; iterative methods; storage management; Markov decision process; asynchronous value iteration; convergence properties; memory requirements; optimal state backup order; stochastic shortest-path problems; temporal complexity; topological value iteration; Acceleration; Complexity theory; Convergence; Equations; Markov processes; Memory management; Dijkstra algorithm; Markov decision processes; priority queues;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Communications and Computers (CONIELECOMP), 2012 22nd International Conference on
  • Conference_Location
    Cholula, Puebla
  • Print_ISBN
    978-1-4577-1326-2
  • Type

    conf

  • DOI
    10.1109/CONIELECOMP.2012.6189930
  • Filename
    6189930