• DocumentCode
    1743635
  • Title

    A learning algorithm for Markov decision processes with adaptive state aggregation

  • Author

    Baras, J.S. ; Borkar, V.S.

  • Author_Institution
    Inst. for Syst. Res., Maryland Univ., College Park, MD, USA
  • Volume
    4
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    3351
  • Abstract
    We propose a simulation-based algorithm for learning good policies for a Markov decision process with unknown transition law, with aggregated states. The state aggregation itself can be adapted on a slower time scale by an auxiliary learning algorithm. Rigorous justifications are provided for both algorithms
  • Keywords
    Markov processes; adaptive systems; decision theory; learning (artificial intelligence); stochastic systems; Markov decision processes; adaptive state aggregation; learning algorithm; state aggregation; unknown transition law; Algorithm design and analysis; Clustering algorithms; Communication system control; Computational modeling; Computer science; Data compression; Educational institutions; Learning; State estimation; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2000. Proceedings of the 39th IEEE Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    0191-2216
  • Print_ISBN
    0-7803-6638-7
  • Type

    conf

  • DOI
    10.1109/CDC.2000.912220
  • Filename
    912220