• DocumentCode
    3129267
  • Title

    Clustering Similar Actions in Sequential Decision Processes

  • Author

    Chiu, Po-Hsiang ; Huber, Manfred

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
  • fYear
    2009
  • fDate
    13-15 Dec. 2009
  • Firstpage
    776
  • Lastpage
    781
  • Abstract
    The presence of a large number of available actions in the context of an automated, adaptive decision process can lead to an excessively large search space and thus significantly increase the overhead for the policy learning process. This issue occurs particularly in problem domains such as path planning or grid scheduling where the number of decision points is large and irreducible. The learning algorithm developed in this paper attempts to create a more compact representation of the state and action space by grouping similar actions that are likely leading to very similar future results. Actions are considered similar if they, with high probability, lead to future results with sufficient commonality. This paper develops this action clustering framework within the MDP formalism where actions in any given state are grouped if they result in similar reinforcement feedback based on the past learning experience. The resulting action sets are then considered as a whole in the decision process.
  • Keywords
    Markov processes; decision theory; learning (artificial intelligence); pattern clustering; probability; search problems; Markov decision process; adaptive decision process; policy learning process; probability; reinforcement feedback; reinforcement learning; search space; sequential decision process; similar action clustering; Application software; Assembly; Clustering algorithms; Computer science; Machine learning; Path planning; Processor scheduling; Resource management; State feedback; State-space methods; action clustering; parametric actions; reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications, 2009. ICMLA '09. International Conference on
  • Conference_Location
    Miami Beach, FL
  • Print_ISBN
    978-0-7695-3926-3
  • Type

    conf

  • DOI
    10.1109/ICMLA.2009.98
  • Filename
    5382109