DocumentCode :
3129267
Title :
Clustering Similar Actions in Sequential Decision Processes
Author :
Chiu, Po-Hsiang ; Huber, Manfred
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
fYear :
2009
fDate :
13-15 Dec. 2009
Firstpage :
776
Lastpage :
781
Abstract :
The presence of a large number of available actions in the context of an automated, adaptive decision process can lead to an excessively large search space and thus significantly increase the overhead for the policy learning process. This issue occurs particularly in problem domains such as path planning or grid scheduling where the number of decision points is large and irreducible. The learning algorithm developed in this paper attempts to create a more compact representation of the state and action space by grouping similar actions that are likely leading to very similar future results. Actions are considered similar if they, with high probability, lead to future results with sufficient commonality. This paper develops this action clustering framework within the MDP formalism where actions in any given state are grouped if they result in similar reinforcement feedback based on the past learning experience. The resulting action sets are then considered as a whole in the decision process.
Keywords :
Markov processes; decision theory; learning (artificial intelligence); pattern clustering; probability; search problems; Markov decision process; adaptive decision process; policy learning process; probability; reinforcement feedback; reinforcement learning; search space; sequential decision process; similar action clustering; Application software; Assembly; Clustering algorithms; Computer science; Machine learning; Path planning; Processor scheduling; Resource management; State feedback; State-space methods; action clustering; parametric actions; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications, 2009. ICMLA '09. International Conference on
Conference_Location :
Miami Beach, FL
Print_ISBN :
978-0-7695-3926-3
Type :
conf
DOI :
10.1109/ICMLA.2009.98
Filename :
5382109
Link To Document :
بازگشت