Title :
State evaluation strategy for exemplar-based policy optimization of dynamic decision problems
Author :
Ikeda, Kokolo ; Kita, Hajime
Author_Institution :
Kyoto Univ., Kyoto
Abstract :
Direct policy search (DPS) that optimizes the parameters of a decision making model, combined with evolutionary algorithms which enable robust optimization, is a promising approach to dynamic decision problems. Exemplar- based policy (EBP) optimization is a novel framework for DPS in which the policy is composed of a set of exemplars and a case- based action selector, with the set of exemplars being refined and evolved using a GA. In this paper, state evaluation type EBP representations are proposed for the problem class whose state transition can be predicted. For example, the vector-real representation defines pairs of feature vector and its desirability as exemplars, and evaluate the predicted next states using the exemplars. The state evaluation type EBP-based optimization procedures are shown to be superior to conventional state-action type EBP optimization through application to the Tetris game.
Keywords :
Markov processes; decision making; evolutionary computation; optimisation; search problems; Markov decision process; Tetris game; case-based action selector; decision making model; direct policy search; dynamic decision problem; evolutionary algorithm; exemplar-based policy optimization; feature vector; state evaluation strategy; state transition; vector-real representation; Acceleration; Artificial neural networks; Decision making; Evolutionary computation; Game theory; Genetic algorithms; Learning; Robustness; Search methods; State-space methods;
Conference_Titel :
Evolutionary Computation, 2007. CEC 2007. IEEE Congress on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-1339-3
Electronic_ISBN :
978-1-4244-1340-9
DOI :
10.1109/CEC.2007.4424950