Title : 
State evaluation strategy for exemplar-based policy optimization of dynamic decision problems
         
        
            Author : 
Ikeda, Kokolo ; Kita, Hajime
         
        
            Author_Institution : 
Kyoto Univ., Kyoto
         
        
        
        
        
        
            Abstract : 
Direct policy search (DPS) that optimizes the parameters of a decision making model, combined with evolutionary algorithms which enable robust optimization, is a promising approach to dynamic decision problems. Exemplar- based policy (EBP) optimization is a novel framework for DPS in which the policy is composed of a set of exemplars and a case- based action selector, with the set of exemplars being refined and evolved using a GA. In this paper, state evaluation type EBP representations are proposed for the problem class whose state transition can be predicted. For example, the vector-real representation defines pairs of feature vector and its desirability as exemplars, and evaluate the predicted next states using the exemplars. The state evaluation type EBP-based optimization procedures are shown to be superior to conventional state-action type EBP optimization through application to the Tetris game.
         
        
            Keywords : 
Markov processes; decision making; evolutionary computation; optimisation; search problems; Markov decision process; Tetris game; case-based action selector; decision making model; direct policy search; dynamic decision problem; evolutionary algorithm; exemplar-based policy optimization; feature vector; state evaluation strategy; state transition; vector-real representation; Acceleration; Artificial neural networks; Decision making; Evolutionary computation; Game theory; Genetic algorithms; Learning; Robustness; Search methods; State-space methods;
         
        
        
        
            Conference_Titel : 
Evolutionary Computation, 2007. CEC 2007. IEEE Congress on
         
        
            Conference_Location : 
Singapore
         
        
            Print_ISBN : 
978-1-4244-1339-3
         
        
            Electronic_ISBN : 
978-1-4244-1340-9
         
        
        
            DOI : 
10.1109/CEC.2007.4424950