Regional Information Center for Science and Technology (RICEST) - State evaluation strategy for exemplar-based policy optimization of dynamic decision problems

DocumentCode :

2695057

Title :

State evaluation strategy for exemplar-based policy optimization of dynamic decision problems

Author :

Ikeda, Kokolo ; Kita, Hajime

Author_Institution :

Kyoto Univ., Kyoto

fYear :

2007

fDate :

25-28 Sept. 2007

Firstpage :

3685

Lastpage :

3691

Abstract :

Direct policy search (DPS), which optimizes the parameters of a decision-making model, combined with evolutionary algorithms, which enable robust optimization, is a promising approach to dynamic decision problems. Exemplar-based policy (EBP) optimization is a novel framework for DPS in which the policy is composed of a set of exemplars and a case-based action selector, with the set of exemplars being refined and evolved using a genetic algorithm (GA). In this paper, state evaluation type EBP representations are proposed for the class of problems whose state transitions can be predicted. For example, the vector-real representation defines exemplars as pairs of a feature vector and its desirability, and evaluates the predicted next states using these exemplars. The state evaluation type EBP-based optimization procedures are shown to be superior to conventional state-action type EBP optimization through application to the game of Tetris.
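The state evaluation idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the nearest-neighbour lookup with Euclidean distance, the names `evaluate_state`, `select_action`, and `predict_next_features`, and the one-dimensional toy problem are all assumptions made for illustration. The sketch covers only action selection with a fixed exemplar set; the GA that evolves the exemplars is not shown.

```python
from math import dist
from typing import Callable, Sequence, Tuple

# An exemplar pairs a feature vector with its desirability
# (the "vector-real" representation named in the abstract).
Exemplar = Tuple[Tuple[float, ...], float]


def evaluate_state(features: Sequence[float], exemplars: Sequence[Exemplar]) -> float:
    """Score a state by the desirability of its nearest exemplar.

    Assumption: 1-nearest-neighbour with Euclidean distance stands in
    for the case-based selector; the abstract does not specify it.
    """
    nearest = min(exemplars, key=lambda ex: dist(ex[0], features))
    return nearest[1]


def select_action(
    state: float,
    actions: Sequence[int],
    predict_next_features: Callable[[float, int], Tuple[float, ...]],
    exemplars: Sequence[Exemplar],
) -> int:
    """Pick the action whose predicted next state scores highest."""
    return max(
        actions,
        key=lambda a: evaluate_state(predict_next_features(state, a), exemplars),
    )


# Hypothetical toy problem: the state is a position on a line and an
# action shifts it by -1, 0, or +1. The transition is fully predictable.
def toy_predict(state: float, action: int) -> Tuple[float, ...]:
    return (state + action,)


toy_exemplars = [
    ((0.0,), 0.2),   # mildly desirable region
    ((2.2,), 0.9),   # highly desirable region
    ((4.0,), -1.0),  # undesirable region
]

best_action = select_action(1.0, [-1, 0, 1], toy_predict, toy_exemplars)
```

In this toy setting, moving by +1 leads to the predicted state closest to the most desirable exemplar, so that action is chosen. A state-action type policy would instead store (state, action) pairs and need no transition model.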

Keywords :

Markov processes; decision making; evolutionary computation; optimisation; search problems; Markov decision process; Tetris game; case-based action selector; decision making model; direct policy search; dynamic decision problem; evolutionary algorithm; exemplar-based policy optimization; feature vector; state evaluation strategy; state transition; vector-real representation; Acceleration; Artificial neural networks; Decision making; Evolutionary computation; Game theory; Genetic algorithms; Learning; Robustness; Search methods; State-space methods;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Evolutionary Computation, 2007. CEC 2007. IEEE Congress on

Conference_Location :

Singapore

Print_ISBN :

978-1-4244-1339-3

Electronic_ISBN :

978-1-4244-1340-9

Type :

conf

DOI :

10.1109/CEC.2007.4424950

Filename :

4424950

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2695057