Title :
The problem of adaptive optimum control in Markov processes
Author :
Karelin, Vladimir ; Polyakova, Lyudmila
Author_Institution :
St.-Peterburg State Univ., St. Petersburg, Russia
fDate :
June 30 2014-July 4 2014
Abstract :
The problem of synthesis of optimum strategy is considered for Dynamic of systems where a part of basic information, required forthe construction of the optimal control is lacking. A method for finding the synthesis of the optimal strategy, based on a recurrent algorithm for computing conditional densitiesis suggested. The basic device at an establishment of the fact of convergence in the given work is the information measure of a mismatch between estimated distribution and an estimation. As such measure so-called information number Kulbaka - Ley-blerais taken. In work convergence of the offered procedure is established.
Keywords :
Markov processes; optimal control; Markov process; adaptive optimum control problem; basic device; basic information part; computing conditional density; estimated distribution; information number Kulbaka-Leyblerais; mismatch information measure; optimal control construction; optimal strategy synthesis; optimum strategy synthesis problem; procedure convergence; recurrent algorithm; system dynamic; Convergence; Difference equations; Educational institutions; Electronic mail; Optimal control; Vectors;
Conference_Titel :
Beam Dynamics and Optimization (BDO), 2014 20th International Workshop on
Conference_Location :
St. Petersburg
Print_ISBN :
978-1-4799-5319-6
DOI :
10.1109/BDO.2014.6890028