DocumentCode
831796
Title
Theory for automatic learning under partially observed Markov-dependent noise
Author
Yakowtiz, S. ; Jayawardena, Thusitha ; Li, Shu
Author_Institution
Dept. of Syst. & Ind. Eng., Arizona Univ., Tucson, AZ, USA
Volume
37
Issue
9
fYear
1992
fDate
9/1/1992 12:00:00 AM
Firstpage
1316
Lastpage
1324
Abstract
A vigorous branch of automatic learning is directed at the task of locating a global minimum of an unknown multimodal function f (θ) on the basis of noisy observations L (θ(i ))=f (θ(i ))+W (θ(i )) taken at sequentially-chosen control points {θ(i )}. In all preceding convergence deviations known to the authors, the noise is postulated to depend on the past only through control selection. Here they allow the observation noise sequence to be stochastically dependent, in particular, a function of an unknown underlying Markov decision process, the observations being the stagewise losses. In a sense, in order to be made precise, the algorithm offered is shown to attain asymptotically optimal performance, and rates are assured. A motivating example from queueing theory is offered, and connections with classical problems of Markov control theory and other disciplines are mentioned
Keywords
Markov processes; decision theory; learning (artificial intelligence); noise; Markov control theory; automatic learning; global minimum; noisy observations; partially observed Markov-dependent noise; queueing theory; stagewise losses; unknown multimodal function; unknown underlying Markov decision process; Automatic control; Control theory; Convergence; Kernel; Minimization methods; Queueing analysis; Random variables; State-space methods; Stochastic processes; Stochastic resonance;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/9.159569
Filename
159569
Link To Document