Theory for automatic learning under partially observed Markov-dependent noise

Author

Yakowtiz, S. ; Jayawardena, Thusitha ; Li, Shu

Author_Institution

Dept. of Syst. & Ind. Eng., Arizona Univ., Tucson, AZ, USA

Volume

37

Issue

9

fYear

1992

fDate

9/1/1992 12:00:00 AM

Firstpage

1316

Lastpage

1324

Abstract

A vigorous branch of automatic learning is directed at the task of locating a global minimum of an unknown multimodal function f(θ) on the basis of noisy observations L(θ(i))=f(θ(i))+W (θ(i)) taken at sequentially-chosen control points {θ(i)}. In all preceding convergence deviations known to the authors, the noise is postulated to depend on the past only through control selection. Here they allow the observation noise sequence to be stochastically dependent, in particular, a function of an unknown underlying Markov decision process, the observations being the stagewise losses. In a sense, in order to be made precise, the algorithm offered is shown to attain asymptotically optimal performance, and rates are assured. A motivating example from queueing theory is offered, and connections with classical problems of Markov control theory and other disciplines are mentioned

Keywords

Markov processes; decision theory; learning (artificial intelligence); noise; Markov control theory; automatic learning; global minimum; noisy observations; partially observed Markov-dependent noise; queueing theory; stagewise losses; unknown multimodal function; unknown underlying Markov decision process; Automatic control; Control theory; Convergence; Kernel; Minimization methods; Queueing analysis; Random variables; State-space methods; Stochastic processes; Stochastic resonance;

fLanguage

English

Journal_Title

Automatic Control, IEEE Transactions on

Publisher

ieee

ISSN

0018-9286

Type

jour

DOI

10.1109/9.159569

Filename

159569