• DocumentCode
    831796
  • Title

    Theory for automatic learning under partially observed Markov-dependent noise

  • Author

    Yakowtiz, S. ; Jayawardena, Thusitha ; Li, Shu

  • Author_Institution
    Dept. of Syst. & Ind. Eng., Arizona Univ., Tucson, AZ, USA
  • Volume
    37
  • Issue
    9
  • fYear
    1992
  • fDate
    9/1/1992 12:00:00 AM
  • Firstpage
    1316
  • Lastpage
    1324
  • Abstract
    A vigorous branch of automatic learning is directed at the task of locating a global minimum of an unknown multimodal function f(θ) on the basis of noisy observations L(θ(i))=f(θ(i))+W (θ(i)) taken at sequentially-chosen control points {θ(i)}. In all preceding convergence deviations known to the authors, the noise is postulated to depend on the past only through control selection. Here they allow the observation noise sequence to be stochastically dependent, in particular, a function of an unknown underlying Markov decision process, the observations being the stagewise losses. In a sense, in order to be made precise, the algorithm offered is shown to attain asymptotically optimal performance, and rates are assured. A motivating example from queueing theory is offered, and connections with classical problems of Markov control theory and other disciplines are mentioned
  • Keywords
    Markov processes; decision theory; learning (artificial intelligence); noise; Markov control theory; automatic learning; global minimum; noisy observations; partially observed Markov-dependent noise; queueing theory; stagewise losses; unknown multimodal function; unknown underlying Markov decision process; Automatic control; Control theory; Convergence; Kernel; Minimization methods; Queueing analysis; Random variables; State-space methods; Stochastic processes; Stochastic resonance;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/9.159569
  • Filename
    159569