• DocumentCode
    3493624
  • Title

    Explorations on system identification via higher-level application of Adaptive-Critic Approximate Dynamic Programming

  • Author

    Hughes, Joshua G. ; Lendaris, George G.

  • Author_Institution
    Syst. Sci. Grad. Program, Portland State Univ., Portland, OR, USA
  • fYear
    2011
  • fDate
    July 31 2011-Aug. 5 2011
  • Firstpage
    640
  • Lastpage
    647
  • Abstract
    In previous work it was shown that Adaptive- Critic-type Approximate Dynamic Programming could be applied in a “higher-level” way to create autonomous agents capable of using experience to discern context and select optimal, context-dependent control policies. Early experiments with this approach were based on full a priori knowledge of the system being monitored. The experiments reported in this paper, using small neural networks representing families of mappings, were designed to explore what happens when knowledge of the system is less precise. Results of these experiments show that agents trained with this approach perform well when subject to even large amounts of noise or when employing (slightly) imperfect models. The results also suggest that aspects of this method of context discernment are consistent with our intuition about human learning. The insights gained from these explorations can be used to guide further efforts for developing this approach into a general methodology for solving arbitrary identification and control problems.
  • Keywords
    approximation theory; dynamic programming; identification; learning (artificial intelligence); neural nets; adaptive critic approximate dynamic programming; autonomous agents; context dependent control policies; context discernment; higher-level application; human learning; neural networks; system identification; Artificial neural networks; Context; Context modeling; Noise; Process control; System identification; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2011 International Joint Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4244-9635-8
  • Type

    conf

  • DOI
    10.1109/IJCNN.2011.6033281
  • Filename
    6033281