• DocumentCode
    1929043
  • Title

    Attempting to reduce the vanishing gradient effect through a novel recurrent multiscale architecture

  • Author

    Squartini, Stefano ; Hussain, Amir ; Piazza, Francesco

  • Author_Institution
    Dipt. di Elettronica e Autom., Ancona Univ., Italy
  • Volume
    4
  • fYear
    2003
  • fDate
    20-24 July 2003
  • Firstpage
    2819
  • Abstract
    This paper proposes a possible solution to the vanishing gradient problem in recurrent neural networks, occurring when such networks are applied to solving tasks where detection of long term dependencies is required. The main idea consists of pre-processing the signal (a time series typically) through a discrete wavelet decomposition, in order to separate the short term information from the long term ones, and treating each scale by different recurrent neural networks. The partial results concerning all the sequences at diverse time/frequency resolutions are combined through an adaptive nonlinear structure in order to achieve the final goal. This new preprocessing based approach is distinct from the other one reported in literature to-date, as it tends to mitigate the effects of the problem under study avoiding relevant changing in network´s architecture and learning techniques. The overall system (called recurrent multiscale network, RMN) is described and its performances tested through typical tasks namely the latching problem and time series prediction.
  • Keywords
    learning (artificial intelligence); neural net architecture; recurrent neural nets; signal processing; time series; wavelet transforms; adaptive nonlinear structure; discrete wavelet decomposition; latching problem; long term dependencies detection; long term information; recurrent multiscale architecture; short term information; time series prediction; vanishing gradient effect; Computer architecture; Computer hacking; Delay effects; Discrete wavelet transforms; Neural networks; Performance evaluation; Recurrent neural networks; Signal processing algorithms; Signal resolution; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2003. Proceedings of the International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-7898-9
  • Type

    conf

  • DOI
    10.1109/IJCNN.2003.1224018
  • Filename
    1224018