DocumentCode
1697235
Title
Advances in optimizing recurrent networks
Author
Bengio, Yoshua ; Boulanger-Lewandowski, Nicolas ; Pascanu, Razvan
Author_Institution
U. Montreal, Montreal, QC, Canada
fYear
2013
Firstpage
8624
Lastpage
8628
Abstract
After a more than decade-long period of relatively little research activity in the area of recurrent neural networks, several new developments will be reviewed here that have allowed substantial progress both in understanding and in technical solutions towards more efficient training of recurrent networks. These advances have been motivated by and related to the optimization issues surrounding deep learning. Although recurrent networks are extremely powerful in what they can in principle represent in terms of modeling sequences, their training is plagued by two aspects of the same issue regarding the learning of long-term dependencies. Experiments reported here evaluate the use of clipping gradients, spanning longer time ranges with leaky integration, advanced momentum techniques, using more powerful output probability models, and encouraging sparser gradients to help symmetry breaking and credit assignment. The experiments are performed on text and music data and show off the combined effects of these techniques in generally improving both training and test error.
Keywords
learning (artificial intelligence); optimisation; recurrent neural nets; advanced momentum; clipping gradients; credit assignment; deep learning; long term dependencies; music data; output probability models; recurrent neural networks optimization; sparser gradients; symmetry breaking; test error; Biological neural networks; Computational modeling; Entropy; Optimization; Recurrent neural networks; Training; Recurrent networks; deep learning; long-term dependencies; representation learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639349
Filename
6639349
Link To Document