DocumentCode
446052
Title
Framewise phoneme classification with bidirectional LSTM networks
Author
Graves, Alex ; Schmidhuber, Jurgen
Author_Institution
IDSIA, Switzerland
Volume
4
fYear
2005
fDate
July 31 2005-Aug. 4 2005
Firstpage
2047
Abstract
In this paper, we apply bidirectional training to a long short term memory (LSTM) network for the first time. We also present a modified, full gradient version of the LSTM learning algorithm. We discuss the significance of framewise phoneme classification to continuous speech recognition, and the validity of using bidirectional networks for online causal tasks. On the TIMIT speech database, we measure the framewise phoneme classification scores of bidirectional and unidirectional variants of both LSTM and conventional recurrent neural networks (RNNs). We find that bidirectional LSTM outperforms both RNNs and unidirectional LSTM.
Keywords
recurrent neural nets; signal classification; speech recognition; bidirectional training; continuous speech recognition; framewise phoneme classification; long short term memory network; recurrent neural networks; speech database; Acoustic measurements; Data analysis; Databases; Electronic mail; Error correction; Hidden Markov models; Memory architecture; Neural networks; Recurrent neural networks; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Conference_Location
Montreal, Que.
Print_ISBN
0-7803-9048-2
Type
conf
DOI
10.1109/IJCNN.2005.1556215
Filename
1556215
Link To Document