DocumentCode :
3428858
Title :
Learning acoustic frame labeling for speech recognition with recurrent neural networks
Author :
Sak, Hasim ; Senior, Andrew ; Rao, Kanishka ; Irsoy, Ozan ; Graves, Alex ; Beaufays, Francoise ; Schalkwyk, Johan
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4280
Lastpage :
4284
Abstract :
We explore alternative acoustic modeling techniques for large vocabulary speech recognition using Long Short-Term Memory recurrent neural networks. For an acoustic frame labeling task, we compare the conventional approach of cross-entropy (CE) training using fixed forced-alignments of frames and labels, with the Connectionist Temporal Classification (CTC) method proposed for labeling unsegmented sequence data. We demonstrate that the latter can be implemented with finite state transducers. We experiment with phones and context dependent HMM states as acoustic modeling units. We also investigate the effect of context in acoustic input by training unidirectional and bidirectional LSTM RNN models. We show that a bidirectional LSTM RNN CTC model using phone units can perform as well as an LSTM RNN model trained with CE using HMM state alignments. Finally, we also show the effect of sequence discriminative training on these models and show the first results for sMBR training of CTC models.
Keywords :
entropy; hidden Markov models; learning (artificial intelligence); recurrent neural nets; speech recognition; CE training; CTC method; HMM state; LSTM RNN model; connectionist temporal classification method; cross-entropy training; finite state transducer; learning acoustic frame labeling; long short-term memory recurrent neural network; sMBR training; sequence discriminative training; speech recognition; Acoustics; Context modeling; Gold; Hidden Markov models; Neural networks; Speech recognition; Training; CTC; LSTM; RNN; acoustic modeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178778
Filename :
7178778
Link To Document :
بازگشت