DocumentCode :
2177462
Title :
Deep Belief Networks using discriminative features for phone recognition
Author :
Mohamed, Abdel-rahman ; Sainath, Tara N. ; Dahl, George ; Ramabhadran, Bhuvana ; Hinton, Geoffrey E. ; Picheny, Michael A.
Author_Institution :
Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5060
Lastpage :
5063
Abstract :
Deep Belief Networks (DBNs) are multi-layer generative models. They can be trained to model windows of coefficients extracted from speech and they discover multiple layers of features that capture the higher-order statistical structure of the data. These features can be used to initialize the hidden units of a feed-forward neural network that is then trained to predict the HMM state for the central frame of the window. Initializing with features that are good at generating speech makes the neural network perform much better than initializing with random weights. DBNs have already been used successfully for phone recognition with input coefficients that are MFCCs or filterbank outputs. In this paper, we demonstrate that they work even better when their inputs are speaker adaptive, discriminative features. On the standard TIMIT corpus, they give phone error rates of 19.6% using monophone HMMs and a bigram language model and 19.4% using monophone HMMs and a trigram language model.
Keywords :
hidden Markov models; higher order statistics; speech recognition; DBN; MFCCs; TIMIT corpus; bigram language model; deep belief networks; discriminative features; higher-order statistical structure; multilayer generative model; phone recognition; trigram language model; Artificial neural networks; Decoding; Error analysis; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Training; Deep belief networks; Discriminative feature transformation; Phone recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947494
Filename :
5947494
Link To Document :
بازگشت