DocumentCode :
3424256
Title :
Exploiting contextual information for improved phoneme recognition
Author :
Pinto, Joel ; Yegnanarayana, B. ; Hermansky, H. ; -Doss, Mathew Magimai
Author_Institution :
IDIAP Res. Inst., Martigny
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4449
Lastpage :
4452
Abstract :
In this paper, we investigate the significance of contextual information in a phoneme recognition system using the hidden Markov model - artificial neural network paradigm. Contextual information is probed at the feature level as well as at the output of the multilayered perceptron. At the feature level, we analyze and compare different methods to model sub-phonemic classes. To exploit the contextual information at the output of the multilayered perceptron, we propose the hierarchical estimation of phoneme posterior probabilities. The best phoneme (excluding silence) recognition accuracy of 73.4% on the TIMIT database is comparable to that of the state-of- the-art systems, but more emphasis is on analysis of the contextual information.
Keywords :
hidden Markov models; multilayer perceptrons; speech processing; speech recognition; artificial neural network; contextual information; hidden Markov model; hierarchical estimation; multilayered perceptron; phoneme posterior probabilities; phoneme recognition system; sub-phonemic classes; Artificial neural networks; Databases; Decoding; Hidden Markov models; Hierarchical systems; Information analysis; Matched filters; Multilayer perceptrons; Signal processing; Speech recognition; Phoneme recognition; contextual information; hierarchical systems; matched filters;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518643
Filename :
4518643
Link To Document :
بازگشت