DocumentCode :
3640864
Title :
Hierarchical tandem feature extraction
Author :
Sunil Sivadas;Hynek Hermansky
Author_Institution :
Oregon Graduate Institute of Science and Technology, Portland, USA
Volume :
1
fYear :
2002
fDate :
5/1/2002 12:00:00 AM
Abstract :
We present a hierarchical architecture for tandem acoustic modeling. In the tandem acoustic modeling paradigm a Multi Layer Perceptron (MLP) is discriminatively trained to estimate phoneme posterior probabilities on a labeled database. The outputs of the MLP after nonlinear transformation and whitening are used as features in a Gaussian Mixture Model (GMM) based recognizer. In this paper we replace the large monolithic MLP with hierarchies of MLP experts. We apply this approach on Speech in Noisy Environments (SPINE 1) evaluation conducted by the Naval Research Laboratory (NRL). We observe a reduction in word error rate of 30% with context-independent models and 5% WER with context-dependent models relative to PLP features.
Keywords :
"Artificial neural networks","Books","Speech"
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743841
Filename :
5743841
Link To Document :
بازگشت