Title :
Hierarchical tandem feature extraction
Author :
Sunil Sivadas;Hynek Hermansky
Author_Institution :
Oregon Graduate Institute of Science and Technology, Portland, USA
fDate :
5/1/2002 12:00:00 AM
Abstract :
We present a hierarchical architecture for tandem acoustic modeling. In the tandem acoustic modeling paradigm a Multi Layer Perceptron (MLP) is discriminatively trained to estimate phoneme posterior probabilities on a labeled database. The outputs of the MLP after nonlinear transformation and whitening are used as features in a Gaussian Mixture Model (GMM) based recognizer. In this paper we replace the large monolithic MLP with hierarchies of MLP experts. We apply this approach on Speech in Noisy Environments (SPINE 1) evaluation conducted by the Naval Research Laboratory (NRL). We observe a reduction in word error rate of 30% with context-independent models and 5% WER with context-dependent models relative to PLP features.
Keywords :
"Artificial neural networks","Books","Speech"
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.2002.5743841