DocumentCode :
3328970
Title :
An evaluation of a nonlinear feature transformation for conversational speech recognition
Author :
Omar, Mohamed Kamal ; Kingsbury, Brian
Author_Institution :
Illinois Univ., Urbana, IL, USA
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
We test the nonlinear symplectic maximum-likelihood transformation (SMLT) on two large-vocabulary, conversational speech recognition tasks: IBM´s Superhuman test and the DARPA 2003 Rich Transcription (RT03) test. Features in these tests are computed via linear discriminant analysis (LDA) on spliced MFCC features and subsequent transformation of the projected features using either a maximum-likelihood linear transformation (MLLT), an SMLT, or both. In contrast to previous tests of the SMLT on TIMIT phone recognition with static and delta MFCC, these tests use a more difficult task and very different features. The four results of this work are that both LDA+MLLT and LDA+SMLT systems outperform an LDA-only system; the LDA+MLLT system outperforms the LDA+SMLT system (but the MLLT has 20 times more parameters than the SMLT); small improvements over an LDA+MLLT system are obtained with an LDA+MLLT+SMLT system on well-matched material; and no improvements are obtained using two class-dependent SMLT in an LDA+MLLT+SMLT system.
Keywords :
feature extraction; maximum likelihood estimation; speech recognition; DARPA 2003 Rich Transcription; IBM Superhuman test; LDA; MLLT; RT03 test; SMLT; conversational speech recognition; large-vocabulary speech recognition; linear discriminant analysis; maximum-likelihood linear transformation; nonlinear feature transformation; spliced MFCC features; symplectic maximum-likelihood transformation; Automatic speech recognition; Degradation; Gaussian processes; Hidden Markov models; Linear discriminant analysis; Maximum likelihood estimation; Mel frequency cepstral coefficient; Parameter estimation; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326103
Filename :
1326103
Link To Document :
بازگشت