DocumentCode :
2703783
Title :
Optimization of Temporal Filters in the Modulation Frequency Domain via Constrained Linear Discriminant Analysis (C-LDA) for Constructing Robust Features in Speech Recognition
Author :
Jeih-weih Hung
Author_Institution :
Dept. of Electr. Eng., Nat. Chi Nan Univ., Taiwan
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Data-driven temporal filtering approaches based on a specific optimization criterion have been shown to be capable of enhancing the discrimination and robustness of speech features in speech recognition. The filters in these approaches are often obtained with the statistics of the features in the temporal domain. In this paper, we derive new data-driven temporal filters that employ the statistics of the modulation spectra of the speech features. The new temporal filtering approach is based on the constrained version of linear discriminant analysis (C-LDA). It is shown that the proposed C-LDA temporal filters can effectively improve the speech recognition accuracy in various noise corrupted environments. In experiments conducted on Test Set A of the Aurora-2 noisy digits database, these new temporal filters, together with cepstral mean and variance normalization (CMVN), provides average relative error reduction rates of over 47% and 30%, when compared with the baseline MFCC processing and CMVN alone, respectively.
Keywords :
feature extraction; filtering theory; modulation; speech processing; speech recognition; cepstral mean and variance normalization; constrained linear discriminant analysis; data-driven temporal filtering; modulation frequency domain; modulation spectra; speech features; speech recognition; temporal filters; Chirp modulation; Constraint optimization; Filtering; Frequency domain analysis; Frequency modulation; Linear discriminant analysis; Nonlinear filters; Robustness; Speech recognition; Statistics; linear discriminant analysis; modulation frequency; noise robustness; temporal filter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367035
Filename :
4218223
Link To Document :
بازگشت