DocumentCode :
394325
Title :
Perceptual MVDR-based cepstral coefficients (PMCCs) for robust speech recognition
Author :
Yapanel, Umit H. ; Dharanipragada, Satya
Author_Institution :
Center for Spoken Language Res., Colorado Univ., Boulder, CO, USA
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
This paper describes a robust feature extraction technique for continuous speech recognition. Central to the technique is the minimum variance distortionless response (MVDR) method of spectrum estimation. We incorporate perceptual information directly in to the spectrum estimation. This provides improved robustness and computational efficiency when compared with the previously proposed MVDR-MFCC technique. On an in-car speech recognition task this method, which we refer to as PMCC, is 15% more accurate in WER and requires approximately a factor of 4 times less computation than the MVDR-MFCC technique. On the same task PMCC yields 20% relative improvement over MFCC and 11% relative improvement over PLP frontends. Similar improvements are observed on the Aurora 2 database.
Keywords :
cepstral analysis; error statistics; feature extraction; speech recognition; PMCC; WER; computational efficiency; continuous speech recognition; in-car speech recognition task; minimum variance distortionless response; perceptual MVDR-based cepstral coefficients; robust feature extraction; spectrum estimation; Cepstral analysis; Feature extraction; Finite impulse response filter; Frequency estimation; Natural languages; Power harmonic filters; Robustness; Spectral analysis; Speech analysis; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198863
Filename :
1198863
Link To Document :
بازگشت