Title :
A Novel Acoustic Feature Extraction Algorithm Based on Root Cepstrum Coefficients and CCBC for Robust Speech Recognition
Author :
Wang, Xu ; Han, Zhiyan
Author_Institution :
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
Abstract :
Studies have shown that depending on speaker task and environmental conditions, recognizers are sensitive to noisy stressful environments. The focus of this study is to achieve robust recognition in diverse environmental conditions through extracting robust features. Central to the technique is Root Cepstrum Coefficients (RCC) method, instead of logarithm amplitude spectrum and discrete cosine transform of the conventional Mel Frequency Cepstral Coefficients (MFCC), but also using Two-dimensional Root Cepstrum Coefficients (TDRCC). This feature is called TDRCC-MFCC. And then, we consider incorporating Canonical Correlation Based Compensation (CCBC) to cope with the mismatch between training and test set. The mismatch between training and test conditions can be simply clustered into three classes: differences of speakers, changes of recording channel and effects of noisy environment. We evaluate the technique using Back-Propagation Neural Networks (BPNN) on two different tasks: one is in-car speech recognition task, another is different SNR speech recognition. The experimental results show that the novel feature has very good robustness and effectiveness relative to MFCC feature and the CCBC algorithm can make speech recognition system greatly robust to all three kinds of mismatch between training set and test set.
Keywords :
backpropagation; correlation methods; feature extraction; neural nets; speech recognition; Mel frequency cepstral coefficients; acoustic feature extraction; back-propagation neural networks; canonical correlation based compensation; robust speech recognition; two-dimensional root cepstrum coefficients; Acoustic noise; Cepstrum; Discrete cosine transforms; Feature extraction; Loudspeakers; Mel frequency cepstral coefficient; Robustness; Speech recognition; Testing; Working environment noise; Canonical Correlation Based Compensation; Feature Extraction; Root Cepstrum; Speech Recognition;
Conference_Titel :
Intelligent Information Technology Application, 2008. IITA '08. Second International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3497-8
DOI :
10.1109/IITA.2008.562