DocumentCode :
1997394
Title :
Sparse kernel cepstral coefficients (SKCC): Inner-product based features for noise-robust speech recognition
Author :
Fazel, Amin ; Chakrabartty, Shantanu
Author_Institution :
Dept. of Electr. & Comput. Eng., Michigan State Univ., East Lansing, MI, USA
fYear :
2011
fDate :
15-18 May 2011
Firstpage :
2401
Lastpage :
2404
Abstract :
In this paper we present a novel speech feature extraction algorithm based on sparse auditory coding and regression techniques in a reproducing kernel Hilbert space (RKHS). The features known as sparse kernel cepstral coefficients (SKCC) are extracted under the hypothesis that the noise-robust information in speech signal is embedded in a subspace spanned by overcomplete, regularized and normalized gamma- tone basis functions. After identifying the information bearing subspace, noise-robustness is achieved by sparsifying the SKCC features using simple thresholding. We show that computing the SKCC features involves correlating the speech signal with a pre-computed matrix, thus making the algorithm amenable to DSP based implementation. Speech recognition experiments using AURORA 2 dataset demonstrate that the SKCC features delivers consistent improvements in recognition performance over the state-of-the-art features under different noisy recording conditions.
Keywords :
Hilbert spaces; cepstral analysis; feature extraction; regression analysis; sparse matrices; speech coding; speech recognition; SKCC; noise robustness; normalized gamma tone basis functions; pre-computed matrix; regression techniques; reproducing kernel Hilbert space; sparse auditory coding; sparse kernel cepstral coefficients; speech feature extraction; speech recognition; Feature extraction; Kernel; Mel frequency cepstral coefficient; Noise; Robustness; Speech; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems (ISCAS), 2011 IEEE International Symposium on
Conference_Location :
Rio de Janeiro
ISSN :
0271-4302
Print_ISBN :
978-1-4244-9473-6
Electronic_ISBN :
0271-4302
Type :
conf
DOI :
10.1109/ISCAS.2011.5938087
Filename :
5938087
Link To Document :
بازگشت