DocumentCode
3530449
Title
Robust speech feature extraction based on Gabor filtering and tensor factorization
Author
Wu, Qiang ; Zhang, Liqing ; Shi, Guangchuan
Author_Institution
Dept. of Comput. Sci., Shanghai Jiao Tong Univ., Shanghai
fYear
2009
fDate
19-24 April 2009
Firstpage
4649
Lastpage
4652
Abstract
In this paper, we investigate the speech feature extraction problem in the noisy environment. A novel approach based on Gabor filtering and tensor factorization is proposed. From recent physiological and psychoacoustic experimental results, localized spectro-temporal features are essential for auditory perception. We employ 2D-Gabor functions with different scales and directions to analyze the localized patches of power spectrogram, by which speech signal can be encoded as a general higher order tensor. Then nonnegative tensor PCA with sparse constraint is used to learn the projection matrices from multiple interrelated feature subspaces and extract the robust features. Experimental results confirm that our proposed method can improve the speech recognition performance, especially in noisy environment, compared with traditional speech feature extraction methods.
Keywords
Gabor filters; feature extraction; hearing; matrix decomposition; principal component analysis; speech recognition; tensors; Gabor filtering; auditory perception; localized spectro-temporal feature; multiple interrelated feature subspace; nonnegative tensor PCA; power spectrogram; projection matrix; robust speech feature extraction; tensor factorization; Feature extraction; Filtering; Gabor filters; Psychology; Robustness; Signal analysis; Spectrogram; Speech analysis; Tensile stress; Working environment noise; acoustic noise; auditory perception; feature extraction; gabor filtering; speech recognition; tensor factorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960667
Filename
4960667
Link To Document