DocumentCode :
1813889
Title :
Reducing the environmental sensitivity of cepstral features for speaker recognition
Author :
Openshaw, J.P. ; Mason, J.S.
Author_Institution :
Dept. of Electr. Eng., Univ. Coll. of Swansea, UK
Volume :
1
fYear :
1996
fDate :
14-18 Oct 1996
Firstpage :
721
Abstract :
This paper investigates the robustness of cepstral based features with respect to additive noise, and details two methods of increasing the robustness with minimal need for a-priori knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-band filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids negative values normally inherent in such filtering and which presents difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise, for example, speaker identification error rates in SNR mis-matched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches respectively, a relative reduction in error of 77% and 60.1%
Keywords :
Fourier transforms; acoustic filters; acoustic noise; cepstral analysis; error analysis; speaker recognition; Fourier transform; additive noise; cepstral features; dynamic content; environmental sensitivity; error rates; linear domain; linear spectral estimate; log estimate; noise masking; noise statistics; robustness; speaker recognition; sub-band filtering; Additive noise; Cepstral analysis; Error analysis; Filtering; Fourier transforms; Noise robustness; Nonlinear filters; Speech; Statistics; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, 1996., 3rd International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-2912-0
Type :
conf
DOI :
10.1109/ICSIGP.1996.567364
Filename :
567364
Link To Document :
بازگشت