DocumentCode
312136
Title
Noise robust estimate of speech dynamics for speaker recognition
Author
Openshaw, J.P. ; Mason, J.S.
Author_Institution
Dept. of Electr. Eng., Univ. Coll. of Swansea, UK
Volume
2
fYear
1996
fDate
3-6 Oct 1996
Firstpage
925
Abstract
The paper investigates the robustness of cepstral based features with respect to additive noise, and details two methods of increasing the robustness with minimal need for a-priori knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-band filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids negative values normally inherent in such filtering and which presents difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise, for example, speaker identification error rates in SNR mis-matched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.3% and 24.1% for the two approaches respectively
Keywords
Fourier transforms; cepstral analysis; filters; noise; speaker recognition; Fourier transforms; additive noise; cepstral based feature robustness; dynamic speech content estimation; fixed offset; linear spectral estimate; log estimates; mis-matched conditions; noise masking; noise robust estimate; speaker identification error rates; speaker recognition; speech dynamics; standard mel cepstra; sub-band filtering; Additive noise; Cepstral analysis; Error analysis; Filtering; Fourier transforms; Noise robustness; Nonlinear filters; Speaker recognition; Speech enhancement; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607753
Filename
607753
Link To Document