Title :
Incorporating Auditory Feature Uncertainties in Robust Speaker Identification
Author :
Yang Shao ; SRINIVASAN, SUDARSHAN ; DeLiang Wang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
Conventional speaker recognition systems perform poorly under noisy conditions. Recent research suggests that binary time-frequency (T-F) masks be a promising front-end for robust speaker recognition. In this paper, we propose novel auditory features based on an auditory periphery model, and show that these features capture significant speaker characteristics. Additionally, we estimate uncertainties of the auditory features based on binary T-F masks, and calculate speaker likelihood scores using uncertainty decoding. Our approach achieves substantial performance improvement in a speaker identification task compared with a state-of-the-art robust front-end in a wide range of signal-to-noise conditions.
Keywords :
audio coding; decoding; feature extraction; speaker recognition; speech coding; auditory feature uncertainties; auditory periphery model; binary T-F masks; robust speaker identification; signal-to-noise conditions; speaker likelihood scores; speaker recognition systems; uncertainty decoding; Acoustic noise; Acoustical engineering; Cepstral analysis; Decoding; Feature extraction; Filter bank; Mel frequency cepstral coefficient; Noise robustness; Speaker recognition; Uncertainty; auditory features; robust speaker identification; uncertainty decoding;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366903