DocumentCode :
2701372
Title :
Incorporating Auditory Feature Uncertainties in Robust Speaker Identification
Author :
Yang Shao ; SRINIVASAN, SUDARSHAN ; DeLiang Wang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Conventional speaker recognition systems perform poorly under noisy conditions. Recent research suggests that binary time-frequency (T-F) masks be a promising front-end for robust speaker recognition. In this paper, we propose novel auditory features based on an auditory periphery model, and show that these features capture significant speaker characteristics. Additionally, we estimate uncertainties of the auditory features based on binary T-F masks, and calculate speaker likelihood scores using uncertainty decoding. Our approach achieves substantial performance improvement in a speaker identification task compared with a state-of-the-art robust front-end in a wide range of signal-to-noise conditions.
Keywords :
audio coding; decoding; feature extraction; speaker recognition; speech coding; auditory feature uncertainties; auditory periphery model; binary T-F masks; robust speaker identification; signal-to-noise conditions; speaker likelihood scores; speaker recognition systems; uncertainty decoding; Acoustic noise; Acoustical engineering; Cepstral analysis; Decoding; Feature extraction; Filter bank; Mel frequency cepstral coefficient; Noise robustness; Speaker recognition; Uncertainty; auditory features; robust speaker identification; uncertainty decoding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366903
Filename :
4218091
Link To Document :
بازگشت