مرکز منطقه ای اطلاع رساني علوم و فناوري - Incorporating Auditory Feature Uncertainties in Robust Speaker Identification

DocumentCode :

2701372

Title :

Incorporating Auditory Feature Uncertainties in Robust Speaker Identification

Author :

Yang Shao ; SRINIVASAN, SUDARSHAN ; DeLiang Wang

Author_Institution :

Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA

Volume :

fYear :

2007

fDate :

15-20 April 2007

Abstract :

Conventional speaker recognition systems perform poorly under noisy conditions. Recent research suggests that binary time-frequency (T-F) masks be a promising front-end for robust speaker recognition. In this paper, we propose novel auditory features based on an auditory periphery model, and show that these features capture significant speaker characteristics. Additionally, we estimate uncertainties of the auditory features based on binary T-F masks, and calculate speaker likelihood scores using uncertainty decoding. Our approach achieves substantial performance improvement in a speaker identification task compared with a state-of-the-art robust front-end in a wide range of signal-to-noise conditions.

Keywords :

audio coding; decoding; feature extraction; speaker recognition; speech coding; auditory feature uncertainties; auditory periphery model; binary T-F masks; robust speaker identification; signal-to-noise conditions; speaker likelihood scores; speaker recognition systems; uncertainty decoding; Acoustic noise; Acoustical engineering; Cepstral analysis; Decoding; Feature extraction; Filter bank; Mel frequency cepstral coefficient; Noise robustness; Speaker recognition; Uncertainty; auditory features; robust speaker identification; uncertainty decoding;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location :

Honolulu, HI

ISSN :

1520-6149

Print_ISBN :

1-4244-0727-3

Type :

conf

DOI :

10.1109/ICASSP.2007.366903

Filename :

4218091

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2701372