DocumentCode :
3410935
Title :
Robust speaker identification using auditory features and computational auditory scene analysis
Author :
Shao, Yang ; Wang, DeLiang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
1589
Lastpage :
1592
Abstract :
The performance of speaker recognition systems drop significantly under noisy conditions. To improve robustness, we have recently proposed novel auditory features and a robust speaker recognition system using a front-end based on computational auditory scene analysis. In this paper, we further study the auditory features by exploring different feature dimensions and incorporating dynamic features. In addition, we evaluate the features and robust recognition in a speaker identification task in a number of noisy conditions. We find that one of the auditory features performs substantially better than a conventional speaker feature. Furthermore, our recognition system achieves significant performance improvements compared with an advanced front-end in a wide range of signal-to-noise conditions.
Keywords :
feature extraction; speaker recognition; auditory features; computational auditory scene analysis; signal-to-noise condition; speaker identification; speaker recognition; Cepstral analysis; Crosstalk; Feature extraction; Humans; Image analysis; Mel frequency cepstral coefficient; Noise robustness; Signal to noise ratio; Speaker recognition; Speech analysis; Gammatone feature; Gammatone frequency cepstral coefficient; Robust speaker recognition; auditory feature; computational auditory scene analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4517928
Filename :
4517928
Link To Document :
بازگشت