Title :
Exploiting long-range temporal dynamics of speech for noise-robust speaker recognition
Author :
Jafari, Ayeh ; Srinivasan, Ramji ; Crookes, Danny ; Ji Ming
Author_Institution :
Inst. of Electron., Commun. & Inf. Technol., Queen´s Univ. Belfast, Belfast, UK
fDate :
Aug. 29 2011-Sept. 2 2011
Abstract :
Temporal dynamics is an important feature of speech that distinguishes speech from noise, as well as distinguishing between different speakers. In this paper, we present an approach to maximally extract this feature of speech to improve the robustness against background noise, for text-independent speaker recognition. The new approach identifies and compares the longest matching speech segments between the training and test speech to increase noise immunity. Experiments have been conducted on the NIST 2002 SRE database in the presence of various types of noise including fast-varying song and music. The new approach has shown significantly improved performance over conventional noise-robust techniques.
Keywords :
speaker recognition; NIST 2002 SRE database; background noise; fast-varying song; long-range temporal dynamics; matching speech segments; music; noise immunity; noise-robust speaker recognition; test speech; text-independent speaker recognition; training speech; Least squares approximations; Noise; Noise measurement; Speaker recognition; Speech; Speech recognition; Training;
Conference_Titel :
Signal Processing Conference, 2011 19th European
Conference_Location :
Barcelona