DocumentCode :
1686603
Title :
Developing a speaker identification system for the DARPA RATS project
Author :
Plchot, Oldrich ; Matsoukas, Spyros ; Matejka, Pavel ; Dehak, Najim ; Ma, Jiaxin ; Cumani, Sandro ; Glembek, O. ; Hermansky, Hynek ; Mallidi, S.H. ; Mesgarani, N. ; Schwartz, R. ; Soufifar, M. ; Tan, Z.H. ; Thomas, Stephan ; Zhang, Boming ; Zhou, Xiaoxin
Author_Institution :
Speech@FIT, Brno Univ. of Technol., Brno, Czech Republic
fYear :
2013
Firstpage :
6768
Lastpage :
6772
Abstract :
This paper describes the speaker identification (SID) system developed by the Patrol team for the first phase of the DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state of the art detection capabilities on audio from highly degraded communication channels. We present results using multiple SID systems differing mainly in the algorithm used for voice activity detection (VAD) and feature extraction. We show that (a) unsupervised VAD performs as well supervised methods in terms of downstream SID performance, (b) noise-robust feature extraction methods such as CFCCs out-perform MFCC front-ends on noisy audio, and (c) fusion of multiple systems provides 24% relative improvement in EER compared to the single best system when using a novel SVM-based fusion algorithm that uses side information such as gender, language, and channel id.
Keywords :
feature extraction; speaker recognition; DARPA RATS project; SID system; degraded communication channels; feature extraction; noise-robust feature extraction methods; noisy speech processing; robust automatic transcription of speech program; speaker identification system; unsupervised VAD; voice activity detection; Feature extraction; Mel frequency cepstral coefficient; Noise; Rats; Speech; Speech processing; Support vector machines; noisy speech processing; speaker identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6638972
Filename :
6638972
Link To Document :
بازگشت