Title :
Speaker detection using multi-speaker audio files for both enrollment and test
Author :
Bonastre, Jean-François ; Meignier, Sylvuin ; Merlin, Tevu
Author_Institution :
LIA-Avignon, Avignon, France
Abstract :
This paper focuses on speaker detection using multispeaker files both for the enrollment phase and for the test phase. This task was introduced during the 2002 NIST speaker recognition evaluation campaign. Enrollment data is composed of three two-speaker files. Test files are also two-speaker records. The system presented here uses a speaker segmentation process based on an HMM conversation model followed by a speaker matching technique to produce one-speaker segments. Speaker detection is then achieved using AMIRAL, LIA´s GMM-based speaker verification system. Validation of the proposed strategy is done using extracts from the NIST 2002 results.
Keywords :
hidden Markov models; speaker recognition; 2002 NIST speaker recognition evaluation; AMIRAL; HMM conversation model; LIA GMM-based speaker verification system; enrollment data; multispeaker audio files; one-speaker segments; speaker detection; speaker matching technique; speaker segmentation; test data; two-speaker files; two-speaker records; Data mining; Hidden Markov models; Iterative decoding; Loudspeakers; NIST; Phase detection; Speaker recognition; Speech; System testing; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1202298