Regional Information Center for Science and Technology - Speaker detection using multi-speaker audio files for both enrollment and test

DocumentCode :

395197

Title :

Speaker detection using multi-speaker audio files for both enrollment and test

Author :

Bonastre, Jean-François ; Meignier, Sylvain ; Merlin, Teva

Author_Institution :

LIA-Avignon, Avignon, France

Volume :

fYear :

2003

fDate :

6-10 April 2003

Abstract :

This paper focuses on speaker detection using multispeaker files both for the enrollment phase and for the test phase. This task was introduced during the 2002 NIST speaker recognition evaluation campaign. Enrollment data is composed of three two-speaker files. Test files are also two-speaker records. The system presented here uses a speaker segmentation process based on an HMM conversation model followed by a speaker matching technique to produce one-speaker segments. Speaker detection is then achieved using AMIRAL, LIA's GMM-based speaker verification system. Validation of the proposed strategy is done using extracts from the NIST 2002 results.
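
The final verification step named in the abstract can be illustrated with a generic GMM log-likelihood-ratio score. The sketch below is not AMIRAL and is not taken from the paper: it assumes diagonal-covariance mixtures, a single background model, and an average per-frame score, and every name in it (`llr_score`, `speaker_gmm`, `background_gmm`, the toy frames, the 0.0 threshold) is hypothetical.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_log_likelihood(x, gmm):
    """Log-likelihood of x under a GMM given as (weight, mean, var) triples."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)
    # Log-sum-exp keeps the mixture sum numerically stable.
    return peak + math.log(sum(math.exp(t - peak) for t in terms))

def llr_score(frames, speaker_gmm, background_gmm):
    """Average per-frame log-likelihood ratio over a one-speaker segment."""
    total = sum(
        gmm_log_likelihood(x, speaker_gmm) - gmm_log_likelihood(x, background_gmm)
        for x in frames
    )
    return total / len(frames)

# Toy, made-up models and frames (assumptions, not data from the paper).
speaker_gmm = [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [2.0, 2.0], [1.0, 1.0])]
background_gmm = [(1.0, [5.0, 5.0], [4.0, 4.0])]
segment = [[0.1, -0.2], [1.9, 2.1], [0.3, 0.4]]

score = llr_score(segment, speaker_gmm, background_gmm)
accept = score > 0.0  # the 0.0 threshold is an arbitrary illustrative choice
```

In a two-speaker setting such as the one the abstract describes, a score of this kind would be computed on each one-speaker segment produced by the segmentation and matching stages, rather than on the whole mixed recording.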

Keywords :

hidden Markov models; speaker recognition; 2002 NIST speaker recognition evaluation; AMIRAL; HMM conversation model; LIA GMM-based speaker verification system; enrollment data; multispeaker audio files; one-speaker segments; speaker detection; speaker matching technique; speaker segmentation; test data; two-speaker files; two-speaker records; Data mining; Hidden Markov models; Iterative decoding; Loudspeakers; NIST; Phase detection; Speaker recognition; Speech; System testing; Viterbi algorithm;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-7663-3

Type :

conf

DOI :

10.1109/ICASSP.2003.1202298

Filename :

1202298

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=395197