Title :
New implementations of the E-HMM-based system for speaker diarization in meeting rooms
Author :
Fredouille, Corinne ; Evans, Nicholas
Author_Institution :
LIA, Univ. of Avignon, Avignon
Date :
March 31 2008-April 4 2008
Abstract :
This paper addresses the problem of speaker diarization in the specific context of meeting room recordings. We report several new enhancements to the E-HMM-based speaker diarization system. These involve a different approach to speaker modelling, utilising EM/ML-based training rather than the MAP adaptation used in our previous work. Using the new system, we investigate the effects of speech activity detection through speaker diarization experiments conducted on 23 meetings extracted from the NIST/RT evaluation campaign datasets. We propose a new approach which assigns confidence values according to the type of information carried by the signal and incorporates these values directly into the speaker diarization system. Experimental results show that, perhaps surprisingly, non-speech segments do not systematically affect the robustness of the speaker diarization system or, more precisely, of the speaker model training process.
Keywords :
hidden Markov models; speaker recognition; speech processing; HMM-based speaker diarization system; meeting room recordings; nonspeech segments; speaker diarization system; speaker modelling; speech activity detection; Clustering algorithms; Data mining; Microphones; NIST; Protocols; Robustness; Shape; Speech analysis; Speech enhancement; confidence values; meeting rooms; speaker diarization
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518620