Title :
Combination of agglomerative and sequential clustering for speaker diarization
Author :
Vijayasenan, Deepu ; Valente, Fabio ; Bourlard, Hervé
Author_Institution :
IDIAP Res. Inst., Martigny
fDate :
March 31 2008-April 4 2008
Abstract :
This paper aims at investigating the use of sequential clustering for speaker diarization. Conventional diarization systems are based on parametric models and agglomerative clustering. In our previous work we proposed a non-parametric method based on the agglomerative information bottleneck for very fast diarization. Here we consider the combination of sequential and agglomerative clustering for avoiding local maxima of the objective function and for purification. Experiments are run on the RT06 eval data. Sequential Clustering with oracle model selection can reduce the speaker error by 10% w.r.t. agglomerative clustering. When the model selection is based on Normalized Mutual Information criterion, a relative improvement of 5% is obtained using a combination of agglomerative and sequential clustering.
Keywords :
speaker recognition; speech synthesis; agglomerative clustering; normalized mutual information criterion; oracle model selection; sequential clustering; speaker diarization; speaker error; Bayesian methods; Clustering algorithms; Hidden Markov models; Mutual information; Parameter estimation; Parametric statistics; Partitioning algorithms; Purification; Speech; Streaming media; Meetings data; Speaker Diarization; agglomerative and sequential information bottleneck;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518621