DocumentCode
148485
Title
A novel method for selecting the number of clusters in a speaker diarization system
Author
Lopez-Otero, Paula ; Docio-Fernandez, Laura ; Garcia-Mateo, Carmen
Author_Institution
Multimedia Technol. Group (GTM), Univ. de Vigo, Vigo, Spain
fYear
2014
fDate
1-5 Sept. 2014
Firstpage
656
Lastpage
660
Abstract
This paper introduces the cluster score (C-score) as a measure for determining a suitable number of clusters when performing speaker clustering in a speaker diarization system. C-score finds a trade-off between intra-cluster and extra-cluster similarities, selecting a number of clusters with cluster elements that are similar between them but different to the elements in other clusters. Speech utterances are represented by Gaussian mixture model mean supervectors, and also the projection of the supervectors into a low-dimensional discriminative subspace by linear discriminant analysis is assessed. This technique shows robustness to segmentation errors and, compared with the widely used Bayesian information criterion (BIC)-based stopping criterion, results in a lower speaker clustering error and dramatically reduces computation time. Experiments were run using the broadcast news database used for the Albayzin 2010 Speaker Diarization Evaluation.
Keywords
Gaussian processes; speaker recognition; statistical analysis; C-score; Gaussian mixture model mean supervectors; cluster score; inter-cluster similarity; intra-cluster similarity; linear discriminant analysis; low-dimensional discriminative subspace; speaker clustering; speaker diarization system; speech utterances; Clustering algorithms; Databases; Feature extraction; Robustness; Speech; Speech processing; Vectors; Cluster Similarity; Linear Discriminant Analysis; Speaker Clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
Conference_Location
Lisbon
Type
conf
Filename
6952190
Link To Document