Title :
A novel method for selecting the number of clusters in a speaker diarization system
Author :
Lopez-Otero, Paula ; Docio-Fernandez, Laura ; Garcia-Mateo, Carmen
Author_Institution :
Multimedia Technol. Group (GTM), Univ. de Vigo, Vigo, Spain
Abstract :
This paper introduces the cluster score (C-score) as a measure for determining a suitable number of clusters when performing speaker clustering in a speaker diarization system. C-score finds a trade-off between intra-cluster and extra-cluster similarities, selecting a number of clusters with cluster elements that are similar between them but different to the elements in other clusters. Speech utterances are represented by Gaussian mixture model mean supervectors, and also the projection of the supervectors into a low-dimensional discriminative subspace by linear discriminant analysis is assessed. This technique shows robustness to segmentation errors and, compared with the widely used Bayesian information criterion (BIC)-based stopping criterion, results in a lower speaker clustering error and dramatically reduces computation time. Experiments were run using the broadcast news database used for the Albayzin 2010 Speaker Diarization Evaluation.
Keywords :
Gaussian processes; speaker recognition; statistical analysis; C-score; Gaussian mixture model mean supervectors; cluster score; inter-cluster similarity; intra-cluster similarity; linear discriminant analysis; low-dimensional discriminative subspace; speaker clustering; speaker diarization system; speech utterances; Clustering algorithms; Databases; Feature extraction; Robustness; Speech; Speech processing; Vectors; Cluster Similarity; Linear Discriminant Analysis; Speaker Clustering;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
Conference_Location :
Lisbon