Title :
A Cluster Purification Algorithm for Speaker Diarization System
Author_Institution :
Sch. of Inf. & Commun. Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
In speaker diarization system, it´s common to use bottom-up clustering method where the input data is first split in small pieces and then merged the most similar segments until reaching a stopping point. However, it´s not easy to ensure that every selection merges the right pair, and these errors tend to deteriorate the post-merging results. In this paper, a fast cluster purification algorithm is introduced after the size of clusters reach to a pre-estimated number K, which is no less than the real number of speakers involved in the conversation, and try to remedy the errors by removing the inappropriate segments into the right cluster. An effective way to estimate the number of speakers is also introduced before the clustering stage. The experiment results show improvement in both the purity of cluster and the diarization error rate (DER) after using the purification algorithm. The purity improves 0.8% and DER reduces 1.14% in average.
Keywords :
error statistics; pattern clustering; speaker recognition; DER; cluster purification algorithm; cluster segments; clusters size; diarization error rate; segment shuffling; speaker diarization system; speaker number estimation; speakers conversation; Clustering algorithms; Conferences; Density estimation robust algorithm; Measurement; Purification; Speech; Speech processing; cluster purification; segment shuffling; speaker diarization; speaker number estimation;
Conference_Titel :
Computational Intelligence and Design (ISCID), 2014 Seventh International Symposium on
Print_ISBN :
978-1-4799-7004-9
DOI :
10.1109/ISCID.2014.129