DocumentCode :
2958902
Title :
Improving Speaker Diarization by Cross EM Refinement
Author :
Ning, Huazhong ; Xu, Wei ; Gong, Yihong ; Huang, Thomas
Author_Institution :
Beckman Inst., Univ. of Illinois at Urbana-Champaign, Urbana, IL
fYear :
2006
fDate :
9-12 July 2006
Firstpage :
1901
Lastpage :
1904
Abstract :
In this paper, we present a new speaker diarization system that improves the accuracy of traditional hierarchical clustering-based methods with little increase in computational cost. Our contributions are mainly two fold. First, we include a preprocessing called "local clustering" before the hierarchical clustering algorithm to merge very similar adjacent speech segments. This local clustering aims to reduce the number of segments to be clustered by the hierarchical clustering, so as to dramatically increase the processing speed. Second, we perform a postprocessing called "cross EM refinement" to purify the clusters generated by the hierarchical clustering. This algorithm is based on the idea of cross validation and EM algorithm. Our experimental evaluations show that the proposed cross EM refinement approach reduces the speaker diarization error by up to 56%, with an average reduction of 22% compared to the traditional hierarchical clustering method
Keywords :
expectation-maximisation algorithm; speaker recognition; cross EM refinement approach; hierarchical clustering; local clustering algorithm; speaker diarization system; Clustering algorithms; Clustering methods; Computational efficiency; Explosives; Internet; Laboratories; Loudspeakers; National electric code; Refining; Speech; BIC; Cross EM Refinement; Hierarchical Clustering; Speaker Diarization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2006 IEEE International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0366-7
Electronic_ISBN :
1-4244-0367-7
Type :
conf
DOI :
10.1109/ICME.2006.262927
Filename :
4036996
Link To Document :
بازگشت