DocumentCode :
2131891
Title :
Speaker segmentation and clustering based on the improved spectral clustering
Author :
Ma, Yong ; Bao, Chang-chun ; Liu, Jia
Author_Institution :
Speech & Audio Signal Process. Lab., Beijing Univ. of Technol., Beijing, China
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
1
Lastpage :
5
Abstract :
Efficient speaker segmentation and clustering method based on the improved spectral clustering is proposed in this paper. Traditional speaker segmentation and clustering is performed by the hierarchical clustering algorithms with Bayesian information criterion (BIC) metric and cross likelihood ratio (CLR) metric after the speakers are segmented. Since this method has high computational complexity and may result in a suboptimal solution, we use spectral clustering to overcome this problem and improve the performance of clustering algorithm. First the affinity matrix is constructed with the mean supervector feature transformed by KL kernel mapping. And then the scaling parameter is selected adaptively. The experiments performed on the NIST 1998 multi-speaker corpus show that the proposed method outperforms the baseline system.
Keywords :
Bayes methods; matrix algebra; pattern clustering; speaker recognition; BIC metric; Bayesian information criterion; CLR metric; KL kernel mapping; affinity matrix; cross likelihood ratio; hierarchical clustering algorithm; mean supervector feature; scaling parameter; speaker clustering; speaker segmentation; spectral clustering; Clustering algorithms; Clustering methods; Kernel; Measurement; NIST; Speech; Viterbi algorithm; Bayesian information criterion; Speaker segmentation and clustering; Spectral Clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2011 IEEE International Workshop on
Conference_Location :
Santander
ISSN :
1551-2541
Print_ISBN :
978-1-4577-1621-8
Electronic_ISBN :
1551-2541
Type :
conf
DOI :
10.1109/MLSP.2011.6064579
Filename :
6064579
Link To Document :
بازگشت