مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker segmentation and clustering based on the improved spectral clustering

DocumentCode :

2131891

Title :

Speaker segmentation and clustering based on the improved spectral clustering

Author :

Ma, Yong ; Bao, Chang-chun ; Liu, Jia

Author_Institution :

Speech & Audio Signal Process. Lab., Beijing Univ. of Technol., Beijing, China

fYear :

2011

fDate :

18-21 Sept. 2011

Firstpage :

Lastpage :

Abstract :

Efficient speaker segmentation and clustering method based on the improved spectral clustering is proposed in this paper. Traditional speaker segmentation and clustering is performed by the hierarchical clustering algorithms with Bayesian information criterion (BIC) metric and cross likelihood ratio (CLR) metric after the speakers are segmented. Since this method has high computational complexity and may result in a suboptimal solution, we use spectral clustering to overcome this problem and improve the performance of clustering algorithm. First the affinity matrix is constructed with the mean supervector feature transformed by KL kernel mapping. And then the scaling parameter is selected adaptively. The experiments performed on the NIST 1998 multi-speaker corpus show that the proposed method outperforms the baseline system.

Keywords :

Bayes methods; matrix algebra; pattern clustering; speaker recognition; BIC metric; Bayesian information criterion; CLR metric; KL kernel mapping; affinity matrix; cross likelihood ratio; hierarchical clustering algorithm; mean supervector feature; scaling parameter; speaker clustering; speaker segmentation; spectral clustering; Clustering algorithms; Clustering methods; Kernel; Measurement; NIST; Speech; Viterbi algorithm; Bayesian information criterion; Speaker segmentation and clustering; Spectral Clustering;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning for Signal Processing (MLSP), 2011 IEEE International Workshop on

Conference_Location :

Santander

ISSN :

1551-2541

Print_ISBN :

978-1-4577-1621-8

Electronic_ISBN :

1551-2541

Type :

conf

DOI :

10.1109/MLSP.2011.6064579

Filename :

6064579

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2131891