DocumentCode
2646483
Title
Blind Speaker Clustering
Author
Iyer, A.N. ; Ofoegbu, U.O. ; Yantorno, R.B. ; Smolenski, B.Y.
Author_Institution
Signal Process. Lab., Temple Univ., Philadelphia, PA
fYear
2006
fDate
12-15 Dec. 2006
Firstpage
343
Lastpage
346
Abstract
A novel approach to performing speaker clustering in telephone conversations is presented in this paper. The method is based on a simple observation that the distance between populations of feature vectors extracted from different speakers is greater than a preset threshold. This observation is incorporated into the clustering problem by the formulation of a constrained optimization problem. A modified c-means algorithm is designed to solve the optimization problem. Another key aspect in speaker clustering is to determine the number of clusters, which is either assumed or expected as an input in traditional methods. The proposed method does not require such information; instead, the number of clusters is automatically determined from the data. The performance of the proposed algorithm with the Hellinger, Bhattacharyya, Mahalanobis and the generalized likelihood ratio distance measures is evaluated and compared. The approach, employing the Hellinger distance, resulted in an average cluster purity value of 0.85 from experiments performed using the switchboard telephone conversation al speech database. The result indicates a 9% relative improvement in the average cluster purity as compared to the best performing agglomerative clustering system
Keywords
feature extraction; optimisation; speech processing; agglomerative clustering system; blind speaker clustering; c-means algorithm; constrained optimization problem; feature vectors extraction; generalized likelihood ratio distance measures; switchboard telephone conversational speech database; Broadcasting; Clustering algorithms; Constraint optimization; Data mining; Feature extraction; Laboratories; Signal processing; Signal processing algorithms; Speech processing; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Signal Processing and Communications, 2006. ISPACS '06. International Symposium on
Conference_Location
Yonago
Print_ISBN
0-7803-9732-0
Electronic_ISBN
0-7803-9733-9
Type
conf
DOI
10.1109/ISPACS.2006.364902
Filename
4212289
Link To Document