Title :
Blind Speaker Clustering
Author :
Iyer, A.N. ; Ofoegbu, U.O. ; Yantorno, R.B. ; Smolenski, B.Y.
Author_Institution :
Signal Process. Lab., Temple Univ., Philadelphia, PA
Abstract :
A novel approach to speaker clustering in telephone conversations is presented in this paper. The method is based on the simple observation that the distance between populations of feature vectors extracted from different speakers is greater than a preset threshold. This observation is incorporated into the clustering problem by formulating a constrained optimization problem, and a modified c-means algorithm is designed to solve it. Another key aspect of speaker clustering is determining the number of clusters, which traditional methods either assume or expect as an input. The proposed method does not require such information; instead, the number of clusters is automatically determined from the data. The performance of the proposed algorithm with the Hellinger, Bhattacharyya, Mahalanobis and generalized likelihood ratio distance measures is evaluated and compared. The approach, employing the Hellinger distance, resulted in an average cluster purity value of 0.85 in experiments performed using the Switchboard telephone conversational speech database. This result indicates a 9% relative improvement in average cluster purity compared to the best-performing agglomerative clustering system.
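As an illustration of two quantities named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes each population of feature vectors is modeled by a single Gaussian, for which the Bhattacharyya and Hellinger distances have closed forms, and it uses the common majority-label definition of cluster purity, which may differ from the definition used in the paper. All function names are hypothetical.

```python
import numpy as np
from collections import Counter


def bhattacharyya_gaussian(x, y):
    """Bhattacharyya distance between Gaussians fitted to x and y.

    x, y: arrays of shape (n_frames, n_dims), one feature vector per row.
    Assumption: a single full-covariance Gaussian per population.
    """
    mu_x, mu_y = x.mean(axis=0), y.mean(axis=0)
    cov_x = np.atleast_2d(np.cov(x, rowvar=False))
    cov_y = np.atleast_2d(np.cov(y, rowvar=False))
    cov = 0.5 * (cov_x + cov_y)
    diff = mu_x - mu_y
    # Mean term: (1/8) * diff^T * cov^{-1} * diff
    term_mean = 0.125 * diff @ np.linalg.solve(cov, diff)
    # Covariance term: (1/2) * ln(det(cov) / sqrt(det(cov_x) * det(cov_y)))
    _, logdet = np.linalg.slogdet(cov)
    _, logdet_x = np.linalg.slogdet(cov_x)
    _, logdet_y = np.linalg.slogdet(cov_y)
    term_cov = 0.5 * (logdet - 0.5 * (logdet_x + logdet_y))
    return term_mean + term_cov


def hellinger_gaussian(x, y):
    """Hellinger distance, via H^2 = 1 - exp(-D_B); lies in [0, 1]."""
    bc = np.exp(-bhattacharyya_gaussian(x, y))
    return float(np.sqrt(max(0.0, 1.0 - bc)))


def cluster_purity(speakers, clusters):
    """Fraction of segments whose speaker is the majority speaker of their cluster.

    Assumption: majority-label purity; the paper may define purity differently.
    """
    majority_total = 0
    for c in set(clusters):
        members = [s for s, k in zip(speakers, clusters) if k == c]
        majority_total += Counter(members).most_common(1)[0][1]
    return majority_total / len(speakers)
```

Under these assumptions, two segments would be treated as coming from different speakers when `hellinger_gaussian` exceeds a chosen threshold; the threshold value itself is not given in the abstract.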
Keywords :
feature extraction; optimisation; speech processing; agglomerative clustering system; blind speaker clustering; c-means algorithm; constrained optimization problem; feature vectors extraction; generalized likelihood ratio distance measures; switchboard telephone conversational speech database; Broadcasting; Clustering algorithms; Constraint optimization; Data mining; Feature extraction; Laboratories; Signal processing; Signal processing algorithms; Speech processing; Telephony;
Conference_Title :
Intelligent Signal Processing and Communications, 2006. ISPACS '06. International Symposium on
Conference_Location :
Yonago
Print_ISBN :
0-7803-9732-0
Electronic_ISBN :
0-7803-9733-9
DOI :
10.1109/ISPACS.2006.364902