• DocumentCode
    2646483
  • Title

    Blind Speaker Clustering

  • Author

    Iyer, A.N. ; Ofoegbu, U.O. ; Yantorno, R.B. ; Smolenski, B.Y.

  • Author_Institution
    Signal Process. Lab., Temple Univ., Philadelphia, PA
  • fYear
    2006
  • fDate
    12-15 Dec. 2006
  • Firstpage
    343
  • Lastpage
    346
  • Abstract
    A novel approach to performing speaker clustering in telephone conversations is presented in this paper. The method is based on a simple observation that the distance between populations of feature vectors extracted from different speakers is greater than a preset threshold. This observation is incorporated into the clustering problem by the formulation of a constrained optimization problem. A modified c-means algorithm is designed to solve the optimization problem. Another key aspect in speaker clustering is to determine the number of clusters, which is either assumed or expected as an input in traditional methods. The proposed method does not require such information; instead, the number of clusters is automatically determined from the data. The performance of the proposed algorithm with the Hellinger, Bhattacharyya, Mahalanobis and the generalized likelihood ratio distance measures is evaluated and compared. The approach, employing the Hellinger distance, resulted in an average cluster purity value of 0.85 from experiments performed using the switchboard telephone conversation al speech database. The result indicates a 9% relative improvement in the average cluster purity as compared to the best performing agglomerative clustering system
  • Keywords
    feature extraction; optimisation; speech processing; agglomerative clustering system; blind speaker clustering; c-means algorithm; constrained optimization problem; feature vectors extraction; generalized likelihood ratio distance measures; switchboard telephone conversational speech database; Broadcasting; Clustering algorithms; Constraint optimization; Data mining; Feature extraction; Laboratories; Signal processing; Signal processing algorithms; Speech processing; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Signal Processing and Communications, 2006. ISPACS '06. International Symposium on
  • Conference_Location
    Yonago
  • Print_ISBN
    0-7803-9732-0
  • Electronic_ISBN
    0-7803-9733-9
  • Type

    conf

  • DOI
    10.1109/ISPACS.2006.364902
  • Filename
    4212289