• DocumentCode
    3245407
  • Title

    A robust speaker clustering algorithm

  • Author

    Ajmera, J. ; Wooters, C.

  • Author_Institution
    IDIAP, Martigny, Switzerland
  • fYear
    2003
  • fDate
    30 Nov.-3 Dec. 2003
  • Firstpage
    411
  • Lastpage
    416
  • Abstract
    In this paper, we present a novel speaker segmentation and clustering algorithm. The algorithm automatically performs both speaker segmentation and clustering without any prior knowledge of the identities or the number of speakers. Our algorithm uses "standard" speech processing components and techniques such as HMM, agglomerative clustering, and the Bayesian information criterion. However, we have combined and modified these so as to produce an algorithm with the following advantages: no threshold adjustment requirements; no need for training/development data; and robustness to different data conditions. This paper also reports the performance of this algorithm on different datasets released by the USA National Institute of Standards and Technology (NIST) with different initial conditions and parameter settings. The consistently low speaker-diarization error rate clearly indicates the robustness and utility of the algorithm.
  • Keywords
    belief networks; error statistics; hidden Markov models; pattern clustering; speaker recognition; speech processing; Bayesian information criterion; HMM; NIST; USA National Institute of Standards and Technology; agglomerative clustering; performance; robust speaker clustering algorithm; speaker segmentation; speaker-diarization error rate; standard speech processing; Audio recording; Bayesian methods; Clustering algorithms; Error analysis; Hidden Markov models; Information retrieval; Iterative algorithms; NIST; Robustness; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7980-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2003.1318476
  • Filename
    1318476