• DocumentCode
    454706
  • Title

    Purity Algorithms for Speaker Diarization of Meetings Data

  • Author

    Anguera, Xavier ; Woofers, C. ; Hernando, Javier

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    When performing speaker diarization, it is common to use an agglomerative clustering approach where the acoustic data is first split in small pieces and then pairs are merged until reaching a stopping point. When using a purely agglomerative clustering technique, one cluster cannot be split into two. Therefore, errors caused by multiple speakers being assigned to one cluster can be common. Furthermore, clusters often contain non-speech frames, creating problems when deciding which two clusters to merge and when to stop the clustering. In this paper, we present two algorithms that aim to purify the clusters. The first assigns conflicting speech segments to a new cluster, and the second detects and eliminates non-speech frames when comparing two clusters. We show improvements of over 18% relative using three datasets from the most current rich transcription (RT) evaluations
  • Keywords
    acoustics; pattern clustering; speech processing; acoustic data; agglomerative clustering approach; meetings data; nonspeech frames; purity algorithms; rich transcription; speaker diarization; speech segments; Acoustic propagation; Audio recording; Clustering algorithms; Computer science; Detectors; Error analysis; Impurities; Loudspeakers; Region 1; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660198
  • Filename
    1660198