• DocumentCode
    1857563
  • Title

    A novel DTW-based distance measure for speaker segmentation

  • Author

    Park, A. ; Glass, J.R.

  • Author_Institution
    Comput. Sci. & Artificial Intell. Lab., MIT, Cambridge, MA
  • fYear
    2006
  • fDate
    10-13 Dec. 2006
  • Firstpage
    22
  • Lastpage
    25
  • Abstract
    We present a novel distance measure for comparing two speech segments that uses a local version of the well-known DTW algorithm. Our approach is based on the idea of finding word-level speech patterns that are repeated by the same speaker. Using this distance measure, we develop a speaker segmentation procedure and apply it to the task of segmenting multi-speaker lectures. We demonstrate that our approach is able to generate segmentations that correlate well to independently generated human segmentations. In experiments performed on over ten hours of multi-speaker lecture data, we were able to find speaker change points with precision and recall rates of 80% and 100%, respectively.
  • Keywords
    speaker recognition; time warp simulation; DTW-based distance measure; dynamic time warp; multi speaker lecture data; speaker segmentation; word-level speech patterns; Artificial intelligence; Broadcasting; Clustering algorithms; Computer science; Glass; Humans; Laboratories; Navigation; Speech; Streaming media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2006. IEEE
  • Conference_Location
    Palm Beach
  • Print_ISBN
    1-4244-0872-5
  • Type

    conf

  • DOI
    10.1109/SLT.2006.326807
  • Filename
    4123352