Title :
A novel DTW-based distance measure for speaker segmentation
Author :
Park, A. ; Glass, J.R.
Author_Institution :
Comput. Sci. & Artificial Intell. Lab., MIT, Cambridge, MA
Abstract :
We present a novel distance measure for comparing two speech segments that uses a local version of the well-known DTW algorithm. Our approach is based on the idea of finding word-level speech patterns that are repeated by the same speaker. Using this distance measure, we develop a speaker segmentation procedure and apply it to the task of segmenting multi-speaker lectures. We demonstrate that our approach is able to generate segmentations that correlate well to independently generated human segmentations. In experiments performed on over ten hours of multi-speaker lecture data, we were able to find speaker change points with precision and recall rates of 80% and 100%, respectively.
Keywords :
speaker recognition; time warp simulation; DTW-based distance measure; dynamic time warp; multi speaker lecture data; speaker segmentation; word-level speech patterns; Artificial intelligence; Broadcasting; Clustering algorithms; Computer science; Glass; Humans; Laboratories; Navigation; Speech; Streaming media;
Conference_Titel :
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location :
Palm Beach
Print_ISBN :
1-4244-0872-5
DOI :
10.1109/SLT.2006.326807