DocumentCode
1857563
Title
A novel DTW-based distance measure for speaker segmentation
Author
Park, A. ; Glass, J.R.
Author_Institution
Comput. Sci. & Artificial Intell. Lab., MIT, Cambridge, MA
fYear
2006
fDate
10-13 Dec. 2006
Firstpage
22
Lastpage
25
Abstract
We present a novel distance measure for comparing two speech segments that uses a local version of the well-known DTW algorithm. Our approach is based on the idea of finding word-level speech patterns that are repeated by the same speaker. Using this distance measure, we develop a speaker segmentation procedure and apply it to the task of segmenting multi-speaker lectures. We demonstrate that our approach is able to generate segmentations that correlate well to independently generated human segmentations. In experiments performed on over ten hours of multi-speaker lecture data, we were able to find speaker change points with precision and recall rates of 80% and 100%, respectively.
Keywords
speaker recognition; time warp simulation; DTW-based distance measure; dynamic time warp; multi speaker lecture data; speaker segmentation; word-level speech patterns; Artificial intelligence; Broadcasting; Clustering algorithms; Computer science; Glass; Humans; Laboratories; Navigation; Speech; Streaming media;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location
Palm Beach
Print_ISBN
1-4244-0872-5
Type
conf
DOI
10.1109/SLT.2006.326807
Filename
4123352
Link To Document