DocumentCode
337485
Title
Automatic speaker clustering from multi-speaker utterances
Author
McLaughlin, Jack ; Reynolds, Douglas ; Singer, Elliot ; O´Leary, Gerald C.
Author_Institution
Lincoln Lab., MIT, Lexington, MA, USA
Volume
2
fYear
1999
fDate
15-19 Mar 1999
Firstpage
817
Abstract
Blind clustering of multi-person utterances by speaker is complicated by the fact that each utterance has at least two talkers. In the case of a two-person conversation, one can simply split each conversation into its respective speaker halves, but this introduces error which ultimately hurts clustering. We propose a clustering algorithm which is capable of associating each conversation with two clusters (and therefore two-speakers) obviating the need for splitting. Results are given for two speaker conversations culled from the Switchboard corpus, and comparisons are made to results obtained on single-speaker utterances. We conclude that although the approach is promising, our technique for computing inter-conversation similarities prior to clustering needs improvement
Keywords
pattern clustering; speech recognition; automatic speaker clustering; blind clustering; clustering algorithm; inter-conversation similarities; multi-person utterances; multi-speaker utterances; two-person conversation; Cepstral analysis; Clustering algorithms; Clustering methods; Laboratories; Lifting equipment; Natural languages; Speech; Telephony; Tree data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location
Phoenix, AZ
ISSN
1520-6149
Print_ISBN
0-7803-5041-3
Type
conf
DOI
10.1109/ICASSP.1999.759796
Filename
759796
Link To Document