• DocumentCode
    337485
  • Title

    Automatic speaker clustering from multi-speaker utterances

  • Author

    McLaughlin, Jack ; Reynolds, Douglas ; Singer, Elliot ; O´Leary, Gerald C.

  • Author_Institution
    Lincoln Lab., MIT, Lexington, MA, USA
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    817
  • Abstract
    Blind clustering of multi-person utterances by speaker is complicated by the fact that each utterance has at least two talkers. In the case of a two-person conversation, one can simply split each conversation into its respective speaker halves, but this introduces error which ultimately hurts clustering. We propose a clustering algorithm which is capable of associating each conversation with two clusters (and therefore two-speakers) obviating the need for splitting. Results are given for two speaker conversations culled from the Switchboard corpus, and comparisons are made to results obtained on single-speaker utterances. We conclude that although the approach is promising, our technique for computing inter-conversation similarities prior to clustering needs improvement
  • Keywords
    pattern clustering; speech recognition; automatic speaker clustering; blind clustering; clustering algorithm; inter-conversation similarities; multi-person utterances; multi-speaker utterances; two-person conversation; Cepstral analysis; Clustering algorithms; Clustering methods; Laboratories; Lifting equipment; Natural languages; Speech; Telephony; Tree data structures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759796
  • Filename
    759796