• DocumentCode
    431326
  • Title

    Two-way cluster voting to improve speaker diarisation performance

  • Author

    Tranter, S.E.

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    1
  • fYear
    2005
  • fDate
    23-23 March 2005
  • Abstract
    Speaker diarisation is the task of automatically segmenting audio data and providing speaker labels for the resulting regions of audio. A cluster-voting scheme is described which takes the output from two speaker diarisation systems and produces a new output which aims to have a lower speaker diarisation error rate (DER) than either input. The scheme works in two stages: the first produces a set of possible outputs which minimise a distance metric based on the DER; the second votes between these alternatives to give the final output. Decisions where the inputs agree are always passed to the output and those where the inputs differ are re-evaluated in the final voting stage. Results are presented on the 6-show RT-03 broadcast news evaluation data; they show that the DER can be reduced by 1.64% and 2.56% absolute using this method when combining the best two Cambridge University and the best two MIT Lincoln Laboratory diarisation systems respectively.
  • Keywords
    decision making; error statistics; minimisation; speaker recognition; audio data segmentation; distance metric minimisation; error rate; speaker diarisation; speaker labels; two-way cluster voting; Broadcasting; Databases; Density estimation robust algorithm; Error analysis; Indexing; Laboratories; Performance analysis; Speech recognition; US Government; Voting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8874-7
  • Type

    conf

  • DOI
    10.1109/ICASSP.2005.1415223
  • Filename
    1415223