• DocumentCode
    1161093
  • Title

    An overview of automatic speaker diarization systems

  • Author

    Tranter, Sue E. ; Reynolds, Douglas A.

  • Author_Institution
    Dept. of Eng., Cambridge Univ.
  • Volume
    14
  • Issue
    5
  • fYear
    2006
  • Firstpage
    1557
  • Lastpage
    1565
  • Abstract
    Audio diarization is the process of annotating an input audio channel with information that attributes (possibly overlapping) temporal regions of signal energy to their specific sources. These sources can include particular speakers, music, background noise sources, and other signal source/channel characteristics. Diarization can be used for helping speech recognition, facilitating the searching and indexing of audio archives, and increasing the richness of automatic transcriptions, making them more readable. In this paper, we provide an overview of the approaches currently used in a key area of audio diarization, namely speaker diarization, and discuss their relative merits and limitations. Performances using the different techniques are compared within the framework of the speaker diarization task in the DARPA EARS Rich Transcription evaluations. We also look at how the techniques are being introduced into real broadcast news systems and their portability to other domains and tasks such as meetings and speaker verification
  • Keywords
    audio signal processing; speaker recognition; DARPA EARS Rich Transcription evaluations; audio archives; audio diarization; automatic speaker diarization systems; automatic transcriptions; broadcast news systems; input audio channel annotation; speaker verification; speech recognition; temporal signal energy region; Acoustic noise; Broadcasting; Colored noise; Data mining; Indexing; Loudspeakers; Multiple signal classification; Music; Signal processing; Speech recognition; Speaker diarization; speaker segmentation and clustering;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2006.878256
  • Filename
    1677976