• DocumentCode
    417175
  • Title

    The ELISA consortium approaches in broadcast news speaker segmentation during the NIST 2003 rich transcription evaluation

  • Author

    Moraru, Daniel ; Meignier, Sylvain ; Fredouille, Corinne ; Besacier, Laurent ; Bonastre, Jean-François

  • Author_Institution
    CLIPS-IMAG, Grenoble, France
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    This paper presents the ELISA consortium's work on automatic speaker segmentation, also known as speaker diarization, during the NIST 2003 Rich Transcription (RT03) evaluation. The experiments were conducted on real broadcast news data (HUB4). Two approaches, from the CLIPS and LIA laboratories, are presented, and several ways of combining them within the ELISA framework are investigated. The system submitted as the ELISA primary system obtained the second-lowest segmentation error rate among the RT03 participants' primary systems. Another ELISA system, submitted as a secondary system, outperformed the best primary system and obtained the lowest speaker segmentation error rate.
  • Keywords
    error statistics; speaker recognition; speech processing; ELISA consortium; broadcast news speaker segmentation; rich transcription evaluation; segmentation error rate; speaker diarization; Acoustic signal detection; Broadcasting; Ear; Error analysis; Indexing; Laboratories; Loudspeakers; NIST; Speech analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326000
  • Filename
    1326000