• DocumentCode
    730824
  • Title

    Fusion of speaker and lexical information for topic segmentation: A co-segmentation approach

  • Author

    Charlet, Delphine ; Damnati, Geraldine ; Bouchekif, Abdessalam ; Douib, Ameur

  • Author_Institution
    Orange Labs., Lannion, France
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5261
  • Lastpage
    5265
  • Abstract
    In this work, we investigate how speaker-based information and lexical-based information can be fused efficiently for topic segmentation of spoken contents. While in recent work, we have proposed an early fusion scheme, so as to jointly model speaker and lexical distribution, we propose here a co-segmentation framework, between segmentations performed in the speaker space and in the lexical space. Experiments carried out on two distinct corpora (Radio talk show and TV Broadcast News) show that, even if performances of speaker information are contrasted and closely related to the content structure, its integration with lexical information, either by early fusion or by co-segmentation, is always effective. Absolute gains of 16% (Radio corpus) and 5% (TV corpus) are observed for topic boundary detection performance.
  • Keywords
    computational linguistics; speaker recognition; lexical information; lexical space; speaker space; speaker-based information; spoken contents; topic segmentation; Acoustics; Classification algorithms; Indexes; Legged locomotion; Speech; TV; Topic segmentation; co-segmentation; lexical cohesion; speaker cohesion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178975
  • Filename
    7178975