Fusion of speaker and lexical information for topic segmentation: A co-segmentation approach

Author

Charlet, Delphine ; Damnati, Geraldine ; Bouchekif, Abdessalam ; Douib, Ameur

Author_Institution

Orange Labs., Lannion, France

fYear

2015

fDate

19-24 April 2015

Firstpage

5261

Lastpage

5265

Abstract

In this work, we investigate how speaker-based information and lexical-based information can be fused efficiently for topic segmentation of spoken contents. While in recent work, we have proposed an early fusion scheme, so as to jointly model speaker and lexical distribution, we propose here a co-segmentation framework, between segmentations performed in the speaker space and in the lexical space. Experiments carried out on two distinct corpora (Radio talk show and TV Broadcast News) show that, even if performances of speaker information are contrasted and closely related to the content structure, its integration with lexical information, either by early fusion or by co-segmentation, is always effective. Absolute gains of 16% (Radio corpus) and 5% (TV corpus) are observed for topic boundary detection performance.

Keywords

computational linguistics; speaker recognition; lexical information; lexical space; speaker space; speaker-based information; spoken contents; topic segmentation; Acoustics; Classification algorithms; Indexes; Legged locomotion; Speech; TV; Topic segmentation; co-segmentation; lexical cohesion; speaker cohesion;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location

South Brisbane, QLD

Type

conf

DOI

10.1109/ICASSP.2015.7178975

Filename

7178975