DocumentCode
730824
Title
Fusion of speaker and lexical information for topic segmentation: A co-segmentation approach
Author
Charlet, Delphine ; Damnati, Geraldine ; Bouchekif, Abdessalam ; Douib, Ameur
Author_Institution
Orange Labs., Lannion, France
fYear
2015
fDate
19-24 April 2015
Firstpage
5261
Lastpage
5265
Abstract
In this work, we investigate how speaker-based information and lexical-based information can be fused efficiently for topic segmentation of spoken contents. While in recent work, we have proposed an early fusion scheme, so as to jointly model speaker and lexical distribution, we propose here a co-segmentation framework, between segmentations performed in the speaker space and in the lexical space. Experiments carried out on two distinct corpora (Radio talk show and TV Broadcast News) show that, even if performances of speaker information are contrasted and closely related to the content structure, its integration with lexical information, either by early fusion or by co-segmentation, is always effective. Absolute gains of 16% (Radio corpus) and 5% (TV corpus) are observed for topic boundary detection performance.
Keywords
computational linguistics; speaker recognition; lexical information; lexical space; speaker space; speaker-based information; spoken contents; topic segmentation; Acoustics; Classification algorithms; Indexes; Legged locomotion; Speech; TV; Topic segmentation; co-segmentation; lexical cohesion; speaker cohesion;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178975
Filename
7178975
Link To Document