Title :
Content Selection Operators for Multidocument Summarization Based on Cross-Document Structure Theory
Author :
Jorge, Maria Lucía Castro ; Pardo, Thiago Alexandre Salgueiro
Author_Institution :
Inst. de Cienc. Mat. e de Comput., Univ. de Sao Paulo, São Carlos, Brazil
Abstract :
This paper aims at presenting an analysis of content selection techniques for multidocument summarization based on the multidocument discourse theory CST (Cross-document Structure Theory). We approach the task of content selection by using CST-based operators and focus specifically on redundancy treatment, which is an important and pervasive problem in multidocument summarization. Our experiments with Brazilian Portuguese news texts show that CST improves summaries quality by exploring relations among texts. Particularly, redundancy is reduced by identifying common information among texts, especially when compression rate is low.
Keywords :
Air accidents; Airplanes; Humans;
Conference_Titel :
Information and Human Language Technology (STIL), 2009 Seventh Brazilian Symposium in
Conference_Location :
Sao Carlos, TBD, Brazil
Print_ISBN :
978-1-4244-6008-3
DOI :
10.1109/STIL.2009.15