DocumentCode :
658376
Title :
A Four Dimension Graph Model for Automatic Text Summarization
Author :
Ferreira, Ricardo ; Freitas, Fred ; De Souza Cabral, Luciano ; Dueire Lins, Rafael ; Lima, Raphaela ; Franca, Gabriel ; Simskez, Steven J. ; Favaro, Luciano
Author_Institution :
Inf. Center, Fed. Univ. of Pernambuco, Recife, Brazil
Volume :
1
fYear :
2013
fDate :
17-20 Nov. 2013
Firstpage :
389
Lastpage :
396
Abstract :
Text summarization is the process of automatically creating a shorter version of one or more text documents. In this context, word-based, sentence-based and graph-based approaches are widely used. Among these, graph-based methods for automatic text summarization produce summaries based on the relationships between sentences. These relationships may also support the creation of several text processing applications, such as extractive and abstractive summaries, question-answering and information retrieval systems, among others. This paper proposes a new graph model for text processing applications. It relies on four dimensions (similarity, semantic similarity, coreference, and discourse information) to create the graph. The rationale behind the proposal is to use more dimensions than previous works and to take coreference resolution into account, which captures the role of pronouns in connecting sentences. Coreference had not been used in any previous graph-based summarization technique. An experiment was performed on the CNN corpus using the TextRank algorithm with the proposed model. The results show that the proposed model outperforms current approaches both quantitatively and qualitatively.
Keywords :
graph theory; text analysis; word processing; CNN corpus; TextRank algorithm; abstractive summaries; automatic text summarization; discourse information; extractive summaries; four dimension graph model; graph-based methods; graph-based summarization technique; information retrieval systems; question-answering systems; semantic similarity; sentence-based methods; text documents; text processing applications; word-based methods; Measurement uncertainty; Proposals; Semantics; Silicon; Text processing; Vectors; Graph-Model; Summarization; TextRank;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4799-2902-3
Type :
conf
DOI :
10.1109/WI-IAT.2013.55
Filename :
6690041