Title :
Multi document summarization based on cross-document relation using voting technique
Author :
Kumar, Yogan Jaya ; Salim, Naomie ; Abuobieda, Albaraa ; Tawfik, Ameer
Author_Institution :
Fac. of Inf. & Commun. Technol., Univ. Teknikal Malaysia Melaka, Durian Tunggal, Malaysia
Abstract :
News articles which are available through online search often provide readers with large collection of texts. Especially in the case of news story, different news sources reporting on the same event usually returns multiple articles in response to a reader´s search. In this work, we first identify cross-document relations from un-annotated texts using Genetic-CBR approach. Following that, we develop a new sentence scoring model based on voting technique over the identified cross-document relations. Our experiments show that incorporating the proposed methods in the summarization process yields substantial improvement over the mainstream methods. The performances of all methods were evaluated using ROUGE - a standard evaluation metric used in text summarization.
Keywords :
case-based reasoning; information resources; information retrieval; text analysis; ROUGE; case base reasoning; cross-document relation; genetic-CBR approach; mainstream methods; multidocument summarization; news articles; news story; online search; sentence scoring model; text summarization; voting technique; Classification algorithms; Cognition; Computational modeling; Feature extraction; Genetics; Mathematical model; Support vector machines; Case-based reasoning; Cross-document relation; Genetic algorithm; Machine learning; Multi document summarization; Voting Technique;
Conference_Titel :
Computing, Electrical and Electronics Engineering (ICCEEE), 2013 International Conference on
Conference_Location :
Khartoum
Print_ISBN :
978-1-4673-6231-3
DOI :
10.1109/ICCEEE.2013.6634009