Title :
Citation sentence identification and classification for related work summarization
Author :
Widyantoro, Dwi H. ; Amin, Imaduddin
Author_Institution :
Inst. of Technol. Bandung, Bandung, Indonesia
Abstract :
Scientific article summarization is an important problem because it can help researchers, particularly those starting a new research topic. In this paper, we address the problem of related work summarization from scientific papers. The summarization process comprises extracting citation sentences and then classifying the rhetorical category of each citation sentence. Citation sentence extraction is performed by combining regular expression-based patterns, a co-reference system, an evidence-based approach and an additional extraction rule. Each citation sentence is represented as a feature vector containing term frequency, sentence length, thematic word and cue phrase feature groups. Classification models are learned using Naïve Bayes, Complement Naïve Bayes and Decision Tree. Experimental results reveal that the approaches adopted for citation sentence extraction and rhetorical category classification are promising as groundwork for related work summarization.
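As a rough illustration of the first two stages named in the abstract (regular expression-based citation detection and feature extraction), the following minimal Python sketch may help. The abstract does not give the paper's actual patterns, cue phrases or feature definitions, so the names `CITATION_PATTERNS`, `CUE_PHRASES`, `is_citation_sentence` and `extract_features` and all their contents are assumptions made for illustration only. The co-reference and evidence-based steps, the thematic word features and the classifiers are not modelled.

```python
import re
from collections import Counter

# Hypothetical patterns for two common citation styles; the patterns used
# in the paper are not listed in the abstract.
CITATION_PATTERNS = [
    # Numeric style: [3], [1, 2], [4-6]
    re.compile(r"\[\d+(?:\s*[,-]\s*\d+)*\]"),
    # Author-year style: (Smith, 2010), (Smith et al., 2010), (Smith and Lee, 2010)
    re.compile(r"\([A-Z][A-Za-z-]+(?: et al\.| and [A-Z][A-Za-z-]+)?,? \d{4}\)"),
]

# Hypothetical cue phrases; the paper's cue phrase list is not given.
CUE_PHRASES = ("proposed", "in contrast", "extends", "based on")


def is_citation_sentence(sentence):
    """Return True if the sentence contains an explicit citation marker."""
    return any(pattern.search(sentence) for pattern in CITATION_PATTERNS)


def extract_features(sentence):
    """Build a small feature dictionary: term frequency, length, cue phrase count."""
    lowered = sentence.lower()
    tokens = re.findall(r"[a-z]+", lowered)  # alphabetic tokens only
    return {
        "term_freq": Counter(tokens),
        "length": len(tokens),
        "cue_count": sum(lowered.count(cue) for cue in CUE_PHRASES),
    }
```

A sentence such as "Smith proposed a parser [3]." would be flagged by the numeric pattern, and its feature dictionary could then be passed to any of the classifiers named in the abstract.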
Keywords :
Bayes methods; citation analysis; decision trees; pattern classification; citation classification; citation sentence extraction; citation sentence identification; complement naive Bayes; coreference system; cue phrase feature groups; decision tree; evidence-based approach; regular expression-based patterns; related work summarization; rhetorical category classification; scientific article summarization; scientific papers; sentence length; term frequency; thematic word; Decision support systems; Vectors;
Conference_Title :
Advanced Computer Science and Information Systems (ICACSIS), 2014 International Conference on
DOI :
10.1109/ICACSIS.2014.7065871