Title :
Extraction and classification of rhetorical sentences of experimental technical paper based on section class
Author :
Helen, Afrida ; Purwarianti, Ayu ; Widyantoro, Dwi H.
Author_Institution :
Sch. of Electr. Eng. & Inf., Bandung Inst. of Technol., Bandung, Indonesia
Abstract :
An extraction process of rhetorical sentences has become one of the problems in the study of automatic text summarization with a rhetorical sentence basis. Rhetorical sentence with high accuracy will be needed for producing a good summary. To improve the accuracy, this paper proposes a method in how to extract rhetorical sentences from experimental papers according to their category. The four main categories of experimental papers include problem, data, method, and result. Moreover, this paper proposes a section class as feature. We also calculated the frequency occurrence of rhetorical sentence in every section class. In our evaluation, we used tree algorithms including Naive Bayes, SVM and Decision Tree. Generally the SVM algorithm is proven to be better than the two other algorithms because the difference in value of the section class and non-section class feature is more reasonable. Overall, the rhetorical sentence extraction using section class has a better performance compared to those without class section.
Keywords :
Bayes methods; decision trees; support vector machines; text analysis; Naive Bayes; SVM algorithm; automatic text summarization; decision tree; extraction process; frequency occurrence; rhetorical sentence basis; rhetorical sentence extraction; rhetorical sentences; tree algorithms; Abstracts; Classification algorithms; Decision trees; Feature extraction; Support vector machines; Training data; Automatic text summarization; rhetorical category; rhetorical sentence; section;
Conference_Titel :
Information and Communication Technology (ICoICT), 2014 2nd International Conference on
Conference_Location :
Bandung
DOI :
10.1109/ICoICT.2014.6914099