Title :
Likelihood calculation classification for Indonesian language news documents
Author :
Rachmania, Aini ; Jaafar, Jafreezal ; Zamin, Norshuhani
Author_Institution :
Dept. of Comput. & Inf. Sci., Univ. Teknol. PETRONAS, Tronoh, Malaysia
Abstract :
Text categorization is an important research area that seeks to classify textual documents into a set of predetermined categories. Unfortunately, Indonesian news classification has received very little attention. In this paper, we propose a text categorization algorithm based on the Bracewell method, which uses a likelihood calculation between the article and the category's keywords. In experiments, the algorithm classified an Indonesian news corpus with accuracy as high as 93.84% in an offline environment, 93.82% in an online environment, and 80% when benchmarked against human evaluation.
Keywords :
classification; information retrieval; natural language processing; text analysis; Bracewell method; Indonesian language news document; Indonesian news classification; article keywords; category keywords; likelihood calculation classification; text categorization; textual document; Indonesian documents; likelihood calculation; news domain;
Conference_Title :
Information Technology and Electrical Engineering (ICITEE), 2013 International Conference on
Conference_Location :
Yogyakarta
Print_ISBN :
978-1-4799-0423-5
DOI :
10.1109/ICITEED.2013.6676229