DocumentCode :
3208173
Title :
Automatic text categorization: case study
Author :
Corrêa, Renato Fernandes ; Ludermir, Teresa Bernarda
fYear :
2002
fDate :
2002
Firstpage :
150
Abstract :
Text categorization is a process of classifying documents with regard to a group of one or more existent categories according to themes or concepts present in their contents. The most common application of it is in information retrieval systems (IRS) to document indexing. A method to transform text categorization into a viable task is to use machine-learning algorithms to automate text classification, allowing it to be carried out fast, into concise manner and in broad range. The objective of this work is to present and compare the results of experiments on text categorization using artificial neural networks of multilayer perceptron and self-organizing map types, and traditional machine-learning algorithms used in this task: C4.5 decision tree, PART decision rules and Naive Bayes classifier.
Keywords :
classification; decision trees; information retrieval; learning (artificial intelligence); multilayer perceptrons; self-organising feature maps; text analysis; Naive Bayes classifier; PART decision rules; decision tree; document classification; document indexing; information retrieval systems; machine-learning; multilayer perceptron; neural networks; self-organizing maps; text categorization; Artificial neural networks; Computer aided software engineering; Decision trees; Electronic mail; Indexing; Information retrieval; Machine learning algorithms; Multilayer perceptrons; Self organizing feature maps; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2002. SBRN 2002. Proceedings. VII Brazilian Symposium on
Print_ISBN :
0-7695-1709-9
Type :
conf
DOI :
10.1109/SBRN.2002.1181457
Filename :
1181457
Link To Document :
بازگشت