Title :
Classification of documents by content
Author :
Jaillet, Simon ; Teisseire, Maguelonne ; Chauche, Jacques ; Prince, Violaine
Author_Institution :
LIRMM, Montpellier, France
Abstract :
This paper deals with the automated categorization (or classification) of texts into predefined categories. We present a concept-based approach in which texts are represented in a more semantic way than with the classical vector of terms. An agreeing measure is also defined in order to improve the categorization process. Experimental results show the benefit obtained with our proposal.
Keywords :
classification; computational linguistics; text analysis; automated text categorization; categorization process; classical vector; concept-based approach; digital documents; document classification; document content; predefined categories; semantical way; Cognitive informatics
Conference_Title :
Cognitive Informatics, 2003. Proceedings. The Second IEEE International Conference on
Print_ISBN :
0-7695-1986-5
DOI :
10.1109/COGINF.2003.1225983