Title :
Meaningful Inner Link Objects for Automatic Text Categorization
Author :
Shen, Jau-Ji ; Wu, Jia-Chiuan
Author_Institution :
Dept. of Manage. Inf. Syst., Nat. Chung Hsing Univ., Taichung, Taiwan
Abstract :
This paper presents a novel approach for automatic text categorization. The mainstream of the research on rule-based classifier regards document as a container of term, and generates rules by using the term distribution in documents. General speaking, there must be existed some kind of semantic relevance between term and paragraph in a document. We call it Meaningful Inner Link Objects-MILO which must be varied with different semantics of a document itself. While this paper concentrates on using these MILOs that associate with semantic relevance for text categorization, hence we focus on two problems: (1) finding the best MILOs which associate with semantic relevance; and (2) using these specific MILOs to build a classifier for text categorization. From the experiment results, our proposed classification approach base on MILO has a better accuracy while other state of the art technique without considering the relevance between term and paragraph.
Keywords :
classification; knowledge based systems; learning (artificial intelligence); text analysis; automatic text categorization; meaningful inner link objects; rule generation; rule-based classifier; semantic relevance; term distribution; Association rules; Classification tree analysis; Containers; Couplings; Databases; Decision trees; Management information systems; Multimedia systems; Signal processing; Text categorization; Text categorization; information retrieval; rule-based; term distribution;
Conference_Titel :
Intelligent Information Hiding and Multimedia Signal Processing, 2009. IIH-MSP '09. Fifth International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-4717-6
Electronic_ISBN :
978-0-7695-3762-7
DOI :
10.1109/IIH-MSP.2009.21