Title :
Modeling textual document classification
Author :
Lam, Wai ; Ho, Chao Yang
Author_Institution :
Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Shatin, Hong Kong
fDate :
6/21/1905 12:00:00 AM
Abstract :
We investigate existing rule-based techniques for automatic textual document classification. The weakness of these techniques are identified. We propose a new technique known as the IBRI algorithm by unifying the strengths of rule-based and instance-based methods. Our algorithm adapts to the characteristic of text classification problems. Some experiments have been conducted to demonstrate the effectiveness of our IBRI algorithm. Moreover, we compare the performance with an existing rule-based and instance-based algorithms. The results show that our IBRI performs better most of the time
Keywords :
classification; data mining; document handling; visual databases; IBRI algorithm; experiments; instance-based methods; performance; rule-based techniques; text database; textual document classification modeling; Chaos; Decision trees; Ducts; Humans; Partitioning algorithms; Research and development management; Sampling methods; Spatial databases; Systems engineering and theory; Text categorization;
Conference_Titel :
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-7803-5731-0
DOI :
10.1109/ICSMC.1999.823355