DocumentCode
2669234
Title
Document categorisation by genetic algorithms
Author
Liu, Chih-Hung ; Lu, Cheng-Che ; Lee, Wei-Po
Author_Institution
Department of Management Information Systems, National Pingtung University of Science and Technology, Taiwan
Volume
5
fYear
2000
fDate
2000
Firstpage
3868
Abstract
Today, it is easy to provide information to and retrieve information from the Internet. However, the problem of information overload has to be overcome. One of the main issues to be addressed for the information overload problem is document classification. We present an evolutionary approach to automatically categorize documents into appropriate categories. Our approach deals with different categories of documents separately: it evolves a numerical list that consists of the corresponding weights of the feature words for each class of documents. Experimental results show that our approach can easily evolve the classifiers of numerical lists, and that the evolved classifiers perform better than those constructed by the traditional k-nearest neighbors approach.
Keywords
Internet; classification; genetic algorithms; information retrieval; pattern classification; document categorisation; document classification; evolutionary approach; feature words; information overload; k-nearest neighbors approach; numerical list; Buildings; Business; Data mining; IP networks; Information filtering; Information filters; Management information systems
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems, Man, and Cybernetics, 2000 IEEE International Conference on
Conference_Location
Nashville, TN
ISSN
1062-922X
Print_ISBN
0-7803-6583-6
Type
conf
DOI
10.1109/ICSMC.2000.886614
Filename
886614