• DocumentCode
    2669234
  • Title

    Document categorisation by genetic algorithms

  • Author

    Liu, Chih-Hung ; Lu, Cheng-Che ; Lee, Wei-Po

  • Author_Institution
    Dept. of Manage. Inf. Syst., Nat. Pingtung Univ. of Sci. & Technol., Taiwan
  • Volume
    5
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    3868
  • Abstract
    Today, it is easy to provide information to and retrieve information from the Internet. However, the problem of information overload has to be overcome. One of the main issues to be addressed for the information overload problem is document classification. We present an evolutionary approach to automatically categorize documents into appropriate categories. Our approach deals with different categories of documents separately: it evolves a numerical list that consists of the corresponding weights of the feature words for each class of documents. Experimental results show that our approach can easily evolve the classifiers of numerical lists, and that the evolved classifiers perform better than those constructed by the traditional k-nearest neighbors approach
  • Keywords
    Internet; classification; genetic algorithms; information retrieval; pattern classification; Internet; document categorisation; document classification; evolutionary approach; feature words; genetic algorithms; information overload; k-nearest neighbors approach; numerical list; Buildings; Business; Data mining; Genetic algorithms; IP networks; Information filtering; Information filters; Information retrieval; Internet; Management information systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 2000 IEEE International Conference on
  • Conference_Location
    Nashville, TN
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-6583-6
  • Type

    conf

  • DOI
    10.1109/ICSMC.2000.886614
  • Filename
    886614