• DocumentCode
    2861797
  • Title

    Concept Learning of Text Documents

  • Author

    An, Jiyuan ; Chen, Yi-Ping Phoebe

  • Author_Institution
    Deakin University, Australia
  • fYear
    2004
  • fDate
    20-24 Sept. 2004
  • Firstpage
    698
  • Lastpage
    701
  • Abstract
    Concept learning of text documents can be viewed as the problem of acquiring the definition of a general category of documents. To definite the category of a text document, the Conjunctive of keywords is usually be used. These keywords should be fewer and comprehensible. A naïve method is enumerating all combinations of keywords to extract suitable ones. However, because of the enormous number of keyword combinations, it is impossible to extract the most relevant keywords to describe the categories of documents by enumerating all possible combinations of keywords. Many heuristic methods are proposed, such as GA-base, immune based algorithm. In this work, we introduce pruning power technique and propose a robust enumeration-based concept learning algorithm. Experimental results show that the rules produce by our approach has more comprehensible and simplicity than by other methods.
  • Keywords
    Australia Council; Bioinformatics; Clustering algorithms; Immune system; Information technology; Phase measurement; Robustness; Terminology; Web mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2100-2
  • Type

    conf

  • DOI
    10.1109/WI.2004.10069
  • Filename
    1410900