DocumentCode
2861797
Title
Concept Learning of Text Documents
Author
An, Jiyuan ; Chen, Yi-Ping Phoebe
Author_Institution
Deakin University, Australia
fYear
2004
fDate
20-24 Sept. 2004
Firstpage
698
Lastpage
701
Abstract
Concept learning of text documents can be viewed as the problem of acquiring the definition of a general category of documents. To definite the category of a text document, the Conjunctive of keywords is usually be used. These keywords should be fewer and comprehensible. A naïve method is enumerating all combinations of keywords to extract suitable ones. However, because of the enormous number of keyword combinations, it is impossible to extract the most relevant keywords to describe the categories of documents by enumerating all possible combinations of keywords. Many heuristic methods are proposed, such as GA-base, immune based algorithm. In this work, we introduce pruning power technique and propose a robust enumeration-based concept learning algorithm. Experimental results show that the rules produce by our approach has more comprehensible and simplicity than by other methods.
Keywords
Australia Council; Bioinformatics; Clustering algorithms; Immune system; Information technology; Phase measurement; Robustness; Terminology; Web mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2100-2
Type
conf
DOI
10.1109/WI.2004.10069
Filename
1410900
Link To Document