Concept Learning of Text Documents

Author

An, Jiyuan ; Chen, Yi-Ping Phoebe

Author_Institution

Deakin University, Australia

fYear

2004

fDate

20-24 Sept. 2004

Firstpage

698

Lastpage

701

Abstract

Concept learning of text documents can be viewed as the problem of acquiring the definition of a general category of documents. To definite the category of a text document, the Conjunctive of keywords is usually be used. These keywords should be fewer and comprehensible. A naïve method is enumerating all combinations of keywords to extract suitable ones. However, because of the enormous number of keyword combinations, it is impossible to extract the most relevant keywords to describe the categories of documents by enumerating all possible combinations of keywords. Many heuristic methods are proposed, such as GA-base, immune based algorithm. In this work, we introduce pruning power technique and propose a robust enumeration-based concept learning algorithm. Experimental results show that the rules produce by our approach has more comprehensible and simplicity than by other methods.

Keywords

Australia Council; Bioinformatics; Clustering algorithms; Immune system; Information technology; Phase measurement; Robustness; Terminology; Web mining;

fLanguage

English

Publisher

ieee

Conference_Titel

Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on

Print_ISBN

0-7695-2100-2

Type

conf

DOI

10.1109/WI.2004.10069

Filename

1410900

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2861797