Title :
Automatically Constructing a Domain Ontology for Document Classification
Author_Institution :
Southern Taiwan Univ. of Technol., Taipei
Abstract :
For document classification, a domain ontology is first constructed automatically using formal concept analysis. A document classification system, DCSO, is then proposed based on the domain ontology. The features of DCSO are automatic construction of the system using the theorem of formal concept analysis, an XML knowledge-based schema for document storage and quick search, and use of the hierarchical property of the ontology to support accurate document classification. Experimental results for DCSO show that the domain ontology is constructed automatically for document classification, that DCSO achieves good classification accuracy, and that the search time for DCSO increases only slightly and steadily even as the number of queried documents grows rapidly.
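The abstract does not give the construction procedure, so the following is only a minimal sketch of the general idea behind formal concept analysis, not the DCSO implementation. It assumes a toy context in which documents are the objects and the terms they contain are the attributes. The document names, terms, and the functions `formal_concepts` and `classify` are hypothetical and chosen for illustration.

```python
from itertools import combinations

# Hypothetical toy context: each document (object) maps to its terms (attributes).
context = {
    "doc1": {"ontology", "xml"},
    "doc2": {"ontology", "classification"},
    "doc3": {"ontology", "xml", "classification"},
}


def common_terms(docs, context):
    """Terms shared by every document in `docs` (all terms if `docs` is empty)."""
    shared = set().union(*context.values())
    for doc in docs:
        shared &= context[doc]
    return shared


def docs_with(terms, context):
    """Documents that contain every term in `terms`."""
    return {doc for doc, doc_terms in context.items() if terms <= doc_terms}


def formal_concepts(context):
    """Enumerate all (extent, intent) pairs by closing every subset of documents.

    This brute-force enumeration is exponential in the number of documents,
    so it is only suitable for small illustrative contexts.
    """
    concepts = set()
    docs = sorted(context)
    for size in range(len(docs) + 1):
        for subset in combinations(docs, size):
            intent = common_terms(subset, context)
            extent = docs_with(intent, context)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


def classify(terms, concepts):
    """Assumed classification rule: pick the most specific concept
    (largest intent) whose intent is fully covered by the new document's terms."""
    matching = [c for c in concepts if c[1] <= terms]
    return max(matching, key=lambda c: len(c[1]), default=None)


concepts = formal_concepts(context)
for extent, intent in sorted(concepts, key=lambda c: len(c[1])):
    print(sorted(intent), "->", sorted(extent))

match = classify({"ontology", "xml", "search"}, concepts)
```

Ordering the resulting concepts by inclusion of their extents gives a hierarchy (a concept lattice), which is the kind of structure an ontology hierarchy could be derived from. The `classify` rule here is an assumption for illustration; the abstract does not state how DCSO assigns documents to concepts.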
Keywords :
XML; document handling; knowledge based systems; ontologies (artificial intelligence); XML knowledge-based schema; document classification; documents storage; domain ontology; formal concept analysis; Abstracts; Cybernetics; Data mining; Information analysis; Information retrieval; Knowledge management; Machine learning; Ontologies; Terminology; Text analysis; Document classification; Formal concept analysis; Ontology;
Conference_Title :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
DOI :
10.1109/ICMLC.2007.4370465