• DocumentCode
    3102159
  • Title
    Automatic Domain-Ontology Relation Extraction from Semi-structured Texts
  • Author
    Xiao, Cheng ; Zheng, Dequan ; Yang, Yuhang ; Shao, Guojun
  • Author_Institution
    MOE-MS Key Lab. of Natural Language Process. & Speech, Harbin Inst. of Technol., Harbin, China
  • fYear
    2009
  • fDate
    7-9 Dec. 2009
  • Firstpage
    211
  • Lastpage
    216
  • Abstract
    This paper presents a new method for acquiring domain-ontology relations from semi-structured data sources. First, Web documents are obtained according to the co-occurrence of a concept instance and its attribute value. Next, formats of relation patterns are defined, and pattern instances are extracted from the Web documents, including pattern clustering and pattern combining within each cluster. Finally, the relation pattern instances are applied to obtain attribute values of new concept instances in the domain ontology. Experiments are carried out in the film domain: the rates of pattern incorrect-division and pattern leakage are 0.19% and 1.31%, respectively, and the highest precision of the combined relation patterns reaches 85%. The experimental results demonstrate that the method developed in this paper is fairly efficient.
  • Keywords
    information filtering; ontologies (artificial intelligence); text analysis; Web documents; automatic domain-ontology relation extraction; pattern clustering; pattern combining; pattern leakage; semistructured data sources; semistructured texts; Clustering algorithms; Data analysis; Data mining; Information analysis; Laboratories; Lattices; Natural language processing; Natural languages; Search engines; Terminology; ontology structure; pattern instance; relation extraction; semi-structure;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Asian Language Processing, 2009. IALP '09. International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-0-7695-3904-1
  • Type
    conf
  • DOI
    10.1109/IALP.2009.51
  • Filename
    5380767