• DocumentCode
    3300865
  • Title

    A clustering based approach for domain relevant relation extraction

  • Author

    Yang, Yuhang ; Lu, Qin ; Zhao, Tiejun

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin
  • fYear
    2008
  • fDate
    19-22 Oct. 2008
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Most existing corpus based relation extraction techniques focus on predefined relations. In this paper, a clustering based method is presented for domain relevant relation extraction including both relation type discovery and relation instance extraction. Given two raw corpora, one in the general domain, one in an application domain, domain specific verbs connecting different instances are extracted based on syntactic dependency as well as a small set of domain concept instance seeds. Relation types are then discovered based on verb clustering followed by relation instance extraction. The proposed approach requires no predefined relation types, no prior training of domain knowledge, and no need for manually annotated corpora. This method is applicable to any domain corpus and it is especially useful for knowledge-limited and resource-limited domains. Evaluations conducted on Chinese football domain for relation extraction show that the approach discovers various relations with good performance.
  • Keywords
    information retrieval; natural language processing; pattern clustering; Chinese football domain; corpus based relation extraction techniques; domain corpus; domain relevant relation extraction; relation instance extraction; relation type discovery; verb clustering; Algorithm design and analysis; Association rules; Computer science; Data mining; Internet; Joining processes; Ontologies; Pattern matching; Search engines; World Wide Web; Relation extraction; domain verb extraction; information extraction; relation type discovery; verb clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-4515-8
  • Electronic_ISBN
    978-1-4244-2780-2
  • Type

    conf

  • DOI
    10.1109/NLPKE.2008.4906782
  • Filename
    4906782