• DocumentCode
    2365386
  • Title

    Semantic Text Mining with Linked Data

  • Author

    Huang, Zhaohui ; Chen, Huajun ; Yu, Tong ; Sheng, Hao ; Luo, Zhaobo ; Mao, Yuxin

  • Author_Institution
    Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
  • fYear
    2009
  • fDate
    25-27 Aug. 2009
  • Firstpage
    338
  • Lastpage
    343
  • Abstract
    Linked Data is an open data space that emerges from the publication and interlinking of structured data on the Web using the Semantic Web technologies. How to utilize this wealth of data is currently a focused research theme of the Semantic Web community. In this paper, we aim to utilize Linked Data to generate semantic annotations for frequent patterns extracted from textual documents. First, we extract semantic relations from textual documents and merge them into a set of semantic graphs. Then, we apply a frequent subgraph discovery algorithm on the set of graphs to generate frequent patterns. Finally, we annotate the discovered patterns using Linked Data. Our approach can be applied in such domains as terrorist network analysis and biological network analysis. The efficacy of our approach is demonstrated through an empirical experiment that discovers and validates relationships between political figures from large number of news on the Web.
  • Keywords
    data mining; graph theory; information retrieval; semantic Web; text analysis; biological network analysis; frequent subgraph discovery algorithm; linked data; open data space; pattern extraction; semantic Web technologies; semantic annotations; semantic graphs; semantic text mining; structured data interlinking; structured data publication; terrorist network analysis; text documents; Computer science; Data engineering; Data mining; Educational institutions; OWL; Resource description framework; Semantic Web; Space technology; Text mining; Web pages; Entity Extraction; Frequent Subgraph; Linked Data; Massive Knowledge Relation; Semantic Graph; Semantic Graph Fusion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    INC, IMS and IDC, 2009. NCM '09. Fifth International Joint Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4244-5209-5
  • Electronic_ISBN
    978-0-7695-3769-6
  • Type

    conf

  • DOI
    10.1109/NCM.2009.131
  • Filename
    5331702